InterSpeech 2021

Speech Localization, Enhancement, and Quality Assessment

PILOT: Introducing Transformers for Probabilistic Sound Event Localization
(3-minute introduction)

Christopher Schymura (Ruhr-Universität Bochum, Germany), Benedikt Bönninghoff (Ruhr-Universität Bochum, Germany), Tsubasa Ochiai (NTT, Japan), Marc Delcroix (NTT, Japan), Keisuke Kinoshita (NTT, Japan), Tomohiro Nakatani (NTT, Japan), Shoko Araki (NTT, Japan), Dorothea Kolossa (Ruhr-Universität Bochum, Germany)

Assessment of von Mises-Bernoulli Deep Neural Network in Sound Source Localization
(3-minute introduction)

Katsutoshi Itoyama (Tokyo Tech, Japan), Yoshiya Morimoto (Tokyo Tech, Japan), Shungo Masaki (Tokyo Tech, Japan), Ryosuke Kojima (Kyoto University, Japan), Kenji Nishida (Tokyo Tech, Japan), Kazuhiro Nakadai (Tokyo Tech, Japan)

Far-field Speaker Localization and Adaptive GLMB Tracking
(3-minute introduction)

Shoufeng Lin (Curtin University, Australia), Zhaojie Luo (Osaka University, Japan)

Far-field Speaker Localization and Adaptive GLMB Tracking
(longer introduction)

Shoufeng Lin (Curtin University, Australia), Zhaojie Luo (Osaka University, Japan)