InterSpeech 2021

Self-supervision and semi-supervision for neural ASR training

On the Learning Dynamics of Semi-Supervised Training for ASR
(3 minutes introduction)

Electra Wallington (University of Edinburgh, UK), Benji Kershenbaum (University of Edinburgh, UK), Ondřej Klejch (University of Edinburgh, UK), Peter Bell (University of Edinburgh, UK)

Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
(3 minutes introduction)

Wei-Ning Hsu (Facebook, USA), Anuroop Sriram (Facebook, USA), Alexei Baevski (Facebook, USA), Tatiana Likhomanenko (Facebook, USA), Qiantong Xu (Facebook, USA), Vineel Pratap (Facebook, USA), Jacob Kahn (Facebook, USA), Ann Lee (Facebook, USA), Ronan Collobert (Facebook, USA), Gabriel Synnaeve (Facebook, France), Michael Auli (Facebook, USA)

Semi-Supervision in ASR: Sequential MixMatch and Factorized TTS-Based Augmentation
(3 minutes introduction)

Zhehuai Chen (Google, USA), Andrew Rosenberg (Google, USA), Yu Zhang (Google, USA), Heiga Zen (Google, Japan), Mohammadreza Ghodsi (Google, USA), Yinghui Huang (Google, USA), Jesse Emond (Google, USA), Gary Wang (Google, USA), Bhuvana Ramabhadran (Google, USA), Pedro J. Moreno (Google, USA)

Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS
(3 minutes introduction)

Yan Deng (Microsoft, China), Rui Zhao (Microsoft, USA), Zhong Meng (Microsoft, USA), Xie Chen (Microsoft, USA), Bing Liu (Microsoft, China), Jinyu Li (Microsoft, USA), Yifan Gong (Microsoft, USA), Lei He (Microsoft, China)