InterSpeech 2021

Speech Recognition of Atypical Speech

Automatic Speech Recognition of Disordered Speech: Personalized models outperforming human listeners on short phrases
(Oral presentation)

Jordan R. Green (MGH Institute of Health Professions, USA), Robert L. MacDonald (Google, USA), Pan-Pan Jiang (Google, USA), Julie Cattiau (Google, USA), Rus Heywood (Google, USA), Richard Cave (MND Association, UK), Katie Seaver (MGH Institute of Health Professions, USA), Marilyn A. Ladewig (Cerebral Palsy Associations of New York State, USA), Jimmy Tobin (Google, USA), Michael P. Brenner (Google, USA), Philip C. Nelson (Google, USA), Katrin Tomanek (Google, USA)

Investigating the Utility of Multimodal Conversational Technology and Audiovisual Analytic Measures for the Assessment and Monitoring of Amyotrophic Lateral Sclerosis at Scale
(Oral presentation)

Michael Neumann (Modality.AI, USA), Oliver Roesler (Modality.AI, USA), Jackson Liscombe (Modality.AI, USA), Hardik Kothare (Modality.AI, USA), David Suendermann-Oeft (Modality.AI, USA), David Pautler (Modality.AI, USA), Indu Navar (Peter Cohen Foundation, USA), Aria Anvar (Peter Cohen Foundation, USA), Jochen Kumm (Pr3vent, USA), Raquel Norel (IBM, USA), Ernest Fraenkel (MIT, USA), Alexander V. Sherman (MGH Institute of Health Professions, USA), James D. Berry (MGH Institute of Health Professions, USA), Gary L. Pattee (University of Nebraska, USA), Jun Wang (University of Texas at Austin, USA), Jordan R. Green (MGH Institute of Health Professions, USA), Vikram Ramanarayanan (Modality.AI, USA)

Handling acoustic variation in dysarthric speech recognition systems through model combination
(Oral presentation)

Enno Hermann (Idiap Research Institute, Switzerland), Mathew Magimai-Doss (Idiap Research Institute, Switzerland)

Adversarial Data Augmentation for Disordered Speech Recognition
(Oral presentation)

Zengrui Jin (CUHK, China), Mengzhe Geng (CUHK, China), Xurong Xie (CAS, China), Jianwei Yu (CUHK, China), Shansong Liu (CUHK, China), Xunying Liu (CUHK, China), Helen Meng (CUHK, China)

Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition
(Oral presentation)

Xurong Xie (CAS, China), Rukiye Ruzi (CAS, China), Xunying Liu (CUHK, China), Lan Wang (CAS, China)

Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition
(Oral presentation)

Jiajun Deng (CUHK, China), Fabian Ritter Gutierrez (CUHK, China), Shoukang Hu (CUHK, China), Mengzhe Geng (CUHK, China), Xurong Xie (CAS, China), Zi Ye (CUHK, China), Shansong Liu (CUHK, China), Jianwei Yu (CUHK, China), Xunying Liu (CUHK, China), Helen Meng (CUHK, China)

A Voice-Activated Switch for Persons with Motor and Speech Impairments: Isolated-Vowel Spotting Using Neural Networks
(Oral presentation)

Shanqing Cai (Google, USA), Lisie Lillianfeld (Google, USA), Katie Seaver (Google, USA), Jordan R. Green (Google, USA), Michael P. Brenner (Google, USA), Philip C. Nelson (Google, USA), D. Sculley (Google, USA)

Conformer Parrotron: a Faster and Stronger End-to-end Speech Conversion and Recognition Model for Atypical Speech
(Oral presentation)

Zhehuai Chen (Google, USA), Bhuvana Ramabhadran (Google, USA), Fadi Biadsy (Google, USA), Xia Zhang (Google, USA), Youzheng Chen (Google, USA), Liyang Jiang (Google, USA), Fang Chu (Google, USA), Rohan Doshi (Google, USA), Pedro J. Moreno (Google, USA)

Disordered Speech Data Collection: Lessons Learned at 1 Million Utterances from Project Euphonia
(Oral presentation)

Robert L. MacDonald (Google, USA), Pan-Pan Jiang (Google, USA), Julie Cattiau (Google, USA), Rus Heywood (Google, USA), Richard Cave (MND Association, UK), Katie Seaver (MGH Institute of Health Professions, USA), Marilyn A. Ladewig (Cerebral Palsy Associations of New York State, USA), Jimmy Tobin (Google, USA), Michael P. Brenner (Google, USA), Philip C. Nelson (Google, USA), Jordan R. Green (MGH Institute of Health Professions, USA), Katrin Tomanek (Google, USA)

Comparing Supervised Models And Learned Speech Representations For Classifying Intelligibility Of Disordered Speech On Selected Phrases
(Oral presentation)

Subhashini Venugopalan (Google, USA), Joel Shor (Google, Japan), Manoj Plakal (Google, USA), Jimmy Tobin (Google, USA), Katrin Tomanek (Google, USA), Jordan R. Green (MGH Institute of Health Professions, USA), Michael P. Brenner (Google, USA)

Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
(Oral presentation)

Vikramjit Mitra (Apple, USA), Zifang Huang (Apple, USA), Colin Lea (Apple, USA), Lauren Tooley (Apple, USA), Sarah Wu (Apple, USA), Darren Botten (Apple, USA), Ashwini Palekar (Apple, USA), Shrinath Thelapurath (Apple, USA), Panayiotis Georgiou (Apple, USA), Sachin Kajarekar (Apple, USA), Jeffrey Bigham (Apple, USA)