InterSpeech 2021

Speaker Diarization II

LEAP Submission for the Third DIHARD Diarization Challenge
(3 minutes introduction)

Prachi Singh (Indian Institute of Science, India), Rajat Varma (Indian Institute of Science, India), Venkat Krishnamohan (Indian Institute of Science, India), Srikanth Raj Chetupalli (Indian Institute of Science, India), Sriram Ganapathy (Indian Institute of Science, India)

LEAP Submission for the Third DIHARD Diarization Challenge
(longer introduction)

Prachi Singh (Indian Institute of Science, India), Rajat Varma (Indian Institute of Science, India), Venkat Krishnamohan (Indian Institute of Science, India), Srikanth Raj Chetupalli (Indian Institute of Science, India), Sriram Ganapathy (Indian Institute of Science, India)

Investigation of Spatial-Acoustic Features for Overlapping Speech Detection in Multiparty Meetings
(3 minutes introduction)

Shiliang Zhang (Alibaba, China), Siqi Zheng (Alibaba, China), Weilong Huang (Alibaba, China), Ming Lei (Alibaba, China), Hongbin Suo (Alibaba, China), Jinwei Feng (Alibaba, USA), Zhijie Yan (Alibaba, China)

Target-Speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker
(3 minutes introduction)

Maokui He (USTC, China), Desh Raj (Johns Hopkins University, USA), Zili Huang (Johns Hopkins University, USA), Jun Du (USTC, China), Zhuo Chen (Microsoft, USA), Shinji Watanabe (Johns Hopkins University, USA)

ECAPA-TDNN Embeddings for Speaker Diarization
(3 minutes introduction)

Nauman Dawalatabad (IIT Madras, India), Mirco Ravanelli (Mila, Canada), François Grondin (Université de Sherbrooke, Canada), Jenthe Thienpondt (Ghent University, Belgium), Brecht Desplanques (Ghent University, Belgium), Hwidong Na (Samsung, Korea)

Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech
(3 minutes introduction)

Keisuke Kinoshita (NTT, Japan), Marc Delcroix (NTT, Japan), Naohiro Tawara (NTT, Japan)

Anonymous speaker clusters: Making distinctions between anonymised speech recordings with clustering interface
(3 minutes introduction)

Benjamin O’Brien (LPL (UMR 7309), France), Natalia Tomashenko (LIA (EA 4128), France), Anaïs Chanclu (LIA (EA 4128), France), Jean-François Bonastre (LIA (EA 4128), France)

Anonymous speaker clusters: Making distinctions between anonymised speech recordings with clustering interface
(longer introduction)

Benjamin O’Brien (LPL (UMR 7309), France), Natalia Tomashenko (LIA (EA 4128), France), Anaïs Chanclu (LIA (EA 4128), France), Jean-François Bonastre (LIA (EA 4128), France)

Speaker Diarization using Two-pass GMM PLDA Clustering of DNN Embeddings
(3 minutes introduction)

Kiran Karra (Johns Hopkins University, USA), Alan McCree (Johns Hopkins University, USA)