InterSpeech 2021

Feature, Embedding and Neural Architecture for Speaker Recognition

Bidirectional Multiscale Feature Aggregation for Speaker Verification
(3 minutes introduction)

Jiajun Qi (USTC, China), Wu Guo (USTC, China), Bin Gu (USTC, China)

Improving Time Delay Neural Network Based Speaker Recognition With Convolutional Block And Feature Aggregation Methods
(3 minutes introduction)

Yu-Jia Zhang (National Sun Yat-sen University, Taiwan), Yih-Wen Wang (National Sun Yat-sen University, Taiwan), Chia-Ping Chen (National Sun Yat-sen University, Taiwan), Chung-Li Lu (Chunghwa Telecom Laboratories, Taiwan), Bo-Cheng Chan (Chunghwa Telecom Laboratories, Taiwan)

Improving Time Delay Neural Network Based Speaker Recognition With Convolutional Block And Feature Aggregation Methods
(longer introduction)

Yu-Jia Zhang (National Sun Yat-sen University, Taiwan), Yih-Wen Wang (National Sun Yat-sen University, Taiwan), Chia-Ping Chen (National Sun Yat-sen University, Taiwan), Chung-Li Lu (Chunghwa Telecom Laboratories, Taiwan), Bo-Cheng Chan (Chunghwa Telecom Laboratories, Taiwan)

Binary Neural Network for Speaker Verification
(3 minutes introduction)

Tinglong Zhu (Duke Kunshan University, China), Xiaoyi Qin (Duke Kunshan University, China), Ming Li (Duke Kunshan University, China)

Y-Vector: Multiscale Waveform Encoder for Speaker Embedding
(3 minutes introduction)

Ge Zhu (University of Rochester, USA), Fei Jiang (University of Rochester, USA), Zhiyao Duan (University of Rochester, USA)

Phoneme-aware and Channel-wise Attentive Learning for Text Dependent Speaker Verification
(3 minutes introduction)

Yan Liu (Xiamen University, China), Zheng Li (Xiamen University, China), Lin Li (Xiamen University, China), Qingyang Hong (Xiamen University, China)

Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding
(3 minutes introduction)

Hongning Zhu (NUS, Singapore), Kong Aik Lee (A*STAR, Singapore), Haizhou Li (NUS, Singapore)