Binary Neural Network for Speaker Verification <BR>(3 minutes introduction)

Binary Neural Network for Speaker Verification
(3 minutes introduction)

Tinglong Zhu (Duke Kunshan University, China), Xiaoyi Qin (Duke Kunshan University, China), Ming Li (Duke Kunshan University, China)

Although deep neural networks are successful for many tasks in the speech domain, the high computational and memory costs of deep neural networks make it difficult to directly deploy high-performance Neural Network systems on low-resource embedded devices. There are several mechanisms to reduce the size of the neural networks i.e. parameter pruning, parameter quantization, etc. This paper focuses on how to apply binary neural networks to the task of speaker verification. The proposed binarization of training parameters can largely maintain the performance while significantly reducing storage space requirements and computational costs. Experiment results show that, after binarizing the Convolutional Neural Network, the ResNet34-based network achieves an EER of around 5% on the ''Voxceleb1'' testing dataset and even outperforms the traditional real number network on the text-dependent dataset: ''Xiaole'' while having a 32× memory saving.

Search in Audio

Related Recordings

Improving Time Delay Neural Network Based Speaker Recognition With Convolutional Block And Feature Aggregation Methods
(longer introduction)

Yu-Jia Zhang , Yih-Wen Wang , Chia-Ping Chen , Chung-Li Lu , Bo-Cheng Chan

Mutual Information Enhanced Training for Speaker Embedding
(3 minutes introduction)

Youzhi Tu , Man-Wai Mak

InterSpeech 2021

Binary Neural Network for Speaker Verification (3 minutes introduction)

Search in Audio

Related Recordings

Improving Time Delay Neural Network Based Speaker Recognition With Convolutional Block And Feature Aggregation Methods (longer introduction)

Mutual Information Enhanced Training for Speaker Embedding (3 minutes introduction)

Binary Neural Network for Speaker Verification
(3 minutes introduction)

Improving Time Delay Neural Network Based Speaker Recognition With Convolutional Block And Feature Aggregation Methods
(longer introduction)

Mutual Information Enhanced Training for Speaker Embedding
(3 minutes introduction)