On autoencoders in the i-vector space for speaker recognition

Timur Pekhovsky, Sergey Novoselov, Aleksei Sholohov, Oleg Kudashev

We present a detailed empirical investigation of the speaker verification system based on a denoising autoencoder (DAE) in the i-vector space, first proposed in [1]. This paper describes the system and discusses practical issues of its training. The aim of the investigation is to study the properties of the DAE in the i-vector space and to analyze different strategies for initializing and training the back-end parameters. We also propose several improvements to the system that increase its accuracy. Finally, we demonstrate the potential of the proposed system under domain mismatch: it achieves a considerable gain in performance over the baseline system in the unsupervised domain adaptation scenario on the NIST 2010 SRE task.

Odyssey 2016

The Speaker and Language Recognition Workshop

Related Recordings

LID-senone Extraction via Deep Neural Networks for End-to-End Language Identification

Channel Compensation for Speaker Recognition using MAP Adapted PLDA and Denoising DNNs