A Semisupervised Approach for Language Identification based on Ladder Networks

Ehud Ben-Reuven, Jacob Goldberger

In this study we address the problem of training a neural network for language identification using both labeled and unlabeled speech samples represented as i-vectors. We propose a neural network architecture that can also handle out-of-set languages. We use a modified version of the recently proposed Ladder Network semi-supervised training procedure, which optimizes the reconstruction costs of a stack of denoising autoencoders. We show that this approach can be applied successfully when the training dataset contains both labeled and unlabeled acoustic data. The results show improved language identification on the NIST 2015 language identification dataset.
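To make the training objective mentioned in the abstract concrete, here is a minimal sketch of a generic Ladder Network style cost: a supervised cross-entropy term averaged over the labeled samples, plus weighted denoising reconstruction costs averaged over all samples, labeled or not. It is not the authors' modified procedure. The function names, the dictionary layout, and the per-layer weights are hypothetical, chosen only for illustration, and the activations are assumed to have been computed elsewhere by the clean and noisy passes of the network.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of class scores."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Negative log-probability assigned to the true class."""
    return -math.log(softmax(logits)[label])


def squared_error(clean, reconstructed):
    """Mean squared error between a clean activation vector and its reconstruction."""
    return sum((c - r) ** 2 for c, r in zip(clean, reconstructed)) / len(clean)


def ladder_cost(samples, layer_weights):
    """Combined semi-supervised cost (illustrative assumption, not the paper's exact form).

    Each sample is a dict with:
      'logits': class scores from the noisy forward pass
      'label' : int class index, or None for an unlabeled sample
      'clean' : list of per-layer activation vectors from the clean pass
      'recon' : list of per-layer reconstructions from the decoder
    """
    labeled = [s for s in samples if s["label"] is not None]
    supervised = 0.0
    if labeled:
        supervised = sum(cross_entropy(s["logits"], s["label"]) for s in labeled) / len(labeled)

    # Reconstruction costs use every sample; this is where unlabeled data contributes.
    unsupervised = 0.0
    for s in samples:
        for weight, clean, recon in zip(layer_weights, s["clean"], s["recon"]):
            unsupervised += weight * squared_error(clean, recon)
    unsupervised /= len(samples)

    return supervised + unsupervised


samples = [
    {"logits": [0.0, 0.0], "label": 0, "clean": [[1.0, 2.0]], "recon": [[1.0, 2.0]]},
    {"logits": [0.3, -0.1], "label": None, "clean": [[1.0, 2.0]], "recon": [[1.0, 0.0]]},
]
total_cost = ladder_cost(samples, layer_weights=[0.5])
```

In this toy input the unlabeled sample affects only the reconstruction term, which is the mechanism that lets unlabeled i-vectors shape the learned representation. One possible way to handle out-of-set languages, again an assumption rather than the architecture described in the talk, would be to reserve one extra class index for them in `logits`.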

Odyssey 2016

The Speaker and Language Recognition Workshop

Related Recordings

Out-of-Set i-Vector Selection for Open-set Language Identification

I2R Submission to the 2015 NIST Language Recognition I-vector Challenge