Unsupervised Domain Adaptation for I-Vector Speaker Recognition

Daniel Garcia-Romero, Alan McCree, Stephen Shum, Niko Brummer and Carlos Vaquero

In this paper, we present a framework for unsupervised domain adaptation of PLDA based i-vector speaker recognition systems. Given an existing out-of-domain PLDA system, we use it to cluster unlabeled in-domain data, and then use this data to adapt the parameters of the PLDA system. We explore two versions of agglomerative hierarchical clustering that use the PLDA system. We also study two automatic ways to determine the number of clusters in the in-domain dataset. The proposed techniques are experimentally validated in the recently introduced domain adaptation challenge. This challenge provides a very useful setup to explore domain adaptation since it illustrates a significant performance gap between an in-domain and out-of-domain system. Using agglomerative hierarchical clustering with a stopping criterion based on unsupervised calibration we are able to recover 85% of this gap.

Odyssey 2014

The Speaker and Language Recognition Workshop

Unsupervised Domain Adaptation for I-Vector Speaker Recognition

Search in Audio

Speech Transcript

Related Recordings

Unsupervised Clustering Approaches for Domain Adaptation in Speaker Recognition Systems

Generative pairwise models for speaker recognition