Parallel Acoustic Model Adaptation for Improving Phonotactic Language Recognition

SESSION 10: Language recognition – phonotactics

Added: 14. 7. 2010 11:08, Author: Cheung Chi Leung, Bin Ma, Haizhou Li (Institute for Infocomm Research), Length: 0:17:37

In phonotactic language recognition systems, the use of acoustic model adaptation prior to phone lattice decoding has been proposed to deal with the mismatch between training and test conditions. In this paper, a novel approach using diversified phonotactic features from parallel acoustic model adaptation is proposed. Specifically, the parallel model adaptation involves independent mean-only and variance-only MLLR adaptation. A quantitative method to measure the diversity between two sets of high-dimensional phonotactic features is introduced. Our experiment shows that this novel approach achieves an EER of 3.07% in the 30-second condition of the 2007 NIST Language Recognition Evaluation (LRE) tasks. It brings a 17.3% relative improvement in EER over the baseline system using a SAT phone model and CMLLR for model adaptation.

  Speech Transcript



Please sign in to post your comment!

  Lecture Information

Number of views: 387
Video resolution: 720x576 px
Audio track: MP3 [6.05 MB], 0:17:37