On the use of GSV-SVM for Speaker Diarization and Tracking

SESSION 6: Diarization

Added: 14 July 2010 11:08, Authors: Viet Bac Le (LIMSI-CNRS), Claude Barras (LIMSI-CNRS, Univ. Paris-Sud), Marc Ferras (LIMSI-CNRS), Length: 0:21:58

In this paper, we present the use of Gaussian Supervectors with Support Vector Machine classifiers (GSV-SVM) in an acoustic speaker diarization system and a speaker tracking system, and compare them with a standard Gaussian Mixture Model system based on adapted Universal Background Models (GMM-UBM). The GSV-SVM systems, which share the adaptation step with the GMM-UBM systems, are observed to have comparable performance: for acoustic speaker diarization, the GMM-UBM system outperforms the GSV-SVM system on ESTER2 data, whereas the GSV-SVM system performs better for speaker tracking. Moreover, a linear combination of the two systems at the score level outperforms each individual system.
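
The abstract does not give implementation details, so the following is only a minimal sketch of the two ideas it names: building a Gaussian supervector from a UBM adapted to a speech segment, and combining two systems linearly at the score level. It assumes a diagonal-covariance UBM, mean-only MAP adaptation with a relevance factor of 16, and the commonly used scaling of each mean by sqrt(weight)/standard deviation; none of these settings is confirmed by the source. The function names (`map_adapt_means`, `supervector`, `fuse_scores`) and the fusion weight are hypothetical, and SVM training on the supervectors is left out.

```python
import numpy as np


def map_adapt_means(frames, ubm_weights, ubm_means, ubm_vars, relevance=16.0):
    """MAP-adapt the means of a diagonal-covariance UBM to one segment.

    frames: (T, D) feature vectors; ubm_weights: (C,);
    ubm_means, ubm_vars: (C, D). relevance=16 is an assumed default.
    """
    # Log-likelihood of every frame under every Gaussian: shape (T, C).
    diff = frames[:, None, :] - ubm_means[None, :, :]
    log_gauss = -0.5 * np.sum(
        diff ** 2 / ubm_vars + np.log(2.0 * np.pi * ubm_vars), axis=2
    )
    log_post = np.log(ubm_weights) + log_gauss
    log_post -= log_post.max(axis=1, keepdims=True)  # numerical stability
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)

    # Zeroth- and first-order statistics per Gaussian.
    counts = post.sum(axis=0)        # (C,)
    first_order = post.T @ frames    # (C, D)
    segment_means = first_order / np.maximum(counts, 1e-10)[:, None]

    # Interpolate between segment means and UBM means; components that
    # saw little data stay close to the UBM.
    coeff = (counts / (counts + relevance))[:, None]
    return coeff * segment_means + (1.0 - coeff) * ubm_means


def supervector(adapted_means, ubm_weights, ubm_vars):
    """Stack the scaled adapted means into one vector of length C * D."""
    scaled = np.sqrt(ubm_weights)[:, None] * adapted_means / np.sqrt(ubm_vars)
    return scaled.ravel()


def fuse_scores(gmm_ubm_scores, gsv_svm_scores, weight=0.5):
    """Linear score-level combination; weight=0.5 is an arbitrary choice."""
    a = np.asarray(gmm_ubm_scores, dtype=float)
    b = np.asarray(gsv_svm_scores, dtype=float)
    return weight * a + (1.0 - weight) * b


# Toy usage with a 2-component, 2-dimensional UBM and synthetic frames.
rng = np.random.default_rng(0)
toy_weights = np.array([0.5, 0.5])
toy_means = np.array([[0.0, 0.0], [5.0, 5.0]])
toy_vars = np.ones((2, 2))
toy_frames = rng.normal(loc=[5.0, 5.0], scale=1.0, size=(200, 2))

toy_adapted = map_adapt_means(toy_frames, toy_weights, toy_means, toy_vars)
toy_sv = supervector(toy_adapted, toy_weights, toy_vars)
toy_fused = fuse_scores([1.0, 0.0], [0.0, 1.0], weight=0.5)
```

In practice the supervectors of all segments would be fed to a linear SVM, and the two systems' scores would likely need to be normalised to a comparable range before fusion, with the fusion weight tuned on development data; both points are assumptions rather than details taken from the paper.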


  Lecture information

Views: 988
Video resolution: 720x576 px
Audio track: MP3 [7.54 MB], 0:21:58