SuperLectures.com

SIMPLIFICATION AND OPTIMIZATION OF I-VECTOR EXTRACTION

Miscellaneous Speaker Identification

Full Paper at IEEE Xplore

Přednášející: Ondřej Glembek, Autoři: Ondrej Glembek, Lukas Burget, Pavel Matejka, Martin Karafiát, Brno University of Technology, Czech Republic; Patrick Kenny, CRIM, Canada

This paper introduces some simplifications to the i-vector speaker recognition systems. I-vector extraction as well as training of the i-vector extractor can be an expensive task both in terms of memory and speed. Under certain assumptions, the formulas for i-vector extraction---also used in i-vector extractor training---can be simplified and lead to a faster and memory more efficient code. The first assumption is that the GMM component alignment is constant across utterances and is given by the UBM GMM weights. The second assumption is that the i-vector extractor matrix can be linearly transformed so that its per-Gaussian components are orthogonal. We use PCA and HLDA to estimate this transform.


  Přepis řeči

|

  Slajdy

Zvětšit slajd | Zobrazit všechny slajdy

0:00:20

  1. slajd

0:00:42

  2. slajd

0:01:47

  3. slajd

0:02:23

  4. slajd

0:03:24

  5. slajd

0:05:23

  6. slajd

0:05:44

  7. slajd

0:07:15

  8. slajd

0:08:35

  9. slajd

0:09:56

 10. slajd

0:10:37

 11. slajd

0:11:44

 12. slajd

0:13:10

 13. slajd

0:13:51

 14. slajd

0:14:45

 15. slajd

0:15:40

 16. slajd

0:16:25

 17. slajd

0:16:58

 18. slajd

  Komentáře

Please sign in to post your comment!

  Informace o přednášce

Nahráno: 2011-05-25 16:35 - 16:55, Panorama
Přidáno: 15. 6. 2011 16:40
Počet zhlédnutí: 66
Rozlišení videa: 1024x576 px, 512x288 px
Délka videa: 0:20:08
Audio stopa: MP3 [6.81 MB], 0:20:08