SuperLectures.com

CLUSTERING OF BOOTSTRAPPED ACOUSTIC MODEL WITH FULL COVARIANCE

Acoustic Modeling

Full Paper at IEEE Xplore

Přednášející: Xin Chen, Autoři: Xin Chen, University of Missouri, United States; Xiaodong Cui, Jian Xue, Peder Olsen, IBM, United States; John Hersey, Mitsubishi, United States; Bowen Zhou, IBM, United States; Yunxin Zhao, University of Missouri, United States

HMM-based acoustic models built from bootstrap are generally very large, especially when full covariance matrices are used for Gaussians. Therefore, clustering is needed to compact the acoustic model to a reasonable size for practical applications. This paper discusses and investigates multiple distance measurements and algorithms for the clustering. The distance measurements include Entropy, KL, Bhattacharyya, Chernoff and their weighted versions. For clustering algorithms, besides conventional greedy bottom-up, algorithms such as N-Best distance Refinement (NBR), K-step Look-Ahead (KLA), Breadth-First Searched (BFS) best path are proposed. A two-pass optimization approach is also proposed to improve the model structure. Experiments in the Bootstrap and Restructuring (BSRS) framework on Pashto show that the discussed clustering approach can lead to better quality of the restructured model. It also shows that final acoustic model that is diagonalized from the full covariance yields good improvement over BSRS model directly with diagonal model and yields significant improvement over the conventional diagonal model.


  Přepis řeči

|

  Slajdy

Zvětšit slajd | Zobrazit všechny slajdy

0:00:29

  1. slajd

0:00:43

  2. slajd

0:01:38

  3. slajd

0:02:19

  4. slajd

0:03:11

  5. slajd

0:03:43

  6. slajd

0:04:07

  7. slajd

0:04:44

  8. slajd

0:05:10

  9. slajd

0:05:49

 10. slajd

0:06:51

 11. slajd

0:07:29

 12. slajd

0:08:31

 13. slajd

0:09:33

 14. slajd

0:10:16

 15. slajd

0:11:25

 16. slajd

0:12:29

 17. slajd

0:13:53

 18. slajd

0:14:12

 19. slajd

0:15:23

 20. slajd

0:16:17

 21. slajd

0:16:39

     3. slajd

0:17:59

    10. slajd

0:20:12

    11. slajd

  Komentáře

Please sign in to post your comment!

  Informace o přednášce

Nahráno: 2011-05-25 14:25 - 14:45, Panorama
Přidáno: 15. 6. 2011 16:00
Počet zhlédnutí: 345
Rozlišení videa: 1024x576 px, 512x288 px
Délka videa: 0:20:42
Audio stopa: MP3 [7.00 MB], 0:20:42