SuperLectures.com

CLUSTERING OF BOOTSTRAPPED ACOUSTIC MODEL WITH FULL COVARIANCE

Acoustic Modeling

Full Paper at IEEE Xplore

Presented by: Xin Chen, Author(s): Xin Chen, University of Missouri, United States; Xiaodong Cui, Jian Xue, Peder Olsen, IBM, United States; John Hersey, Mitsubishi, United States; Bowen Zhou, IBM, United States; Yunxin Zhao, University of Missouri, United States

HMM-based acoustic models built from bootstrap are generally very large, especially when full covariance matrices are used for Gaussians. Therefore, clustering is needed to compact the acoustic model to a reasonable size for practical applications. This paper discusses and investigates multiple distance measurements and algorithms for the clustering. The distance measurements include Entropy, KL, Bhattacharyya, Chernoff and their weighted versions. For clustering algorithms, besides conventional greedy bottom-up, algorithms such as N-Best distance Refinement (NBR), K-step Look-Ahead (KLA), Breadth-First Searched (BFS) best path are proposed. A two-pass optimization approach is also proposed to improve the model structure. Experiments in the Bootstrap and Restructuring (BSRS) framework on Pashto show that the discussed clustering approach can lead to better quality of the restructured model. It also shows that final acoustic model that is diagonalized from the full covariance yields good improvement over BSRS model directly with diagonal model and yields significant improvement over the conventional diagonal model.


  Speech Transcript

|

  Slides

Enlarge the slide | Show all slides in a pop-up window

0:00:29

  1. slide

0:00:43

  2. slide

0:01:38

  3. slide

0:02:19

  4. slide

0:03:11

  5. slide

0:03:43

  6. slide

0:04:07

  7. slide

0:04:44

  8. slide

0:05:10

  9. slide

0:05:49

 10. slide

0:06:51

 11. slide

0:07:29

 12. slide

0:08:31

 13. slide

0:09:33

 14. slide

0:10:16

 15. slide

0:11:25

 16. slide

0:12:29

 17. slide

0:13:53

 18. slide

0:14:12

 19. slide

0:15:23

 20. slide

0:16:17

 21. slide

0:16:39

     3. slide

0:17:59

    10. slide

0:20:12

 22. slide

  Comments

Please sign in to post your comment!

  Lecture Information

Recorded: 2011-05-25 14:25 - 14:45, Panorama
Added: 15. 6. 2011 16:00
Number of views: 345
Video resolution: 1024x576 px, 512x288 px
Video length: 0:20:42
Audio track: MP3 [7.00 MB], 0:20:42