SuperLectures.com

STRUCTURED OUTPUT LAYER NEURAL NETWORK LANGUAGE MODEL

Language Modeling

Full Paper at IEEE Xplore

Presented by: Ilya Oparin
Author(s): Hai Son Le, LIMSI CNRS / Uni. Paris-Sud, France; Ilya Oparin, LIMSI CNRS, France; Alexandre Allauzen, LIMSI CNRS / Uni. Paris-Sud, France; Jean-Luc Gauvain, LIMSI CNRS, France; Francois Yvon, LIMSI CNRS / Uni. Paris-Sud, France

This paper introduces a new neural network language model (NNLM) that uses word clustering to structure the output vocabulary: the Structured Output Layer NNLM. The model can handle vocabularies of arbitrary size, which removes the need to design the short-lists commonly used in NNLMs. In this model, several softmax layers replace the standard output layer. The output structure depends on the word clustering, which uses the continuous word representations induced by an NNLM. Speech-to-text experiments on the GALE Mandarin data were used to evaluate the NNLMs. On this data, the well-tuned baseline system has a character error rate under 10%. Our model achieves consistent improvements over the combination of an n-gram model and classical short-list NNLMs, both in perplexity and in recognition accuracy.
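The idea of replacing one output softmax with several softmax layers can be illustrated with a minimal sketch. It assumes a simplified two-level factorization, P(word | history) = P(class | history) x P(word | class, history), with a fixed word-to-class assignment and random weights. The names `TwoLevelOutput`, `word_prob`, and `word_to_class` are hypothetical, and this is not the authors' implementation; the structure described in the paper is derived from clustering learned word representations and may differ in depth and detail.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot_rows(matrix, vector):
    """Multiply each row of `matrix` with `vector` (a plain matrix-vector product)."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class TwoLevelOutput:
    """Hypothetical two-level output layer: one softmax over classes,
    plus one softmax per class over the words assigned to that class."""

    def __init__(self, hidden_dim, word_to_class, seed=0):
        rng = random.Random(seed)
        self.word_to_class = list(word_to_class)
        n_classes = max(self.word_to_class) + 1
        # Word ids belonging to each class (assumes every class is non-empty).
        self.members = [
            [w for w, c in enumerate(self.word_to_class) if c == k]
            for k in range(n_classes)
        ]

        def rand_matrix(rows):
            # Random weights stand in for trained parameters (assumption).
            return [[rng.gauss(0.0, 0.1) for _ in range(hidden_dim)]
                    for _ in range(rows)]

        self.class_weights = rand_matrix(n_classes)
        self.word_weights = [rand_matrix(len(m)) for m in self.members]

    def word_prob(self, hidden, word):
        """P(word | hidden) = P(class | hidden) * P(word | class, hidden)."""
        c = self.word_to_class[word]
        p_class = softmax(dot_rows(self.class_weights, hidden))[c]
        p_words = softmax(dot_rows(self.word_weights[c], hidden))
        return p_class * p_words[self.members[c].index(word)]


# Six words in three classes; the hidden vector is an arbitrary example input.
layer = TwoLevelOutput(hidden_dim=4, word_to_class=[0, 0, 1, 1, 1, 2])
hidden = [0.5, -1.0, 0.25, 2.0]
probs = [layer.word_prob(hidden, w) for w in range(6)]
total = sum(probs)  # the factorized probabilities sum to 1 over the vocabulary
```

In this sketch, scoring one word only requires a softmax over the classes and a softmax over the words of a single class, rather than one softmax over the whole vocabulary. This is one way to see why a structured output can scale to large vocabularies without a short-list.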



  Lecture Information

Recorded: 2011-05-25 16:55 - 17:15, Club H
Added: 2011-06-09 02:30
Number of views: 49
Video resolution: 1024x576 px, 512x288 px
Video length: 0:22:23
Audio track: MP3 [7.65 MB], 0:22:23