SuperLectures.com

A DYNAMIC APPROACH TO THE SELECTION OF HIGH ORDER N-GRAMS IN PHONOTACTIC LANGUAGE RECOGNITION

Language Identification

Full Paper at IEEE Xplore

Přednášející: Mikel Penagarikano, Autoři: Mikel Penagarikano, Amparo Varona, Luis Javier Rodriguez-Fuentes, German Bordel, University of the Basque Country, Spain

Due to computational bounds, most SVM-based phonotactic language recognition systems consider only low-order n-grams (up to n=3), thus limiting the potential performance of this approach. The huge amount of n-grams for n>=4 makes it computationally unfeasible even selecting the most frequent n-grams. In this paper, we demonstrate the feasibility and usefulness of using high-order n-grams for n=4;5;6;7 in SVM-based phonotactic language recognition, thanks to a dynamic n-gram selection algorithm. The most frequent n-grams are selected, but computational issues (those regarding memory requirements) are prevented, since counts are periodically updated and only those units with the highest counts are retained for subsequent processing. Systems were built by means of open software (Brno University of Technology phone decoders, HTK, LIBLINEAR and FoCal) and experiments were carried out on the NIST LRE2007 database. Applying the proposed approach, a 1.36% EER was achieved when using up to 4-grams, 1.32% EER when using up to 5-grams (11.2% improvement with regard to using up to 3-grams) and 1.34% EER when using up to 6-grams or 7-grams.


  Přepis řeči

|

  Slajdy

Zvětšit slajd | Zobrazit všechny slajdy

0:00:16

  1. slajd

0:00:46

  2. slajd

0:02:02

  3. slajd

0:02:32

  4. slajd

0:04:32

  5. slajd

0:05:27

  6. slajd

0:06:16

  7. slajd

0:06:35

  8. slajd

0:07:04

  9. slajd

0:09:46

 10. slajd

0:10:56

 11. slajd

0:12:48

    11. slajd

0:13:03

 12. slajd

0:14:09

     1. slajd

  Komentáře

Please sign in to post your comment!

  Informace o přednášce

Nahráno: 2011-05-24 11:55 - 12:15, Panorama
Přidáno: 16. 6. 2011 14:04
Počet zhlédnutí: 30
Rozlišení videa: 1024x576 px, 512x288 px
Délka videa: 0:17:16
Audio stopa: MP3 [5.82 MB], 0:17:16