MODEL-BASED COMPRESSIVE SENSING FOR MULTI-PARTY DISTANT SPEECH RECOGNITION
Robust ASR
Přednášející: Afsaneh Asaei, Autoři: Afsaneh Asaei, Hervé Bourlard, Volkan Cevher, Idiap Research Institute / Ecole Polytechnique Federale de Lausanne, Switzerland
We leverage the recent algorithmic advances in compressive sensing, and propose a novel source separation algorithm for efficient recovery of convolutive speech mixtures in spectro-temporal domain. Compared to the common sparse component analysis techniques, our approach fully exploits structured sparsity models to obtain substantial improvement over the existing state-of-the-art. We evaluate our method for separation and recognition of a target speaker in a multi-party scenario. Our results provide compelling evidence of the effectiveness of sparse recovery formulations in speech recognition.
Informace o přednášce
| Nahráno: | 2011-05-26 17:35 - 17:55, Panorama | 
|---|---|
| Přidáno: | 15. 6. 2011 19:18 | 
| Počet zhlédnutí: | 43 | 
| Rozlišení videa: | 1024x576 px, 512x288 px | 
| Délka videa: | 0:18:54 | 
| Audio stopa: | MP3 [6.00 MB], 0:18:54 | 













Komentáře