SuperLectures.com

PHONEME SELECTIVE SPEECH ENHANCEMENT USING THE GENERALIZED PARAMETRIC SPECTRAL SUBTRACTION ESTIMATOR

Full Paper at IEEE Xplore

Speech Enhancement

Speaker: John Hansen. Authors: Amit Das, University of Colorado Boulder / University of Texas at Dallas, United States; John Hansen, The University of Texas at Dallas, United States

In this study, the generalized parametric spectral subtraction estimator is employed within a ROVER speech enhancement framework to develop a robust phoneme class selective enhancement algorithm. The parametric estimator is derived by a) optimizing the weighted Euclidean distortion cost function and b) modeling the clean speech spectral magnitudes with a Rayleigh prior. A set of enhanced utterances is generated from a single noisy utterance by tuning the parameters of the parametric estimator for different phoneme classes. The speech and non-speech segments are separated using a voice activity detector. The mixture maximum model is then used to make soft decisions on these segments, which determine their phoneme class weights. The segments from the enhanced utterances are weighted by these decisions and combined to form the final composite utterance. Evaluated with segmental SNR and Itakura-Saito metrics over two noise types and four SNR levels, the composite utterance showed better phoneme class improvement than the individual utterances enhanced by the parametric estimator.
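The final combination step and the segmental SNR metric mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `combine_enhanced` and `segmental_snr`, the use of non-overlapping frames, and the clamping range of -10 to 35 dB are assumptions made for illustration. The soft phoneme class weights, which the paper obtains from the mixture maximum model, are simply taken as given inputs here.

```python
import math


def combine_enhanced(enhanced, weights, frame_len):
    """Blend K enhanced utterances into one composite utterance.

    enhanced  : list of K equal-length sample lists, one per phoneme class
                (hypothetical layout, not taken from the paper).
    weights   : one list of K non-negative soft weights per frame.
    frame_len : samples per non-overlapping frame (an assumption).
    """
    n_classes = len(enhanced)
    n_samples = len(enhanced[0])
    if any(len(u) != n_samples for u in enhanced):
        raise ValueError("all enhanced utterances must have the same length")
    n_frames = -(-n_samples // frame_len)  # ceiling division
    if len(weights) != n_frames:
        raise ValueError("need exactly one weight vector per frame")

    composite = [0.0] * n_samples
    for f, w in enumerate(weights):
        if len(w) != n_classes:
            raise ValueError("each weight vector needs one entry per class")
        total = sum(w)
        # Normalise so the weights of a frame sum to one; fall back to a
        # plain average when every weight in the frame is zero.
        norm = [x / total for x in w] if total > 0 else [1.0 / n_classes] * n_classes
        start = f * frame_len
        stop = min(start + frame_len, n_samples)
        for n in range(start, stop):
            composite[n] = sum(norm[k] * enhanced[k][n] for k in range(n_classes))
    return composite


def segmental_snr(clean, processed, frame_len, lo=-10.0, hi=35.0):
    """Mean of per-frame SNRs in dB, each clamped to [lo, hi].

    The clamping limits are a common convention, assumed here rather than
    taken from the paper. Frames with zero clean energy are skipped.
    """
    values = []
    for start in range(0, len(clean) - frame_len + 1, frame_len):
        c = clean[start:start + frame_len]
        p = processed[start:start + frame_len]
        signal = sum(x * x for x in c)
        error = sum((x - y) ** 2 for x, y in zip(c, p))
        if signal == 0:
            continue
        snr = hi if error == 0 else 10.0 * math.log10(signal / error)
        values.append(min(max(snr, lo), hi))
    return sum(values) / len(values) if values else float("nan")


if __name__ == "__main__":
    # Toy data: two "enhanced" versions of a four-sample utterance.
    enhanced = [[1.0, 1.0, 1.0, 1.0], [3.0, 3.0, 3.0, 3.0]]
    weights = [[1.0, 0.0], [1.0, 1.0]]  # one soft-weight vector per frame
    composite = combine_enhanced(enhanced, weights, frame_len=2)
    print(composite)
    print(segmental_snr([2.0, 2.0, 2.0, 2.0], composite, frame_len=2))
```

In this sketch a frame whose weights favour a single class reproduces that class's enhanced segment, while evenly split weights average the candidates. A real system would more likely use overlapping, windowed frames.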



  Lecture information

Recorded: 2011-05-27 17:35 - 17:55, Panorama
Added: 2011-06-09 05:36
Views: 37
Video resolution: 1024x576 px, 512x288 px
Video length: 0:21:47
Audio track: MP3 [7.45 MB], 0:21:47