SuperLectures.com

PHONEME SELECTIVE SPEECH ENHANCEMENT USING THE GENERALIZED PARAMETRIC SPECTRAL SUBTRACTION ESTIMATOR

Full Paper at IEEE Xplore

Speech Enhancement

Presented by: John Hansen, Author(s): Amit Das, University of Colorado Boulder / University of Texas at Dallas, United States; John Hansen, The University of Texas at Dallas, United States

In this study, the generalized parametric spectral subtraction estimator is employed in the context of a ROVER speech enhancement framework to develop a robust phoneme class selective enhancement algorithm. The parametric estimator is derived by a) optimizing the weighted Euclidean distortion cost function and b) by modeling clean speech spectral magnitudes as Rayleigh distributed priors. A set of enhanced utterances are generated from a single noisy utterance by tuning the parameters of the parametric estimator for different phoneme classes. The speech and non-speech segments are segregated using a voice activity detector. Thereafter, the mixture maximum model is used to make soft decisions on these segments to determine their phoneme class weights. The segments from the enhanced utterances are weighted by these decisions and combined to form the final composite utterance. Using segmental SNR and Itakura-Saito metrics over two noise types and four SNR levels, it was demonstrated that the composite utterance exhibited better phoneme class improvement than the individual utterances enhanced from the parametric estimator.


  Speech Transcript

|

  Slides

Enlarge the slide | Show all slides in a pop-up window

0:00:16

  1. slide

0:00:42

  2. slide

0:01:43

  3. slide

0:03:04

  4. slide

0:05:04

  5. slide

0:06:05

  6. slide

0:07:31

  7. slide

0:08:43

  8. slide

0:09:29

  9. slide

0:10:55

 10. slide

0:12:32

 11. slide

0:13:32

 12. slide

0:14:11

 13. slide

0:16:19

 14. slide

0:18:13

 15. slide

0:18:42

 16. slide

0:19:01

 17. slide

0:19:29

 18. slide

  Comments

Please sign in to post your comment!

  Lecture Information

Recorded: 2011-05-27 17:35 - 17:55, Panorama
Added: 9. 6. 2011 05:36
Number of views: 37
Video resolution: 1024x576 px, 512x288 px
Video length: 0:21:47
Audio track: MP3 [7.45 MB], 0:21:47