SuperLectures.com

AN APPROACH TO SEQUENTIAL GROUPING IN COCHANNEL SPEECH

Full Paper at IEEE Xplore

Speech Enhancement

Presented by: DeLiang Wang, Author(s): Ke Hu, DeLiang Wang, The Ohio State University, United States

Model-based methods for sequential organization in cochannel speech require pretrained speaker models and often prior knowledge of participating speakers. We propose an unsupervised approach to sequential organization of cochannel speech. Based on cepstral features, we first cluster voiced speech into two speaker groups by maximizing the ratio of between-group to within-group distances, penalized by within-group concurrent pitches. To group unvoiced speech, we employ an onset/offset-based analysis to generate time-frequency segments. Unvoiced segments are then labeled by the complementary portions of segregated voiced speech. Our method does not require any pretrained model and is computationally simple. Evaluations and comparisons show that the proposed method outperforms a model-based method in terms of speech segregation.
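
The voiced-speech grouping step described in the abstract could be sketched roughly as follows. This is a toy illustration, not the authors' implementation: the abstract does not give the exact form of the objective or the search procedure, so the scatter-ratio score, the frame-overlap penalty and its weight `penalty`, the exhaustive search, and the function name `group_voiced_segments` are all assumptions made for this example.

```python
from itertools import product

import numpy as np


def group_voiced_segments(features, spans, penalty=1.0):
    """Split voiced segments into two groups (hypothetical helper).

    features: (n, d) array, one cepstral feature vector per segment.
    spans:    n half-open (start_frame, end_frame) intervals.
    penalty:  assumed weight on same-group temporal overlap.
    Returns (labels, score) with labels a list of 0/1 group indices.
    """
    features = np.asarray(features, dtype=float)
    n = len(features)
    overall_mean = features.mean(axis=0)
    best_labels, best_score = None, -np.inf

    # Fix segment 0 in group 0 so mirrored labelings are not tried twice.
    for rest in product((0, 1), repeat=n - 1):
        labels = np.array((0,) + rest)
        if labels.sum() == 0:  # both groups must be non-empty
            continue

        between = within = 0.0
        for g in (0, 1):
            members = features[labels == g]
            centre = members.mean(axis=0)
            between += len(members) * np.sum((centre - overall_mean) ** 2)
            within += np.sum((members - centre) ** 2)

        # Frames where two segments of the same group are active at once:
        # one speaker cannot produce two pitch tracks simultaneously.
        overlap = sum(
            max(0, min(spans[i][1], spans[j][1]) - max(spans[i][0], spans[j][0]))
            for i in range(n)
            for j in range(i + 1, n)
            if labels[i] == labels[j]
        )

        score = between / (within + 1e-12) - penalty * overlap
        if score > best_score:
            best_labels, best_score = labels.tolist(), score

    return best_labels, best_score


# Toy data: two well-separated feature clusters; overlapping segments
# come from different clusters, so no penalty applies to the best split.
feats = [[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]]
spans = [(0, 10), (20, 30), (5, 15), (25, 35)]
labels, score = group_voiced_segments(feats, spans)
print(labels)
```

The exhaustive search is only practical for a handful of segments; it is used here to keep the objective itself easy to read, and any real system would need a more scalable search.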





  Lecture Information

Recorded: 2011-05-27 16:35 - 16:55, Panorama
Added: 2011-06-09 04:47
Video resolution: 1024x576 px, 512x288 px
Video length: 0:21:31
Audio track: MP3 [7.36 MB], 0:21:31