SuperLectures.com

MPTRACKER: A NEW MULTI-PITCH DETECTION AND SEPARATION ALGORITHM FOR MIXED SPEECH SIGNALS

Speech Analysis

Full Paper at IEEE Xplore

Přednášející: Hossein Radfar, Autoři: Hossein Radfar, University of Toronto, Canada; R. M. Dansereau, Carleton University, Canada; Wai-Yip Chan, Queen's University Belfast, Canada; W. Wong, University of Toronto, Canada

We present MPtracker, a new algorithm for tracking and separating the pitch frequencies of two speakers from their mixture. The pitch frequencies are detected by introducing a novel spectral distortion optimization which takes into account the sinusoidal modeling of the speech signal. The detected pitch frequencies are grouped, separated, and finally an interpolation method is applied to estimate missing pitch frequencies. We evaluated the performance of the proposed technique on 196 mixtures including 48 male-male, 48 female-female, and 96 male-female mixtures with target-to-interference ratios (TIR) ranging from 0 dB to +18 dB. The results show our simple but effective and fast technique significantly outperforms two widely-used approaches


  Přepis řeči

|

  Slajdy

Zvětšit slajd | Zobrazit všechny slajdy

0:00:21

  1. slajd

0:00:51

  2. slajd

0:02:38

  3. slajd

0:03:26

  4. slajd

0:04:40

  5. slajd

0:05:11

  6. slajd

0:06:31

  7. slajd

0:07:32

  8. slajd

0:08:17

  9. slajd

0:09:04

 10. slajd

0:10:04

 11. slajd

0:10:50

 12. slajd

0:11:26

 13. slajd

0:12:06

 14. slajd

0:13:23

 15. slajd

0:14:46

 16. slajd

0:15:10

 17. slajd

  Komentáře

Please sign in to post your comment!

  Informace o přednášce

Nahráno: 2011-05-25 09:50 - 10:10, Panorama
Přidáno: 15. 6. 2011 15:18
Počet zhlédnutí: 33
Rozlišení videa: 1024x576 px, 512x288 px
Délka videa: 0:19:22
Audio stopa: MP3 [6.54 MB], 0:19:22