MPTRACKER: A NEW MULTI-PITCH DETECTION AND SEPARATION ALGORITHM FOR MIXED SPEECH SIGNALS
Speech Analysis
Presenter: Hossein Radfar. Authors: Hossein Radfar, University of Toronto, Canada; R. M. Dansereau, Carleton University, Canada; Wai-Yip Chan, Queen's University, Canada; W. Wong, University of Toronto, Canada
We present MPtracker, a new algorithm for tracking and separating the pitch frequencies of two speakers from their mixture. The pitch frequencies are detected with a novel spectral distortion optimization that takes into account the sinusoidal model of the speech signal. The detected pitch frequencies are then grouped and separated, and finally an interpolation method is applied to estimate missing pitch frequencies. We evaluated the proposed technique on 192 mixtures (48 male-male, 48 female-female, and 96 male-female) with target-to-interference ratios (TIR) ranging from 0 dB to +18 dB. The results show that our simple, fast technique significantly outperforms two widely used approaches.
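The final step described above, estimating missing pitch frequencies by interpolation, can be illustrated with a minimal sketch. The abstract does not say which interpolation method MPtracker uses, so linear interpolation is an assumption here, and the function name `fill_missing_pitch` and the NaN convention for missing frames are hypothetical choices made only for illustration.

```python
import numpy as np


def fill_missing_pitch(track_hz):
    """Fill gaps in one speaker's pitch track by linear interpolation.

    Assumption: missing pitch estimates are marked with NaN. Only gaps that
    lie between two known frames are filled; leading and trailing gaps are
    left as NaN because there is nothing to interpolate between.
    """
    track = np.asarray(track_hz, dtype=float)
    known = ~np.isnan(track)
    if known.sum() < 2:
        # Not enough known frames to interpolate.
        return track.copy()

    frames = np.arange(track.size)
    first, last = frames[known][0], frames[known][-1]
    inside = (frames >= first) & (frames <= last) & ~known

    filled = track.copy()
    filled[inside] = np.interp(frames[inside], frames[known], track[known])
    return filled


# Hypothetical pitch track in Hz, one value per analysis frame.
track = [np.nan, 100.0, np.nan, np.nan, 130.0, np.nan]
filled = fill_missing_pitch(track)
print(filled)
```

In this toy track the two interior gaps are filled with values on the straight line between 100 Hz and 130 Hz, while the first and last frames stay undefined.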
Lecture information
Recorded: | 2011-05-25 09:50 - 10:10, Panorama |
---|---|
Added: | 2011-06-15 15:18 |
Views: | 33 |
Video resolution: | 1024x576 px, 512x288 px |
Video length: | 0:19:22 |
Audio track: | MP3 [6.54 MB], 0:19:22 |