Full Paper at IEEE Xplore

Modeling and Analysis of Speech Production

Presented by: Thomas Drugman, Author(s): Thomas Drugman, Thomas Dubuisson, Thierry Dutoit, University of Mons, Belgium

In most current approaches to speech processing, information is extracted from the magnitude spectrum. However, recent perceptual studies have underlined the importance of the phase component. This paper investigates the potential of phase-based features for automatically detecting voice disorders. Group delay functions are shown to be appropriate for characterizing irregularities in phonation. Adherence to the mixed-phase model of speech is also discussed. The proposed phase-based features are evaluated and compared with other parameters derived from the magnitude spectrum. The two feature streams are shown to be complementary. Furthermore, the phase-based features turn out to convey a large amount of relevant information, leading to high discrimination performance.
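
For readers unfamiliar with the group delay function mentioned in the abstract, the following is a minimal sketch of how it can be computed for a single signal frame. It is not the authors' feature extraction. It uses the standard identity that the negative derivative of the phase spectrum equals (X_R Y_R + X_I Y_I) / |X|^2, where X is the spectrum of x[n] and Y is the spectrum of n·x[n]. The function name `group_delay` and the parameters `n_fft` and `eps` are assumptions chosen for illustration.

```python
import numpy as np

def group_delay(frame, n_fft=512, eps=1e-12):
    """Group delay (in samples) of one frame, without phase unwrapping.

    Hypothetical helper: name, n_fft and eps are illustrative choices,
    not taken from the paper.
    """
    frame = np.asarray(frame, dtype=float)
    n = np.arange(len(frame))
    X = np.fft.rfft(frame, n_fft)       # spectrum of x[n]
    Y = np.fft.rfft(n * frame, n_fft)   # spectrum of n * x[n]
    power = np.abs(X) ** 2
    # Floor the denominator so near-zero spectral values do not divide by zero.
    return (X.real * Y.real + X.imag * Y.imag) / np.maximum(power, eps)

# Sanity check: a unit impulse delayed by 5 samples has a constant
# group delay of 5 samples at every frequency.
frame = np.zeros(64)
frame[5] = 1.0
print(np.allclose(group_delay(frame), 5.0))  # True
```

Computing the group delay from the two spectra avoids explicit phase unwrapping, which is one common reason this formulation is preferred in practice.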



  Lecture Information

Recorded: 2011-05-27 09:50 - 10:10, Panorama
Added: 9. 6. 2011 10:42
Number of views: 27
Video resolution: 1024x576 px, 512x288 px
Video length: 0:19:54