SuperLectures.com

SPEAKER DIARIZATION OF MEETINGS BASED ON SPEAKER ROLE N-GRAM MODELS

Full Paper at IEEE Xplore

Speaker Diarization

Presented by: Petr Motlíček, Author(s): Fabio Valente, Deepu Vijayasenan, Petr Motlicek, Idiap Research Institute, Switzerland

Speaker diarization of meeting recordings is generally based on acoustic information ignoring that meetings are instances of conversations. Several recent works have shown that the sequence of speakers in a conversation and their roles are related and statistically predictable. This paper proposes the use of speaker roles n-gram model to capture the conversation patterns probability and investigates its use as prior information into a state-of-the-art diarization system. Experiments are run on the AMI corpus annotated in terms of roles. The proposed technique reduces the speaker error by 19\% when the roles are known and by 17\% when they are estimated. Furthermore the paper investigates how the n-gram models generalize to different settings like those from the Rich Transcription campaigns. Experiments on 17 meetings reveal that the speaker error can be reduced by 12\% also in this case thus the n-gram can generalize across corpora.


  Speech Transcript

|

  Slides

Enlarge the slide | Show all slides in a pop-up window

0:00:18

  1. slide

0:01:00

  2. slide

0:02:27

  3. slide

0:03:20

  4. slide

0:05:04

  5. slide

0:06:15

  6. slide

0:07:00

  7. slide

0:08:41

  8. slide

0:13:06

  9. slide

0:13:34

 10. slide

0:14:29

 11. slide

0:15:14

 12. slide

0:16:03

 13. slide

0:16:37

 14. slide

0:18:47

    13. slide

  Comments

Please sign in to post your comment!

  Lecture Information

Recorded: 2011-05-24 13:45 - 14:05, Panorama
Added: 16. 6. 2011 18:57
Number of views: 29
Video resolution: 1024x576 px, 512x288 px
Video length: 0:22:10
Audio track: MP3 [7.50 MB], 0:22:10