SuperLectures.com

BAYESIAN INTEGRATION OF AUDIO AND VISUAL INFORMATION FOR MULTI-TARGET TRACKING USING A CB-MEMBER FILTER

Full Paper at IEEE Xplore

Joint Audio Visual Processing

Přednášející: Reza Hoseinnezhad, Autoři: Reza Hoseinnezhad, RMIT University, Australia; Ba-Ngu Vo, Ba-Tuong Vo, The University of Western Australia, Australia; David Suter, The University of Adelaide, Australia

A new method is presented for integration of audio and visual information in multiple target tracking applications. The proposed approach uses a Bayesian filtering formulation and exploits multi-Bernoulli random finite set approximations. The work presented in this paper is the first principled Bayesian estimation approach to solve the sensor fusion problems that involve intermittent sensory data (e.g. audio data for a person who occasionally speaks.) We have examined our method with case studies from the SPEVI database. The results show nearly perfect tracking of people not only when they are silent but also when they are not visible to the camera (but speaking).


  Přepis řeči

|

  Slajdy

Zvětšit slajd | Zobrazit všechny slajdy

0:00:16

  1. slajd

0:00:39

  2. slajd

0:01:40

  3. slajd

0:02:30

     3. slajd

0:03:08

     3. slajd

0:03:39

  4. slajd

0:03:49

  5. slajd

0:04:59

  6. slajd

0:07:30

  7. slajd

0:09:17

  8. slajd

0:10:07

  9. slajd

0:11:08

 10. slajd

0:12:49

 11. slajd

0:13:48

 12. slajd

0:14:35

 13. slajd

0:16:38

 14. slajd

0:19:07

 15. slajd

0:21:33

 16. slajd

  Komentáře

Please sign in to post your comment!

  Informace o přednášce

Nahráno: 2011-05-24 17:55 - 18:15, Club H
Přidáno: 9. 6. 2011 00:58
Počet zhlédnutí: 35
Rozlišení videa: 1024x576 px, 512x288 px
Délka videa: 0:22:59
Audio stopa: MP3 [7.86 MB], 0:22:59