Full Paper at IEEE Xplore

Joint Audio Visual Processing

Speaker: Jong-Seok Lee, Authors: Jong-Seok Lee, Touradj Ebrahimi, Swiss Federal Institute of Technology in Lausanne, Switzerland

This letter proposes a method for recovering the audio-visual synchronization of multimedia content. It exploits the correlation between the acoustic and visual signals to estimate the audio-visual drift present in the content. The audio signal is shifted relative to the visual signal, and the drift is estimated as the shift that produces the maximal audio-visual correlation. We consider two correlation measures, namely mutual information and canonical correlation, and compare their performance. Experimental results demonstrate that the method using canonical correlation is effective in recovering audio-visual synchronization for both speech and non-speech sequences.
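
The shift search described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature matrices `audio` and `video` (one row per video frame), the helper names, and the search range `max_shift` are all hypothetical, and the canonical correlation is computed with a standard QR-plus-SVD approach that assumes the feature matrices have full column rank.

```python
import numpy as np


def first_canonical_correlation(x, y):
    """Largest canonical correlation between two feature matrices.

    x and y have shape (frames, dims). Assumes full column rank
    (an assumption of this sketch, not something stated in the abstract).
    """
    x = x - x.mean(axis=0)
    y = y - y.mean(axis=0)
    # Orthonormal bases of the two centered feature spaces.
    qx, _ = np.linalg.qr(x)
    qy, _ = np.linalg.qr(y)
    # Singular values of qx^T qy are the canonical correlations.
    s = np.linalg.svd(qx.T @ qy, compute_uv=False)
    return float(min(s[0], 1.0))


def estimate_drift(audio, video, max_shift):
    """Search integer shifts in [-max_shift, max_shift] (in frames).

    At shift s, audio[t + s] is paired with video[t]; a positive result
    therefore means the audio lags the video by s frames.
    Returns (best_shift, best_score).
    """
    n = min(len(audio), len(video))
    best_shift, best_score = 0, -np.inf
    for s in range(-max_shift, max_shift + 1):
        if s >= 0:
            a, v = audio[s:n], video[:n - s]
        else:
            a, v = audio[:n + s], video[-s:n]
        score = first_canonical_correlation(a, v)
        if score > best_score:
            best_shift, best_score = s, score
    return best_shift, best_score
```

Once a shift is estimated, synchronization would be recovered by moving the audio track by the opposite amount. Replacing `first_canonical_correlation` with a mutual information estimate would give the other variant the abstract compares; the estimator itself is left out here because the abstract does not specify one.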


  Talk information

Recorded: 2011-05-24 16:15 - 16:35, Club H
Added: 2011-06-07 19:18
Views: 11
Video resolution: 1024x576 px, 512x288 px
Video length: 0:19:08
Audio track: MP3 [6.54 MB], 0:19:08