SuperLectures.com

COST-SENSITIVE STACKING FOR AUDIO TAG ANNOTATION AND RETRIEVAL

Multimedia Indexing and Retrieval

Full Paper at IEEE Xplore

Přednášející: Hung-Yi Lo, Autoři: Hung-Yi Lo, Ju-Chiang Wang, Hsin-Min Wang, Institute of Information Science / Academia Sinica, Taiwan; Shou-De Lin, National Taiwan University, Taiwan

Audio tags correspond to keywords that people use to describe different aspects of a music clip, such as the genre, mood, and instrumentation. Since social tags are usually assigned by people with different levels of musical knowledge, they inevitably contain noisy information. By treating the tag counts as costs, we can model the audio tagging problem as a cost-sensitive classification problem. In addition, tag correlation is another useful information for automatic audio tagging since some tags often co-occur. By considering the co-occurrences of tags, we can model the audio tagging problem as a multi-label classification problem. To exploit the tag count and correlation information jointly, we formulate the audio tagging task as a novel cost-sensitive multi-label (CSML) learning problem. The results of audio tag annotation and retrieval experiments demonstrate that the new approach outperforms our MIREX 2009 winning method.


  Přepis řeči

|

  Slajdy

Zvětšit slajd | Zobrazit všechny slajdy

0:00:16

  1. slajd

0:00:35

  2. slajd

0:01:49

  3. slajd

0:02:30

  4. slajd

0:03:45

  5. slajd

0:04:15

  6. slajd

0:05:28

  7. slajd

0:05:43

  8. slajd

0:06:21

  9. slajd

0:07:41

 10. slajd

0:08:39

 11. slajd

0:09:37

 12. slajd

0:10:33

 13. slajd

0:12:08

 14. slajd

  Komentáře

Please sign in to post your comment!

  Informace o přednášce

Nahráno: 2011-05-25 14:05 - 14:25, Club H
Přidáno: 9. 6. 2011 00:15
Počet zhlédnutí: 33
Rozlišení videa: 1024x576 px, 512x288 px
Délka videa: 0:14:18
Audio stopa: MP3 [4.88 MB], 0:14:18