SuperLectures.com

COST-SENSITIVE STACKING FOR AUDIO TAG ANNOTATION AND RETRIEVAL

Full Paper at IEEE Xplore

Multimedia Indexing and Retrieval

Presented by: Hung-Yi Lo, Author(s): Hung-Yi Lo, Ju-Chiang Wang, Hsin-Min Wang, Institute of Information Science / Academia Sinica, Taiwan; Shou-De Lin, National Taiwan University, Taiwan

Audio tags correspond to keywords that people use to describe different aspects of a music clip, such as the genre, mood, and instrumentation. Since social tags are usually assigned by people with different levels of musical knowledge, they inevitably contain noisy information. By treating the tag counts as costs, we can model the audio tagging problem as a cost-sensitive classification problem. In addition, tag correlation is another useful information for automatic audio tagging since some tags often co-occur. By considering the co-occurrences of tags, we can model the audio tagging problem as a multi-label classification problem. To exploit the tag count and correlation information jointly, we formulate the audio tagging task as a novel cost-sensitive multi-label (CSML) learning problem. The results of audio tag annotation and retrieval experiments demonstrate that the new approach outperforms our MIREX 2009 winning method.


  Speech Transcript

|

  Slides

Enlarge the slide | Show all slides in a pop-up window

0:00:16

  1. slide

0:00:35

  2. slide

0:01:49

  3. slide

0:02:30

  4. slide

0:03:45

  5. slide

0:04:15

  6. slide

0:05:28

  7. slide

0:05:43

  8. slide

0:06:21

  9. slide

0:07:41

 10. slide

0:08:39

 11. slide

0:09:37

 12. slide

0:10:33

 13. slide

0:12:08

 14. slide

  Comments

Please sign in to post your comment!

  Lecture Information

Recorded: 2011-05-25 14:05 - 14:25, Club H
Added: 9. 6. 2011 00:15
Number of views: 33
Video resolution: 1024x576 px, 512x288 px
Video length: 0:14:18
Audio track: MP3 [4.88 MB], 0:14:18