IMPROVING MELODY EXTRACTION USING PROBABILISTIC LATENT COMPONENT ANALYSIS

Music Signal Processing

Presented by: Jinyu Han, Author(s): Jinyu Han, Northwestern University, United States; Ching-Wei Chen, Gracenote, United States

We propose a new approach for automatic melody extraction from polyphonic audio, based on Probabilistic Latent Component Analysis (PLCA). An audio signal is first divided into vocal and non-vocal segments using a trained Gaussian Mixture Model (GMM) classifier. A statistical model of the non-vocal segments of the signal is then learned adaptively from this particular input music by PLCA. This model is then employed to remove the accompaniment from the mixture, leaving mainly the vocal components. The melody line is extracted from the vocal components using an auto-correlation algorithm. Quantitative evaluation shows that the new system performs significantly better than two existing melody extraction algorithms for polyphonic single-channel mixtures.

You need the Flash Player.

Share:

Download subtitles | Enlarge video

1. slide