In this paper, we propose an automatic music genre classification approach based on long-term modulation spectral analysis of the static and transitional information of spectral (OSC and MPEG-7 NASE) and cepstral (MFCC) features. An information fusion approach that integrates feature-level fusion and decision-level combination is employed to improve classification accuracy. Experiments on the music database used in the ISMIR2004 audio description contest show that the proposed approach achieves a classification accuracy of 87.79%, which is higher than that of the contest winner.
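As a rough illustration of the ideas named above, the following minimal Python sketch computes a modulation spectrum as the magnitude DFT of a single feature trajectory across frames, fuses feature vectors by concatenation, and combines classifier decisions by summing per-genre scores. The abstract does not specify window lengths, classifiers, or fusion rules, so every function name and the choice of concatenation and sum-rule combination here are assumptions made for illustration, not the authors' method.

```python
import cmath
import math


def modulation_spectrum(trajectory):
    """Magnitude DFT of one feature trajectory (one value per analysis frame).

    Assumption: a plain DFT over the whole trajectory stands in for the
    long-term modulation analysis; the paper's actual windowing is not given.
    """
    n = len(trajectory)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(trajectory))
        spectrum.append(abs(acc) / n)
    return spectrum


def fuse_features(*vectors):
    """Feature-level fusion, sketched here as simple concatenation (assumed)."""
    fused = []
    for v in vectors:
        fused.extend(v)
    return fused


def combine_decisions(score_dicts):
    """Decision-level combination, sketched as a sum of per-genre scores (assumed)."""
    totals = {}
    for scores in score_dicts:
        for genre, s in scores.items():
            totals[genre] = totals.get(genre, 0.0) + s
    return max(totals, key=totals.get)
```

For example, a trajectory that oscillates twice over 16 frames concentrates its energy in modulation bin 2, and two hypothetical classifiers that disagree are resolved by whichever genre has the larger summed score.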
Published in: Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP '09), 2009
Date of Conference: 12-14 Sept. 2009