Automatic Music Genre Classification Based on Modulation Spectral Analysis of Spectral and Cepstral Features

被引:95
作者
Lee, Chang-Hsing [1 ]
Shih, Jau-Ling [1 ]
Yu, Kun-Ming [1 ]
Lin, Hwai-San [1 ]
机构
[1] Chung Hua Univ, Dept Comp Sci & Informat Engn, Hsinchu 300, Taiwan
关键词
Mel-frequency cepstral coefficients; modulation spectral analysis; music genre classification; normalized audio spectrum envelope; octave-based spectral contrast;
D O I
10.1109/TMM.2009.2017635
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we will propose an automatic music genre classification approach based on long-term modulation spectral analysis of spectral (OSC and MPEG-7 NASE) as well as cepstral (MFCC) features. Modulation spectral analysis of every feature value will generate a corresponding modulation spectrum and all the modulation spectra can be collected to form a modulation spectrogram which exhibits the time-varying or rhythmic information of music signals. Each modulation spectrum is then decomposed into several logarithmically-spaced modulation subbands. The modulation spectral contrast (MSC) and modulation spectral valley (MSV) are then computed from each modulation subband. Effective and compact features are generated from statistical aggregations of the MSCs and MSVs of all modulation subbands. An information fusion approach which integrates both feature level fusion method and decision level combination method is employed to improve the classification accuracy. Experiments conducted on two different music datasets have shown that our proposed approach can achieve higher classification accuracy than other approaches with the same experimental setup.
引用
收藏
页码:670 / 682
页数:13
相关论文
共 36 条
[1]   Joint acoustic and modulation frequency [J].
Atlas, L ;
Shamma, SA .
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (07) :668-675
[2]   Representing musical genre: A state of the art [J].
Aucouturier, JJ ;
Pachet, F .
JOURNAL OF NEW MUSIC RESEARCH, 2003, 32 (01) :83-93
[3]   Automatic classification,of musical genres using inter-genre similarity [J].
Bagci, Ulas ;
Erzin, Engin .
IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (08) :521-524
[4]   Aggregate features and ADABOOST for music classification [J].
Bergstra, James ;
Casagrande, Norman ;
Erhan, Dumitru ;
Eck, Douglas ;
Kegl, Balazs .
MACHINE LEARNING, 2006, 65 (2-3) :473-484
[5]  
Duda R.O., 1973, Pattern Classification and Scene Analysis
[6]   Characterizing frequency selectivity for envelope fluctuations [J].
Ewert, SD ;
Dau, T .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 108 (03) :1181-1196
[7]  
Grimaldi M., 2003, P 5 ACM SIGMM INT WO, P102, DOI [DOI 10.1145/973264.973281, 10.1145/973264.973281]
[8]   Music type classification by spectral contrast feature [J].
Jiang, DN ;
Lu, L ;
Zhang, HJ ;
Tao, JH ;
Cai, LH .
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, :113-116
[9]   On the relative importance of various components of the modulation spectrum for automatic speech recognition [J].
Kanedera, N ;
Arai, T ;
Hermansky, H ;
Pavel, M .
SPEECH COMMUNICATION, 1999, 28 (01) :43-55
[10]  
Kim HG, 2005, MPEG-7 AUDIO AND BEYOND: AUDIO CONTENT INDEXING AND RETRIEVAL, P1, DOI 10.1002/0470093366