A FEATURE SELECTION APPROACH FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

Cited by: 7

Authors:

Silla, Carlos N., Jr. [1]

Koerich, Alessandro L. [2]

Kaestner, Celso A. A. [3]

Affiliations:

[1] Univ Kent, Comp Lab, Canterbury CT2 7NF, Kent, England

[2] Pontificia Univ Catolica Parana, BR-80230901 Curitiba, PR, Brazil

[3] Univ Tecnol Fed Parana, BR-80230901 Curitiba, PR, Brazil

Source:

INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING | 2009, Vol. 3, No. 02

Keywords:

Music classification; feature selection; audio processing;

DOI:

10.1142/S1793351X09000719

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper we present an analysis of the suitability of four different feature sets that are currently employed to represent music signals in the context of automatic music genre classification. To this end, feature selection is carried out through genetic algorithms and applied to multiple feature vectors generated from different segments of the music signal. The feature sets used in this paper, which encompass time-domain and frequency-domain characteristics of the music signal, comprise: short-time Fourier transform, Mel frequency cepstral coefficients, beat-related features, pitch-related features, inter-onset interval histogram coefficients, rhythm histograms and statistical spectrum descriptors. The classification is based on the use of multiple feature vectors and an ensemble approach, according to time and space decomposition strategies. Feature vectors are extracted from music segments taken from the beginning, middle and end parts of the music signal (time decomposition). Although music genre classification is a multi-class problem, we accomplish the task using a combination of binary classifiers, whose results are merged to produce the final music genre label (space decomposition). Experiments were carried out on two databases: the Latin Music Database, which contains 3,227 music pieces categorized into ten musical genres, and the ISMIR 2004 genre contest database, which contains 1,458 music pieces categorized into six popular Western musical genres. The experimental results show that the importance of each feature set depends on the part of the music signal from which the feature vectors are extracted. Furthermore, the ensemble approach provides better results than the individual segments in most cases. For high-dimensional feature sets, feature selection provides a compact but discriminative feature subset that has an interesting trade-off between classification accuracy and computational effort.
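The two decomposition strategies named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the binary classifiers are paired or how their outputs are merged, so the sketch assumes one-vs-one pairing and plain majority voting. The genre names, the scores, and the helpers `make_toy_classifier`, `merge_pairwise` and `merge_segments` are all hypothetical placeholders.

```python
from collections import Counter
from itertools import combinations

# Placeholder genre names, used only for illustration.
GENRES = ["tango", "salsa", "forro"]


def make_toy_classifier(scores):
    """Return a stand-in for a set of trained binary classifiers.

    Assumption: each pairwise decision simply picks the genre with the
    higher (made-up) score. A real system would use trained models.
    """
    def winner(a, b):
        return a if scores[a] >= scores[b] else b
    return winner


def merge_pairwise(genres, pairwise_winner):
    """Space decomposition (assumed one-vs-one): every pair of genres is
    decided by a binary classifier and the genre with most wins is kept.
    Ties are broken by the order of `genres`."""
    votes = Counter(pairwise_winner(a, b) for a, b in combinations(genres, 2))
    return max(genres, key=lambda g: votes[g])


def merge_segments(segment_labels):
    """Time decomposition (assumed majority vote): combine the labels
    predicted for the beginning, middle and end segments.
    Ties go to the earliest segment."""
    counts = Counter(segment_labels)
    return max(segment_labels, key=lambda g: counts[g])


# Made-up scores for the three segments of one music piece.
segment_scores = {
    "beginning": {"tango": 0.2, "salsa": 0.7, "forro": 0.1},
    "middle": {"tango": 0.6, "salsa": 0.3, "forro": 0.1},
    "end": {"tango": 0.1, "salsa": 0.8, "forro": 0.1},
}

segment_labels = [
    merge_pairwise(GENRES, make_toy_classifier(scores))
    for scores in segment_scores.values()
]
final_label = merge_segments(segment_labels)
print(segment_labels, final_label)
```

In this toy run the middle segment disagrees with the other two, and the vote over segments settles on the label chosen by the majority, which mirrors the abstract's observation that the ensemble is usually more reliable than any single segment.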


Pages: 183-208

Page count: 26
