Discovering Time-Constrained Sequential Patterns for Music Genre Classification

Cited by: 21

Authors:

Ren, Jia-Min [1]

Jang, Jyh-Shing Roger [1]

Affiliations:

[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 30013, Taiwan

Source:

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2012 / Vol. 20 / No. 04

Keywords:

Data mining; hidden Markov model (HMM); music genre classification; time-constrained sequential pattern (TSP)

DOI:

10.1109/TASL.2011.2172426

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification code:

070206; 082403

Abstract:

A music piece can be considered as a sequence of sound events that carry both short-term and long-term temporal information. However, in automatic music genre classification, most text-categorization-based approaches capture only local temporal dependencies (e.g., unigram- and bigram-based occurrence statistics) to represent music content. In this paper, we propose the use of time-constrained sequential patterns (TSPs) as effective features for music genre classification. First, a technique from automatic language identification is used to tokenize each music piece into a sequence of hidden Markov model indices. Then TSP mining is applied to discover genre-specific TSPs, followed by the computation of the occurrence frequencies of the TSPs in each music piece. Finally, support vector machine classifiers are trained on these occurrence frequencies to perform the classification task. Experiments conducted on two widely used datasets for music genre classification, GTZAN and ISMIR2004Genre, show that the proposed method can discover more discriminative temporal structures and achieve better recognition accuracy than the unigram- and bigram-based statistical approach.
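The occurrence-frequency step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not define the time constraint, so the sketch assumes it is a maximum distance (`max_gap`, in token positions) between consecutive matched elements. The names `count_tsp_occurrences` and `tsp_feature_vector`, the counting rule (one count per valid start position), and the normalization by sequence length are all hypothetical. Tokenization into HMM indices, TSP mining, and SVM training are not shown.

```python
from typing import List, Sequence


def count_tsp_occurrences(tokens: Sequence[int], pattern: Sequence[int], max_gap: int) -> int:
    """Count start positions from which `pattern` matches as a subsequence of
    `tokens`, with consecutive matched elements at most `max_gap` positions
    apart. The gap rule is an assumed form of the time constraint."""
    if not pattern:
        return 0

    def match_from(pos: int, k: int) -> bool:
        # tokens[pos] has been matched to pattern[k]; try to match the rest.
        if k == len(pattern) - 1:
            return True
        upper = min(len(tokens), pos + max_gap + 1)
        return any(
            tokens[nxt] == pattern[k + 1] and match_from(nxt, k + 1)
            for nxt in range(pos + 1, upper)
        )

    return sum(
        1
        for start, tok in enumerate(tokens)
        if tok == pattern[0] and match_from(start, 0)
    )


def tsp_feature_vector(
    tokens: Sequence[int], patterns: Sequence[Sequence[int]], max_gap: int
) -> List[float]:
    """Occurrence frequency of each pattern, normalized by sequence length
    (the normalization is an assumption made for this sketch)."""
    n = max(len(tokens), 1)
    return [count_tsp_occurrences(tokens, p, max_gap) / n for p in patterns]


# Hypothetical token sequence standing in for the HMM indices of one piece.
piece = [3, 2, 7, 3, 5, 2, 3, 1, 1, 2]
features = tsp_feature_vector(piece, [[3, 2], [1, 1]], max_gap=2)
```

In a full pipeline, one such vector per music piece would be the input to a support vector machine classifier; loosening `max_gap` lets a pattern span longer temporal structure than a bigram can.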

Pages: 1134-1144

Page count: 11

