Music genre classification using LBP textural features

被引:125
作者
Costa, Y. M. G. [1 ,2 ]
Oliveira, L. S. [2 ]
Koerich, A. L. [2 ,3 ]
Gouyon, F. [4 ]
Martins, J. G. [2 ,5 ]
机构
[1] State Univ Maringa UEM, BR-87020900 Maringa, PR, Brazil
[2] Fed Univ Parana UFPR, BR-81531990 Curitiba, PR, Brazil
[3] Pontifical Catholic Univ Parana PUCPR, BR-80215901 Curitiba, PR, Brazil
[4] Inst Syst & Comp Engn Porto INESC, P-4200465 Porto, Portugal
[5] Fed Technol Univ Parana UTFPR, BR-85902490 Toledo, PR, Brazil
关键词
Music genre; Texture; Image processing; Pattern recognition; SELECTION;
D O I
10.1016/j.sigpro.2012.04.023
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper we present an approach to music genre classification which converts an audio signal into spectrograms and extracts texture features from these time-frequency images which are then used for modeling music genres in a classification system. The texture features are based on Local Binary Pattern, a structural texture operator that has been successful in recent image classification research. Experiments are performed with two well-known datasets: the Latin Music Database (LMD), and the ISMIR 2004 dataset. The proposed approach takes into account some different zoning mechanisms to perform local feature extraction. Results obtained with and without local feature extraction are compared. We compare the performance of texture features with that of commonly used audio content based features (i.e. from the MARSYAS framework), and show that texture features always outperforms the audio content based features. We also compare our results with results from the literature. On the LMD, the performance of our approach reaches about 82.33%, above the best result obtained in the MIREX 2010 competition on that dataset. On the ISMIR 2004 database, the best result obtained is about 80.65%, i.e. below the best result on that dataset found in the literature. (c) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:2723 / 2737
页数:15
相关论文
共 35 条
[1]  
[Anonymous], 2001, P COST G6 C DIG AUD
[2]   Representing musical genre: A state of the art [J].
Aucouturier, JJ ;
Pachet, F .
JOURNAL OF NEW MUSIC RESEARCH, 2003, 32 (01) :83-93
[3]  
Cao C., 2009, THINKITS SUBMISSION
[4]  
Costa CHL, 2004, IEEE SYS MAN CYBERN, P562
[5]  
Costa Y. M. G., 2011, 2011 18th International Conference on Systems, Signals and Image Processing, P1
[6]  
Daniel F.Pachet., 2000, Proceedings RIAO of Content-Based Multimedia Information Access, P1238
[7]  
Flexer A., 2007, P 8 INT C MUS INF RE, P341
[8]  
French M, 2007, J ENG TECHNOL, V24, P32
[9]  
Gantz J., 2008, 20041068 USGS
[10]   Scanning the Dial: The Rapid Recognition of Music Genres [J].
Gjerdingen, Robert O. ;
Perrott, David .
JOURNAL OF NEW MUSIC RESEARCH, 2008, 37 (02) :93-100