Aggregate features and ADABOOST for music classification

被引:133
作者
Bergstra, James [1 ]
Casagrande, Norman [1 ]
Erhan, Dumitru [1 ]
Eck, Douglas [1 ]
Kegl, Balazs [1 ]
机构
[1] Univ Montreal, Dept Comp Sci, Montreal, PQ H3C 3J7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
genre classification; artist recognition; audio feature aggregation; multiclass ADABOOST; MIREX;
D O I
10.1007/s10994-006-9019-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an algorithm that predicts musical genre and artist from an audio waveform. Our method uses the ensemble learner ADABOOST to select from a set of audio features that have been extracted from segmented audio and then aggregated. Our classifier proved to be the most effective method for genre classification at the recent MIREX 2005 international contests in music information extraction, and the second-best method for recognizing artists. This paper describes our method in detail, from feature extraction to song classification, and presents an evaluation of our method on three genre databases and two artist-recognition databases. Furthermore, we present evidence collected from a variety of popular features and classifiers that the technique of classifying features aggregated over segments of audio is better than classifying either entire songs or individual short-timescale features.
引用
收藏
页码:473 / 484
页数:12
相关论文
共 29 条
  • [1] AHRENDT P, 2005, MUSIC GENRE CLASSIFI
  • [2] [Anonymous], 2000, SPEECH AUDIO SIGNAL
  • [3] [Anonymous], 2012, ROBUSTNESS AUTOMATIC
  • [4] Aucouturier J., 2002, P 3 INT C MUS INF RE
  • [5] Representing musical genre: A state of the art
    Aucouturier, JJ
    Pachet, F
    [J]. JOURNAL OF NEW MUSIC RESEARCH, 2003, 32 (01) : 83 - 93
  • [6] BELLO J, 2005, IEEE T SPEECH AUDIO
  • [7] BERGSTRA J, 2005, GENRE CLASSIFICATION
  • [8] BERGSTRA J, 2005, ARTIST RECOGNITION T
  • [9] Bishop C. M., 1996, Neural networks for pattern recognition
  • [10] Bagging predictors
    Breiman, L
    [J]. MACHINE LEARNING, 1996, 24 (02) : 123 - 140