Music genre classification using temporal domain features

被引:0
作者
Shiu, Y [1 ]
Kuo, CCJ [1 ]
机构
[1] Univ So Calif, Dept Elect Engn, Los Angeles, CA 90089 USA
来源
INTERNET MULTIMEDIA MANAGEMENT SYSTEMS V | 2004年 / 5601卷
关键词
music genre classification; music information retrieval; music database; hidden Markov model; pattern recognition;
D O I
10.1117/12.571369
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Music genre provides an efficient way to index songs in the music database, and can be used as an effective means to retrieval music of a similar type, i.e. content-based music retrieval. In addition to other features, the temporal domain features of a music signal are exploited so as to increase the classification rate in this research. Three temporal techniques are examined in depth. First, the hidden Markov model (HMM) is used to emulate the time-varying properties of music signals. Second, to further increase the classification rate, we propose another feature set that focuses on the residual part of music signals. Third, the overall classification rate is enhanced by classifying smaller segments from a test material individually and making decision via majority voting. Experimental results are given to demonstrate the performance of the proposed techniques.
引用
收藏
页码:79 / 90
页数:12
相关论文
共 8 条
  • [1] [Anonymous], 1993, FUNDAMENTAL SPEECH R
  • [2] Chai W., 2001, P INT C ART INT
  • [3] Dannenberg R. B., 1997, P INT COMP MUS C, P344
  • [4] Lambrou T, 1998, INT CONF ACOUST SPEE, P3621, DOI 10.1109/ICASSP.1998.679665
  • [5] LI T, 2003, ACM SIGIR C RES DEV, P282
  • [6] Pye D, 2000, INT CONF ACOUST SPEE, P2437, DOI 10.1109/ICASSP.2000.859334
  • [7] Soltau H, 1998, INT CONF ACOUST SPEE, P1137, DOI 10.1109/ICASSP.1998.675470
  • [8] Musical genre classification of audio signals
    Tzanetakis, G
    Cook, P
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (05): : 293 - 302