Toward intelligent music information retrieval

被引:103
|
作者
Li, Tao [1 ]
Ogihara, Mitsunori
机构
[1] Florida Int Univ, Sch Comp Sci, Miami, FL 33199 USA
[2] Univ Rochester, Dept Comp Sci, Rochester, NY 14627 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
clustering; FFT; machine learning; music information retrieval; wavelet;
D O I
10.1109/TMM.2006.870730
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient and intelligent music information retrieval is a very important topic of the 21st century. With the ultimate goal of building personal music information retrieval systems, this paper studies the problem of intelligent music information retrieval. Huron [10] points out that since the preeminent functions of music are social and psychological, the most useful characterization would be based on four types of information: genre, emotion, style, and similarity. This paper introduces Daubechies Wavelet Coefficient Histograms (DWCH) for music feature extraction for music information retrieval. The histograms are computed from the coefficients of the db(8) Daubechies wavelet filter applied to 3 s of music. A comparative study of sound features and classification algorithms on a dataset compiled by Tzanetakis shows that combining DWCH with timbral features (MFCC and FFT), with the use of multiclass extensions of support vector machine, achieves approximately 80% of accuracy, which is a significant improvement over the previously known result on this dataset. On another dataset the combination achieves 75% of accuracy. The paper also studies the issue of detecting emotion in music. Rating of two subjects in the three bipolar adjective pairs are used. The accuracy of around 70% was achieved in predicting emotional labeling in these adjective pairs. The paper also studies the problem of identifying groups of artists based on their lyrics and sound using a semi-supervised classification algorithm. Identification of artist groups based on the Similar Artist lists at All Music Guide is attempted. The semi-supervised learning algorithm resulted in nontrivial increases in the accuracy to more than 70%. Finally, the paper conducts a proof-of-concept experiment on similarity search using the feature set.
引用
收藏
页码:564 / 574
页数:11
相关论文
共 50 条