Feature dependence in the automatic identification of musical woodwind instruments

被引:76
作者
Brown, JC [1 ]
Houix, O
McAdams, S
机构
[1] Wellesley Coll, Dept Phys, Wellesley, MA 02181 USA
[2] MIT, Media Lab, Cambridge, MA 02139 USA
[3] IRCAM, CNRS, F-75004 Paris, France
关键词
D O I
10.1121/1.1342075
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The automatic identification of musical instruments is a relatively unexplored and potentially very important field for its promise to free humans from time-consuming searches on the Internet and indexing of audio material. Speaker identification techniques have been used in this paper to determine the properties (features) which are most effective in identifying a statistically significant number of sounds representing four classes of musical instruments (oboe, sax, clarinet, flute) excerpted from actual performances. Features examined include cepstral coefficients, constant-Q coefficients, spectral centroid, autocorrelation coefficients, and moments of the time wave. The number of these coefficients was varied, and in the case of cepstral coefficients, ten coefficients were sufficient for identification. Correct identifications of 79%-84% were obtained with cepstral coefficients, bin-to-bin differences of the constant-Q coefficients, and autocorrelation coefficients; the latter have not been used previously in either speaker or instrument identification work. These results depended on the training sounds chosen and the number of clusters used in the calculation. Comparison to a human perception experiment with sounds produced by the same instruments indicates that, under these conditions, computers do as well as humans in identifying woodwind instruments. (C) 2001 Acoustical Society of America.
引用
收藏
页码:1064 / 1072
页数:9
相关论文
共 31 条
[1]  
[Anonymous], 1999, DISSERTATION
[2]  
BEAUCHAMP JW, 1982, J AUDIO ENG SOC, V30, P396
[3]   SOME FACTORS IN RECOGNITION OF TIMBRE [J].
BERGER, KW .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1964, 36 (10) :1888-&
[4]  
Brown J., 1998, P INT S MUS AC, P291
[5]   Computer identification of musical instruments using pattern recognition with cepstral coefficients as features [J].
Brown, JC .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (03) :1933-1941
[6]  
BROWN JC, 1997, J ACOUST SOC AM, V101, P3167
[7]  
BROWN JC, 1998, J ACOUST SOC AM, V103, P1889
[8]  
Campbell W. C., 1978, P RES S PSYCH AC MUS, P30
[9]  
Clark M., 1964, J. Audio Eng. Soc, V12, P28
[10]   Polyspectra as measures of sound texture and timbre [J].
Dubnov, S ;
Tishby, N ;
Cohen, D .
JOURNAL OF NEW MUSIC RESEARCH, 1997, 26 (04) :277-314