Dynamic Spectral Envelope Modeling for Timbre Analysis of Musical Instrument Sounds

被引:21
作者
Burred, Juan Jose [1 ]
Roebel, Axel [1 ]
Sikora, Thomas [2 ]
机构
[1] IRCAM CNRS STMS, Anal Synth Team, F-75004 Paris, France
[2] Tech Univ Berlin, Commun Syst Grp, D-10587 Berlin, Germany
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2010年 / 18卷 / 03期
关键词
Gaussian processes; music information retrieval (MIR); sinusoidal modeling; spectral envelope; timbre model; CLASSIFICATION;
D O I
10.1109/TASL.2009.2036300
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present a computational model of musical instrument sounds that focuses on capturing the dynamic behavior of the spectral envelope. A set of spectro-temporal envelopes belonging to different notes of each instrument are extracted by means of sinusoidal modeling and subsequent frequency interpolation, before being subjected to principal component analysis. The prototypical evolution of the envelopes in the obtained reduced-dimensional space is modeled as a nonstationary Gaussian Process. This results in a compact representation in the form of a set of prototype curves in feature space, or equivalently of prototype spectro-temporal envelopes in the time-frequency domain. Finally, the obtained models are successfully evaluated in the context of two music content analysis tasks: classification of instrument samples and detection of instruments in monaural polyphonic mixtures.
引用
收藏
页码:663 / 674
页数:12
相关论文
共 27 条
[21]  
Kitahara T, 2003, 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS, P421
[22]   Musical instrument classification and duet analysis employing music information retrieval techniques [J].
Kostek, B .
PROCEEDINGS OF THE IEEE, 2004, 92 (04) :712-729
[23]   Instrument-specific harmonic atoms for mid-level music representation [J].
Leveau, Pierre ;
Vincent, Emmanuel ;
Richard, Gaeel ;
Daudet, Laurent .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (01) :116-128
[24]  
LIVSHIN A, 2006, P INT C MUS INF RETR
[25]  
LOUREIRO MA, 2004, P INT C MUS INF RETR
[26]  
Sandell GJ, 1995, J AUDIO ENG SOC, V43, P1013
[27]  
Schouten J.F., 1968, Reports of the 6th International Congress on Acoustics, V6, P35