Musical Instrument Sound Multi-Excitation Model for Non-Negative Spectrogram Factorization

被引:33
作者
Carabias-Orti, J. J. [1 ]
Virtanen, T. [2 ]
Vera-Candeas, P. [1 ]
Ruiz-Reyes, N. [1 ]
Canadas-Quesada, F. J. [1 ]
机构
[1] Univ Jaen, Telecommun Engn Dept, Jaen 23700, Spain
[2] Tampere Univ Technol, Dept Signal Proc, FI-33101 Tampere, Finland
关键词
Automatic music transcription; excitation-filter model; excitation modeling; non-negative matrix factorization (NMF); source-filter model; spectral analysis; MATRIX FACTORIZATION; AUTOMATIC TRANSCRIPTION; SOURCE SEPARATION; SMOOTHNESS; MELODY;
D O I
10.1109/JSTSP.2011.2159700
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents theoretical and experimental results about constrained non-negative matrix factorization (NMF) to model the excitation of the musical instruments. These excitations represent vibrating objects, while the filter represents the resonance structure of the instrument, which colors the produced sound. We propose to model the excitations as the weighted sum of harmonically constrained basis functions, whose parameters are tied across different pitches of an instrument. An NMF-based framework is used to learn the model parameters. We assume that the excitations of a well-tempered instrument should possess an identifiable characteristic structure whereas the conditions of the music scene might produce variations in the filter. In order to test the reliability of our proposal, we evaluate our method for a music transcription task in two scenarios. On the first one, comparison with state-of-the-art methods has been performed over a dataset of piano recordings obtaining more accurate results than other NMF-based algorithms. On the second one, two woodwind instrument databases have been used to demonstrate the benefits of our model in comparison with previous excitation-filter model approaches.
引用
收藏
页码:1144 / 1158
页数:15
相关论文
共 46 条
[11]   Multipitch Estimation of Piano Sounds Using a New Probabilistic Spectral Smoothness Principle [J].
Emiya, Valentin ;
Badeau, Roland ;
David, Bertrand .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06) :1643-1654
[12]   Separation of synchronous pitched notes by spectral filtering of harmonics [J].
Every, Mark R. ;
Szymanski, John E. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (05) :1845-1856
[13]   Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis [J].
Fevotte, Cedric ;
Bertin, Nancy ;
Durrieu, Jean-Louis .
NEURAL COMPUTATION, 2009, 21 (03) :793-830
[14]  
FITZGERALD D, 2008, COMPUTATIONAL INTELL
[15]  
Fletcher NevilleH., 1998, PHYS MUSICAL INSTRUM, V2d
[16]  
Goto M., 2004, P 18 INT C AC ICA 20, V1, P553
[17]  
Goto M, 2002, P 3 INT SOC MUS INF
[18]  
HEITTOLA T, 2009, P 10 INT SOC MUS INF
[19]   Sinusoidal modeling using psychoacoustic-adaptive matching pursuits [J].
Heusdens, R ;
Vafin, R ;
Kleijn, WB .
IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (08) :262-265
[20]  
Hogg R.V., 1989, Introduction To Mathematical Statistics