NMF-based Multiple Pitch Estimation Using Sparseness and Inter-frame Continuity Constraints

被引:0
作者
Fujisawa, Takanori [1 ]
Degawa, Ikuo [1 ]
Ikehara, Masaaki [1 ]
机构
[1] Keio Univ, EEE Dept, Yokohama, Kanagawa 2238522, Japan
来源
2014 IEEE 16TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP) | 2014年
关键词
NONNEGATIVE MATRIX FACTORIZATION; DIVERGENCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes NMF-based (non-negative matrix factorization) multiple pitch estimation algorithm. The approach of NMF-based multiple pitch estimation is to decompose input magnitude spectrogram into sum of basis spectra representing individual pitches. In decomposing music signals, the amplitude of basis spectra should have sparseness, and the shape of amplitude should be continuous between neighbor temporal frames. We introduce the constraint using matrix norm to enforce these characteristic at once and propose new NMF algorithm for spectral decomposition with this constraint. The evaluation of solo piano music shows this algorithm can implement more robust pitch estimation in the place which input spectrum has certain different shape from basis spectra or the shape of input spectrum has temporal change.
引用
收藏
页数:5
相关论文
共 10 条
[1]   Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis [J].
Fevotte, Cedric ;
Bertin, Nancy ;
Durrieu, Jean-Louis .
NEURAL COMPUTATION, 2009, 21 (03) :793-830
[2]  
Hoyer PO, 2002, NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, P557, DOI 10.1109/NNSP.2002.1030067
[3]   A generalized divergence measure for nonnegative matrix factorization [J].
Kompass, Raul .
NEURAL COMPUTATION, 2007, 19 (03) :780-791
[4]   Non-local Sparse Models for Image Restoration [J].
Mairal, Julien ;
Bach, Francis ;
Ponce, Jean ;
Sapiro, Guillermo ;
Zisserman, Andrew .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :2272-2279
[5]  
Nishimura T., 2002, ISMIR, P287
[6]   Multiple fundamental frequency estimation using Gaussian smoothness [J].
Pertusa, Antonio ;
Inesta, Jose M. .
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :105-108
[7]   A discriminative model for polyphonic piano transcription [J].
Poliner, Graham E. ;
Ellis, Daniel P. W. .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)
[8]   A computationally efficient multipitch analysis model [J].
Tolonen, T ;
Karjalainen, M .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (06) :708-716
[9]   Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation [J].
Vincent, Emmanuel ;
Bertin, Nancy ;
Badeau, Roland .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03) :528-537
[10]   Monaural sound source separation by nonnegative matrix factorization with tempora continuity and sparseness criteria [J].
Virtanen, Tuomas .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03) :1066-1074