NMF-based Multiple Pitch Estimation Using Sparseness and Inter-frame Continuity Constraints

被引:0
作者
Fujisawa, Takanori [1 ]
Degawa, Ikuo [1 ]
Ikehara, Masaaki [1 ]
机构
[1] Keio Univ, EEE Dept, Yokohama, Kanagawa 2238522, Japan
来源
2014 IEEE 16TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP) | 2014年
关键词
NONNEGATIVE MATRIX FACTORIZATION; DIVERGENCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes NMF-based (non-negative matrix factorization) multiple pitch estimation algorithm. The approach of NMF-based multiple pitch estimation is to decompose input magnitude spectrogram into sum of basis spectra representing individual pitches. In decomposing music signals, the amplitude of basis spectra should have sparseness, and the shape of amplitude should be continuous between neighbor temporal frames. We introduce the constraint using matrix norm to enforce these characteristic at once and propose new NMF algorithm for spectral decomposition with this constraint. The evaluation of solo piano music shows this algorithm can implement more robust pitch estimation in the place which input spectrum has certain different shape from basis spectra or the shape of input spectrum has temporal change.
引用
收藏
页数:5
相关论文
共 10 条
  • [1] Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis
    Fevotte, Cedric
    Bertin, Nancy
    Durrieu, Jean-Louis
    [J]. NEURAL COMPUTATION, 2009, 21 (03) : 793 - 830
  • [2] Hoyer PO, 2002, NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, P557, DOI 10.1109/NNSP.2002.1030067
  • [3] A generalized divergence measure for nonnegative matrix factorization
    Kompass, Raul
    [J]. NEURAL COMPUTATION, 2007, 19 (03) : 780 - 791
  • [4] Non-local Sparse Models for Image Restoration
    Mairal, Julien
    Bach, Francis
    Ponce, Jean
    Sapiro, Guillermo
    Zisserman, Andrew
    [J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 2272 - 2279
  • [5] Nishimura T., 2002, ISMIR, P287
  • [6] Multiple fundamental frequency estimation using Gaussian smoothness
    Pertusa, Antonio
    Inesta, Jose M.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 105 - 108
  • [7] A discriminative model for polyphonic piano transcription
    Poliner, Graham E.
    Ellis, Daniel P. W.
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)
  • [8] A computationally efficient multipitch analysis model
    Tolonen, T
    Karjalainen, M
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (06): : 708 - 716
  • [9] Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation
    Vincent, Emmanuel
    Bertin, Nancy
    Badeau, Roland
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 528 - 537
  • [10] Monaural sound source separation by nonnegative matrix factorization with tempora continuity and sparseness criteria
    Virtanen, Tuomas
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 1066 - 1074