Multipitch tracking based on linear programming relaxation and sparsity-based pitch candidate estimation

被引:0
作者
Huang, Feng [1 ]
Lee, Tan
机构
[1] Chinese Univ Hong Kong, Dept Elect Engn, Shatin, Hong Kong, Peoples R China
来源
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2014年
关键词
Robust pitch estimation; multipitch tracking; linear programming; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a linear programming approach for tracking fundamental frequencies in acoustic signal that contains multiple speech sources and noise interference. A sparsity-based pitch estimation method is used to obtain pitch candidates for each signal frame. With conventional methods like exhaustive searching, the computational complexity of multipitch tracking grows exponentially with the number of pitch tracks. We propose to use a linear programming relaxation approach to solve the multiple pitch track searching problem. This approach has low computational complexity while it was found to attain global optimal solution with high probability. Experimental results show that the proposed algorithm is more efficient and more accurate than the conventional tracking method, extended dynamic programming.
引用
收藏
页码:331 / +
页数:3
相关论文
共 22 条
  • [1] [Anonymous], 1983, PITCH DETERMINATION, DOI DOI 10.1007/978-3-642-81926-1
  • [2] An audio-visual corpus for speech perception and automatic speech recognition (L)
    Cooke, Martin
    Barker, Jon
    Cunningham, Stuart
    Shao, Xu
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (05) : 2421 - 2424
  • [3] Every M. R., 2006, P INT 2006
  • [4] Gu Y. H., 1992, ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech and Signal Processing (Cat. No.92CH3103-9), P21, DOI 10.1109/ICASSP.1992.226130
  • [5] Guo L, 2012, PROCEEDINGS OF THE 2012 INTERNATIONAL WORKSHOP ON METAMATERIALS (META)
  • [6] Heckmann Martin., 2007, Proceedings of Interspeech, P2765
  • [7] Huang F, 2013, INT CONF ACOUST SPEE, P6807, DOI 10.1109/ICASSP.2013.6638980
  • [8] Huang F, 2012, INT CONF ACOUST SPEE, P4601, DOI 10.1109/ICASSP.2012.6288943
  • [9] Pitch Estimation in Noisy Speech Using Accumulated Peak Spectrum and Sparse Estimation Technique
    Huang, Feng
    Lee, Tan
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (01): : 97 - 107
  • [10] Huang F, 2010, 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, P641