Maximum A-posteriori probability pitch tracking in noisy environments using harmonic model

被引:67
作者
Tabrikian, J [1 ]
Dubnov, S [1 ]
Dickalov, Y [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Elect & Comp Engn, IL-84105 Beer Sheva, Israel
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2004年 / 12卷 / 01期
基金
以色列科学基金会;
关键词
Cramer-Rao bound; harmonic model; MAP estimator; Markov model; maximum likelihood; noisy speech; PDA; pitch detection; pitch tracking; speech denoising;
D O I
10.1109/TSA.2003.819950
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Modern speech processing applications require operation on signal of interest that is contaminated by high level of noise. This situation calls for a greater robustness in estimation of the speech parameters, a task which is hard to achieve using standard speech models. In this paper, we present an optimal estimation procedure for sound signals (such as speech) that are modeled by harmonic sources. The harmonic model achieves more robust and accurate estimation of voiced speech parameters. Using maximum a posteriori probability framework, successful tracking of pitch parameters is possible in ultra low signal to noise conditions (as low as -15 dB). The performance of the method is evaluated using the Keele pitch detection database with realistic background noise. The results show best performance in comparison to other state-of-the-art pitch detectors. Application of the proposed algorithm in a simple speaker identification system shows significant improvement in the performance.
引用
收藏
页码:76 / 87
页数:12
相关论文
共 33 条