A Simple Continuous Pitch Estimation Algorithm

被引:33
作者
Garner, Philip N. [1 ]
Cernak, Milos [1 ]
Motlicek, Petr [1 ]
机构
[1] Idiap Res Inst, Martigny, Switzerland
关键词
Kalman smoother; pitch estimation; speech coding; speech parameterization; FUNDAMENTAL-FREQUENCY; SPEECH; HMM;
D O I
10.1109/LSP.2012.2231675
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recent work in text to speech synthesis has pointed to the benefit of using a continuous pitch estimate; that is, one that records pitch even when voicing is not present. Such an approach typically requires interpolation. The purpose of this letter is to show that a continuous pitch estimation is available from a combination of otherwise well known techniques. Further, in the case of an autocorrelation based estimate, the continuous requirement negates the need for other heuristics to correct for common errors. An algorithm is suggested, illustrated, and demonstrated using a parametric vocoder.
引用
收藏
页码:102 / 105
页数:4
相关论文
共 17 条
[1]  
[Anonymous], 2009, SYNTHESIS LECT SPEEC
[2]  
Boersma P., 1993, Institute of Phonetic Sciences, University of Amsterdam, Proceedings 17 (1993) 97-110, P97
[3]   YIN, a fundamental frequency estimator for speech and music [J].
de Cheveigné, A ;
Kawahara, H .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 111 (04) :1917-1930
[4]  
Freij G. J., 1988, ICASSP 88: 1988 International Conference on Acoustics, Speech, and Signal Processing (Cat. No.88CH2561-9), P135, DOI 10.1109/ICASSP.1988.196530
[5]  
Garner PN, 1998, INT CONF ACOUST SPEE, P1, DOI 10.1109/ICASSP.1998.674352
[6]   MODELING INTONATION CONTOURS AT THE PHRASE LEVEL USING CONTINUOUS DENSITY HIDDEN MARKOV-MODELS [J].
JENSEN, U ;
MOORE, RK ;
DALSGAARD, P ;
LINDBERG, B .
COMPUTER SPEECH AND LANGUAGE, 1994, 8 (03) :247-260
[7]   Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction:: Possible role of a repetitive structure in sounds [J].
Kawahara, H ;
Masuda-Katsuse, I ;
de Cheveigné, A .
SPEECH COMMUNICATION, 1999, 27 (3-4) :187-207
[8]  
Kawahara H., 1999, P EUROSPEECH BUD HUN
[9]  
Latorre J, 2011, INT CONF ACOUST SPEE, P4724
[10]  
Nielsen JK, 2012, INT CONF ACOUST SPEE, P4617, DOI 10.1109/ICASSP.2012.6288947