Estimation of the instantaneous pitch of speech

Cited by: 30
Authors
Resch, Barbara [1 ]
Nilsson, Mattias [1 ]
Ekman, Anders [1 ]
Kleijn, W. Bastiaan [1 ]
Affiliations
[1] Royal Inst Technol, Sound & Image Proc Lab, KTH, S-10044 Stockholm, Sweden
Keywords
instantaneous pitch; pitch estimation; pitch-synchronous processing; splines
DOI
10.1109/TASL.2006.885242
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Accurate pitch estimation is essential for many speech processing applications, such as speech synthesis, speech coding, and speech enhancement. Most common pitch estimation methods assume that the pitch is constant over a short segment. This assumption does not hold for real speech and leads to inaccurate pitch estimates. In this paper, we present a method for continuous pitch estimation that can track fast pitch changes. In this framework, the pitch is modeled by a B-spline expansion and optimized in a multistage procedure for increased robustness. The continuous optimization procedure is compared with state-of-the-art pitch estimation methods and is evaluated on both artificial speech-like signals with known pitch and real speech signals. The experimental results show that our method estimates the pitch more accurately than state-of-the-art methods.
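Illustrative note: the abstract's idea of modeling the pitch by a B-spline expansion can be sketched as below. This is a minimal sketch, not the authors' implementation. The cubic degree, the 10 ms knot spacing, the linear pitch glide, and the names `bspline_basis` and `pitch_track` are all assumptions made for illustration, and the multistage optimization that fits the coefficients to a speech signal is not shown.

```python
def bspline_basis(k, degree, knots, t):
    """Cox-de Boor recursion: value at t of the k-th B-spline of the given degree."""
    if degree == 0:
        return 1.0 if knots[k] <= t < knots[k + 1] else 0.0
    left = right = 0.0
    d1 = knots[k + degree] - knots[k]
    if d1 > 0:
        left = (t - knots[k]) / d1 * bspline_basis(k, degree - 1, knots, t)
    d2 = knots[k + degree + 1] - knots[k + 1]
    if d2 > 0:
        right = ((knots[k + degree + 1] - t) / d2
                 * bspline_basis(k + 1, degree - 1, knots, t))
    return left + right


def pitch_track(coeffs, knots, degree, t):
    """Pitch in Hz at time t, modeled as the expansion sum_k c_k * B_k(t)."""
    return sum(c * bspline_basis(k, degree, knots, t)
               for k, c in enumerate(coeffs))


# Hypothetical setup: cubic B-splines on uniform knots spaced 10 ms apart.
degree = 3
spacing = 0.010  # seconds (assumed value)
num_coeffs = 8
knots = [(i - degree) * spacing for i in range(num_coeffs + degree + 1)]
# The expansion is fully supported on [knots[degree], knots[num_coeffs]),
# which is [0 s, 0.05 s) here.

# Assumed target: a fast linear glide from 100 Hz, rising at 2000 Hz/s.
# On uniform knots, sampling a linear function at the centre knot of each
# cubic basis function (knots[k + 2]) reproduces that function exactly.
coeffs = [100.0 + 2000.0 * knots[k + 2] for k in range(num_coeffs)]

for t in (0.005, 0.025, 0.045):
    print(f"t = {t * 1000:.0f} ms, pitch = {pitch_track(coeffs, knots, degree, t):.2f} Hz")
```

Because the pitch is a smooth function of time rather than one value per analysis frame, a model of this kind can follow a glide within a segment, which a piecewise-constant assumption cannot.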
Pages: 813 - 822
Number of pages: 10
Related papers
50 in total
  • [31] Nonlinear estimation of DEGG signals with applications to speech pitch detection
    Barner, KE
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2243 - 2246
  • [32] Improving the harmonic structure of speech spectrum for robust pitch estimation
    Chowdhury, Husne Ara
    Rahman, Mohammad Shahidur
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2025, 46 (01) : 34 - 37
  • [33] Improving pitch estimation for efficient multiband excitation coding of speech
    Chan, CF
    Yu, EWM
    ELECTRONICS LETTERS, 1996, 32 (10) : 870 - 872
  • [34] Improving pitch estimation for efficient multiband excitation coding of speech
    City Univ of Hong Kong, Kowloon, Hong Kong
    Electron Lett, (10) : 870 - 872
  • [35] Pitch Estimation of Marathi Spoken Numbers in Various Speech Signals
    Nimbhore, S. S.
    Ramteke, G. D.
    Ramteke, R. J.
    2013 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2013, : 405 - 409
  • [36] Fully-Convolutional Network for Pitch Estimation of Speech Signals
    Ardaillon, Luc
    Roebel, Axel
    INTERSPEECH 2019, 2019, : 2005 - 2009
  • [37] LACOPE: Latency-Constrained Pitch Estimation for Speech Enhancement
    Schroeter, Hendrik
    Rosenkranz, Tobias
    Escalante-B, Alberto N.
    Maier, Andreas
    INTERSPEECH 2021, 2021, : 656 - 660
  • [38] Robust Speech/Non-Speech Discrimination Based on Pitch Estimation for Mobile Robots
    Grondin, Francois
    Michaud, Francois
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 1650 - 1655
  • [39] A Method for Pitch Estimation from Noisy Speech Signals Based on a Pitch-Harmonic Extraction
    Shahnaz, C.
    Zhu, W. -P.
    Ahmad, M. O.
    2008 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 120 - 123
  • [40] AN INVESTIGATION INTO INSTANTANEOUS FREQUENCY ESTIMATION METHODS FOR IMPROVED SPEECH RECOGNITION FEATURES
    Nayak, Shekhar
    Bhati, Saurabhchand
    Murty, K. Sri Rama
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 363 - 367