Estimation of the instantaneous pitch of speech

Cited by: 30
Authors
Resch, Barbara [1]
Nilsson, Mattias [1]
Ekman, Anders [1]
Kleijn, W. Bastiaan [1]
Affiliation
[1] Royal Inst Technol, Sound & Image Proc Lab, KTH, S-10044 Stockholm, Sweden
Keywords
instantaneous pitch; pitch estimation; pitch-synchronous processing; splines
DOI
10.1109/TASL.2006.885242
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
An accurate estimation of the pitch is essential for many speech processing applications, such as speech synthesis, speech coding, and speech enhancement. A widely used assumption in most common pitch estimation methods is that pitch is constant over a segment of short duration. This assumption does not apply in reality and leads to inaccurate pitch estimates. In this paper, we present a method for continuous pitch estimation that is able to track fast changes. In the presented framework, the pitch is modeled by a B-spline expansion and optimized in a multistage procedure for increased robustness. The performance of the continuous optimization procedure is compared to state-of-the-art pitch estimation methods and is evaluated both for artificial speech-like signals with known pitch, and for real speech signals. The results of the experiments show that our method leads to a higher accuracy of the estimate of the pitch than state-of-the-art methods.
Pages: 813-822
Page count: 10