Pitch estimation using models of voiced speech on three levels

被引:0
|
作者
Joho, Dominik [1 ]
Bennewitz, Maren [1 ]
Behnke, Sven [1 ]
机构
[1] Univ Freiburg, Dept Comp Sci, D-7800 Freiburg, Germany
关键词
pitch estimation; speech analysis; matrix decomposition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present an algorithm for estimating the fundamental frequency in speech signals. Our approach incorporates models of voiced speech on three levels. First, we estimate the pitch for each time frame based on its harmonic structure using non-negative matrix factorization. The second level utilizes temporal pitch continuity to extract partial pitch contours. Thirdly, we incorporate statistics of the succession of voiced segments to aggregate partial contours to the final contour of an utterance. We evaluate our approach on the Keele database. The experimental results show the robustness of our method for noisy speech, and the good performance for clean speech in comparison with state-of-the-art algorithms.
引用
收藏
页码:1077 / +
页数:2
相关论文
共 50 条
  • [1] A Tandem Algorithm for Pitch Estimation and Voiced Speech Segregation
    Hu, Guoning
    Wang, DeLiang
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (08): : 2067 - 2079
  • [2] ESTIMATING THE PITCH PERIOD OF VOICED SPEECH
    AMBIKAIRAJAH, E
    CAREY, MJ
    TATTERSALL, G
    ELECTRONICS LETTERS, 1980, 16 (12) : 464 - 466
  • [3] Cochannel Speech Separation Using Multi-pitch Estimation and Model Based Voiced Sequential Grouping
    Li, Ming
    Cao, Chuan
    Wang, Di
    Lu, Ping
    Fu, Qiang
    Yan, Yonghong
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 151 - 154
  • [4] PITCH SYNCHRONOUS SPECTRAL REPRESENTATIONS OF VOICED SPEECH
    MILLER, JE
    MATHEWS, MV
    DAVID, EE
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1960, 32 (07): : 913 - 914
  • [5] Using Gaussian processes to synthesise voiced speech with natural pitch variations
    McLaughlin, S
    Leith, D
    Mann, L
    DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 321 - 324
  • [6] Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech
    Jackson, PJB
    Shadle, CH
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (07): : 713 - 726
  • [7] HMM-based speech enhancement using pitch period information in voiced speech segments
    Oberle, S
    Kaelin, A
    ISCAS '97 - PROCEEDINGS OF 1997 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I - IV: CIRCUITS AND SYSTEMS IN THE INFORMATION AGE, 1997, : 2645 - 2648
  • [8] Real-time pitch extraction of voiced speech
    George, DE
    Salari, E
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 1997, 20 (04) : 379 - 387
  • [9] Real-time pitch extraction of voiced speech
    Dept of Physics and Astronomy, University of Toledo, Toledo, OH 43606-3390, United States
    不详
    J Network Comput Appl, 4 (379-387):
  • [10] Extraction of voiced regions of speech from emotional speech signals using wavelet-pitch method
    Dendukuri L.S.
    Hussain S.J.
    Periodica polytechnica Electrical engineering and computer science, 2021, 65 (03): : 262 - 278