Pitch estimation using models of voiced speech on three levels

被引：0

作者：

Joho, Dominik ^{[1
]}

Bennewitz, Maren ^{[1
]}

Behnke, Sven ^{[1
]}

机构：

[1] Univ Freiburg, Dept Comp Sci, D-7800 Freiburg, Germany

来源：

2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3 | 2007年

关键词：

pitch estimation; speech analysis; matrix decomposition;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We present an algorithm for estimating the fundamental frequency in speech signals. Our approach incorporates models of voiced speech on three levels. First, we estimate the pitch for each time frame based on its harmonic structure using non-negative matrix factorization. The second level utilizes temporal pitch continuity to extract partial pitch contours. Thirdly, we incorporate statistics of the succession of voiced segments to aggregate partial contours to the final contour of an utterance. We evaluate our approach on the Keele database. The experimental results show the robustness of our method for noisy speech, and the good performance for clean speech in comparison with state-of-the-art algorithms.

引用

页码：1077 / +

页数：2

共 50 条

[1] A Tandem Algorithm for Pitch Estimation and Voiced Speech Segregation
Hu, Guoning
Wang, DeLiang
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (08): : 2067 - 2079
[2] ESTIMATING THE PITCH PERIOD OF VOICED SPEECH
AMBIKAIRAJAH, E
CAREY, MJ
TATTERSALL, G
ELECTRONICS LETTERS, 1980, 16 (12) : 464 - 466
[3] Cochannel Speech Separation Using Multi-pitch Estimation and Model Based Voiced Sequential Grouping
Li, Ming
Cao, Chuan
Wang, Di
Lu, Ping
Fu, Qiang
Yan, Yonghong
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 151 - 154
[4] PITCH SYNCHRONOUS SPECTRAL REPRESENTATIONS OF VOICED SPEECH
MILLER, JE
MATHEWS, MV
DAVID, EE
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1960, 32 (07): : 913 - 914
[5] Using Gaussian processes to synthesise voiced speech with natural pitch variations
McLaughlin, S
Leith, D
Mann, L
DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 321 - 324
[6] Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech
Jackson, PJB
Shadle, CH
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (07): : 713 - 726
[7] HMM-based speech enhancement using pitch period information in voiced speech segments
Oberle, S
Kaelin, A
ISCAS '97 - PROCEEDINGS OF 1997 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I - IV: CIRCUITS AND SYSTEMS IN THE INFORMATION AGE, 1997, : 2645 - 2648
[8] Real-time pitch extraction of voiced speech
George, DE
Salari, E
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 1997, 20 (04) : 379 - 387
[9] Real-time pitch extraction of voiced speech
Dept of Physics and Astronomy, University of Toledo, Toledo, OH 43606-3390, United States
不详
J Network Comput Appl, 4 (379-387):
[10] Extraction of voiced regions of speech from emotional speech signals using wavelet-pitch method
Dendukuri L.S.
Hussain S.J.
Periodica polytechnica Electrical engineering and computer science, 2021, 65 (03): : 262 - 278

← 1 2 3 4 5 →