Speech production knowledge in automatic speech recognition

Cited by: 130

Authors:

King, Simon

Frankel, Joe

Livescu, Karen

McDermott, Erik

Richmond, Korin

Wester, Mirjam

Affiliations:

[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9LW, Midlothian, Scotland

[2] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA

[3] NTT Corp, Commun Sci Labs, Kyoto 6190237, Japan

Source:

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2007 / Vol. 121 / No. 2

Funding:

UK Engineering and Physical Sciences Research Council (EPSRC)

DOI:

10.1121/1.2404622

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Although much is known about how speech is produced, and research into speech production has resulted in measured articulatory data, feature systems of different kinds, and numerous models, speech production knowledge is almost totally ignored in current mainstream approaches to automatic speech recognition. Representations of speech production allow simple explanations for many phenomena observed in speech which cannot be easily analyzed from either acoustic signal or phonetic transcription alone. In this article, a survey of a growing body of work in which such representations are used to improve automatic speech recognition is provided. © 2007 Acoustical Society of America.

Pages: 723-742

Page count: 20

