Updated MINDS Report on Speech Recognition and Understanding, Part 2

被引：28

作者：

Baker, Janet M. ^{[1
]}

Deng, Li ^{[2
,3
]}

Khudanpur, Sanjeev ^{[4
]}

Lee, Chin-Hui ^{[5
,6
]}

Glass, James R. ^{[7
]}

Morgan, Nelson ^{[8
,9
]}

O'Shaughnessy, Douglas ^{[10
]}

机构：

[1] Saras Inst, W Newton, MA USA

[2] Univ Washington, Seattle, WA 98195 USA

[3] Microsoft Res, Redmond, WA USA

[4] Johns Hopkins Univ, GWC Whiting Sch Engn, Baltimore, MD USA

[5] Georgia Inst Technol, Sch ECE, Atlanta, GA 30332 USA

[6] Bell Labs, Murray Hill, NJ 07974 USA

[7] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA

[8] Univ Calif Berkeley, ICSI, Res Lab, Berkeley, CA 94720 USA

[9] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA

[10] Univ Quebec, INRS EMT, Ste Foy, PQ G1V 2M3, Canada

来源：

IEEE SIGNAL PROCESSING MAGAZINE | 2009年 / 26卷 / 04期

关键词：

BRAIN ACTIVITY; CONSTRAINTS; MODELS; WORDS;

D O I：

10.1109/MSP.2009.932707

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The second part of the updated version of "MINDS 2006-2007 Report of the Speech Understanding Working Group" is presented which came from two workshops entitled "Meeting of the MINDS: Future Directions for Human Language Technology". The specific topics being discussed include: the fundamental science of human speech perception and production; transcription to meaning extraction; understanding the cortical speech/language processing; the heterogeneous knowledge sources for automatic speech recognition; the information-bearing elements of the speech signal; the novel computational architectures for knowledge-rich speech recognition; the adaptation and self-learning in speech recognition systems; the robustness and context-awareness in acoustic models for speech recognition; the speaker's acoustic environment and the speech acquisition channel; the speaker characteristics and style; the language characteristics; robust speech recognition in everyday environments; and finally, the novel search procedures for knowledge-rich speech recognition.

引用

页码：78 / 85

页数：8

共 68 条

[1] Anastasakos T, 1997, INT CONF ACOUST SPEE, P1043, DOI 10.1109/ICASSP.1997.596119
[2] [Anonymous], 1997, Statistical methods for speech recognition
[3] [Anonymous], THESIS U PENNSYLVANI
[4] [Anonymous], 1997, The discovery of spoken language
[5] [Anonymous], P ICSLP
[6] [Anonymous], THESIS MIT CAMBRIDGE
[7] [Anonymous], 2006, Tech. rep.
[8] Aradilla G., 2005, Proc. Eurospeech, P3333
[9] Axelrod S, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, P173
[10] Bahl L., 1986, INT C ACOUSTICS SPEE, P49

← 1 2 3 4 5 6 7 →