Research Developments and Directions in Speech Recognition and Understanding, Part 1

被引:120
作者
Baker, Janet M. [1 ,2 ]
Deng, Li [3 ,4 ]
Glass, James [5 ]
Khudanpur, Sanjeev [6 ]
Lee, Chin-Hui [7 ,8 ]
Morgan, Nelson [9 ,10 ]
O'Shaughnessy, Douglas [11 ]
机构
[1] Dragon Syst, W Newton, MA USA
[2] Saras Inst, W Newton, MA USA
[3] Microsoft Res, Redmond, WA USA
[4] Univ Washington, Seattle, WA 98195 USA
[5] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[6] Johns Hopkins Univ, GWC Whiting Sch Engn, Baltimore, MD 21218 USA
[7] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[8] Bell Labs, Murray Hill, NJ 07974 USA
[9] Univ Calif Berkeley, Independent Nonprofit Res Lab, Berkeley, CA 94720 USA
[10] Univ Calif Berkeley, Dept EECS, Berkeley, CA 94720 USA
[11] Univ Quebec, INRS EMT, Ste Foy, PQ G1V 2M3, Canada
关键词
MAXIMUM-LIKELIHOOD;
D O I
10.1109/MSP.2009.932166
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A review with regards to the major developments in automatic speech recognition (ASR) in five areas has been given. These five areas are infrastructure, knowledge representation, models and algorithms, search, and metadata. Meanwhile, other topics have also been discussed which includes: everyday audio, rapid portability, self-adaptive language capabilities, cognition-derived speech and language systems, and spoken-language comprehension.
引用
收藏
页码:75 / 80
页数:6
相关论文
共 51 条
[1]  
[Anonymous], 1997, Statistical methods for speech recognition
[2]  
[Anonymous], SPRINGER TEXTS ELECT
[3]  
[Anonymous], P ICSLP
[4]  
[Anonymous], 2000, Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition
[5]   Estimating Hidden Markov Model Parameters So As To Maximize Speech Recognition Accuracy [J].
Bahl, Lalit R. ;
Brown, Peter F. ;
de Souza, Peter V. ;
Mercer, Robert L. .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1993, 1 (01) :77-83
[6]  
Baker J.K., 1975, Speech Recognition
[7]  
BAKER JK, 2006, P INT C UN DIG LIB A
[8]  
Baum L. E., 1972, Inequalities, V3, P1
[9]  
BEAUFAYS F, 2002, HDB BRAIN THEORY NEU
[10]  
Bellman R. E., 1957, Dynamic programming. Princeton landmarks in mathematics