JOINT LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING

Cited by: 0

Authors:

Bayer, Ali Orkan [1]

Riccardi, Giuseppe [1]

Affiliations:

[1] Univ Trento, Signals & Interact Syst Lab, Trento, Italy

Source:

2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012) | 2012

Keywords:

Spoken Language Understanding; Automatic Speech Recognition; Language Modeling; Recurrent Neural Networks

DOI:

Not available

Chinese Library Classification:

TM [Electrical Engineering]; TN [Electronics and Communication Technology]

Subject classification codes:

0808; 0809

Abstract:

Language models (LMs) are one of the main knowledge sources used by automatic speech recognition (ASR) and spoken language understanding (SLU) systems. In ASR systems, they are optimized to decode words from speech for a transcription task. In SLU systems, they are optimized to map words into concept constructs or interpretation representations. Performance optimization is generally designed independently for ASR and SLU models, in terms of word accuracy and concept accuracy respectively. However, the best word accuracy does not always yield the best understanding performance. In this paper, we investigate how LMs originally trained to maximize word accuracy can be parametrized to account for speech understanding constraints and maximize concept accuracy. An incremental reduction in concept error rate is observed when an LM is trained on word-to-concept mappings. We show how to optimize the joint transcription and understanding task performance in the lexical-semantic relation space.

Pages: 199-203

Page count: 5

11 entries in total

[1] A neural probabilistic language model [J]. Bengio, Y; Ducharme, R; Vincent, P; Jauvin, C. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06): 1137-1155

[2] Bisani M., 2004, P ICASSP

[3] Dinarelli M., 2009, P SRSL 2009 WORKSH E

[4] Mikolov T, 2010, 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, P1045

[5] Mikolov T, 2011, INT CONF ACOUST SPEE, P5528

[6] Raymond C., 2007, INTERSPEECH

[7] Riccardi G., 1998, P ICSLP

[8] Continuous space language models [J]. Schwenk, Holger. COMPUTER SPEECH AND LANGUAGE, 2007, 21 (03): 492-518

[9] Is word error rate a good indicator for spoken language understanding accuracy [J]. Wang, YY; Acero, A; Chelba, C. ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003: 577-582

[10] Yeh Alexander., 2000, Proceedings of the 18th conference on Computational linguistics - Volume 2, COLING '00, V2, P947