A unified framework for translation and understanding allowing discriminative joint decoding for multilingual speech semantic interpretation

被引:2
作者
Jabaian, Bassam [1 ]
Lefevre, Fabrice [1 ]
Besacier, Laurent [2 ]
机构
[1] Univ Avignon, LIA CERI, Avignon, France
[2] Univ Grenoble 1, LIG, Grenoble, France
关键词
Multilingual speech understanding; Conditional random fields; Hypothesis graphs; Statistical machine translation; Dialogue systems; SYSTEM;
D O I
10.1016/j.csl.2014.06.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Probabilistic approaches are now widespread in most natural language processing applications and selection of a particular approach usually depends on the task at hand. Targeting speech semantic interpretation in a multilingual context, this paper presents a comparison between the state-of-the-art methods used for machine translation and speech understanding. This comparison justifies our proposition of a unified framework for both tasks based on a discriminative approach. We demonstrate that this framework can be used to perform a joint translation-understanding decoding which allows to combine, in the same process, translation and semantic tagging scores of a sentence. A cascade of finite-state transducers is used to compose the translation and understanding hypothesis graphs (1-bests, word graphs or confusion networks). Not only this proposition is competitive with the state-of-the-art techniques but also its framework is even more attractive as it can be generalized to other components of human machine vocal interfaces (e.g. speech recognizer) so as to allow a richer transmission of information between them. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:185 / 199
页数:15
相关论文
共 52 条
[31]  
Lefevre F., 2006, IEEE ACL WORKSH SPOK
[32]  
Lefèvre F, 2010, 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, P78
[33]  
Liang Percy, 2006, Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, P104, DOI [DOI 10.3115/1220835.1220849, 10.3115/1220835.1220849]
[34]   Applications of Statistical Machine Translation Approaches to Spoken Language Understanding [J].
Macherey, Klaus ;
Bender, Oliver ;
Ney, Hermann .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04) :803-818
[35]  
Macherey Klaus., 2001, Proceedings of the 7th European Conference on Speech Communication and Technology (EuroSpeech-01), P2205
[36]   Improving statistical MT by coupling reordering and decoding [J].
Maria Crego, Josep ;
Marino, Jose B. .
MACHINE TRANSLATION, 2006, 20 (03) :199-215
[37]   N-gram-based machine translation [J].
Marino, Jose B. ;
Banchs, Rafael E. ;
Crego, Josep M. ;
de Gispert, Adria ;
Lambert, Patrik ;
Fonollosa, Jose A. R. ;
Costa-jussa, Marta R. .
COMPUTATIONAL LINGUISTICS, 2006, 32 (04) :527-549
[38]  
Meurs M.J., 2009, IEEE INT C AUD SIGN
[39]  
Minker W., 1998, DTSCH HOCHSCHULSCHRI
[40]  
Och FJ, 2002, 40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P295