A unified framework for translation and understanding allowing discriminative joint decoding for multilingual speech semantic interpretation

被引：2

作者：

Jabaian, Bassam ^{[1
]}

Lefevre, Fabrice ^{[1
]}

Besacier, Laurent ^{[2
]}

机构：

[1] Univ Avignon, LIA CERI, Avignon, France

[2] Univ Grenoble 1, LIG, Grenoble, France

来源：

COMPUTER SPEECH AND LANGUAGE | 2016年 / 35卷

关键词：

Multilingual speech understanding; Conditional random fields; Hypothesis graphs; Statistical machine translation; Dialogue systems; SYSTEM;

D O I：

10.1016/j.csl.2014.06.003

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Probabilistic approaches are now widespread in most natural language processing applications and selection of a particular approach usually depends on the task at hand. Targeting speech semantic interpretation in a multilingual context, this paper presents a comparison between the state-of-the-art methods used for machine translation and speech understanding. This comparison justifies our proposition of a unified framework for both tasks based on a discriminative approach. We demonstrate that this framework can be used to perform a joint translation-understanding decoding which allows to combine, in the same process, translation and semantic tagging scores of a sentence. A cascade of finite-state transducers is used to compose the translation and understanding hypothesis graphs (1-bests, word graphs or confusion networks). Not only this proposition is competitive with the state-of-the-art techniques but also its framework is even more attractive as it can be generalized to other components of human machine vocal interfaces (e.g. speech recognizer) so as to allow a richer transmission of information between them. (C) 2014 Elsevier Ltd. All rights reserved.

引用

页码：185 / 199

页数：15

共 52 条

[31]

Lefevre F., 2006, IEEE ACL WORKSH SPOK

[32]

Lefèvre F, 2010, 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, P78

[33]

Liang Percy, 2006, Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, P104, DOI [DOI 10.3115/1220835.1220849, 10.3115/1220835.1220849]

[34] Applications of Statistical Machine Translation Approaches to Spoken Language Understanding [J].

Macherey, Klaus ;

Bender, Oliver ;

Ney, Hermann .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04) :803-818

[35]

Macherey Klaus., 2001, Proceedings of the 7th European Conference on Speech Communication and Technology (EuroSpeech-01), P2205

[36] Improving statistical MT by coupling reordering and decoding [J].

Maria Crego, Josep ;

Marino, Jose B. .

MACHINE TRANSLATION, 2006, 20 (03) :199-215

[37] N-gram-based machine translation [J].

Marino, Jose B. ;

Banchs, Rafael E. ;

Crego, Josep M. ;

de Gispert, Adria ;

Lambert, Patrik ;

Fonollosa, Jose A. R. ;

Costa-jussa, Marta R. .

COMPUTATIONAL LINGUISTICS, 2006, 32 (04) :527-549

[38]

Meurs M.J., 2009, IEEE INT C AUD SIGN

[39]

Minker W., 1998, DTSCH HOCHSCHULSCHRI

[40]

Och FJ, 2002, 40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P295

← 1 2 3 4 5 6 →