Learning dependency transduction models from unannotated examples

被引:2
作者
Alshawi, H [1 ]
Douglas, S [1 ]
机构
[1] AT&T Labs Res, Florham Park, NJ 07932 USA
来源
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES | 2000年 / 358卷 / 1769期
关键词
statistical machine translation; automata; dependency grammar;
D O I
10.1098/rsta.2000.0591
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We present a method for constructing a statistical machine translation system automatically from unannotated examples in a manner consistent with the principles of dependency grammar. The method involves learning a generative statistical model of paired dependency derivations of source and target sentences. Such a dependency transduction model consists of collections of weighted head transducers. Head transducers are finite-state machines with different formal properties from 'standard' finite-state transducers. When applied to machine translation, the acquired head transducers are applied 'middle out', efficiently converting source head words and dependents directly into their counterparts in the target language. We present experimental results on the accuracy of our models for English-Spanish and English-Japanese translation, the training examples being pairs of transcribed spontaneous utterances and their translations. A hierarchical decomposition of bi-language strings emerges from our training process; this decomposition may or may not correspond to familiar linguistic phrase structure. However, no explicit semantic representations are involved, suggesting an approach to language processing in which natural language itself is the semantic representation.
引用
收藏
页码:1357 / 1370
页数:14
相关论文
共 17 条
[1]  
ALSHAWI H, 1996, P INT C SPOK LANG PR
[2]  
Alshawi H., 1996, SEMANTIC AMBIGUITY U, P145
[3]  
ALSHAWI H, 1998, P INT C SPOK LANG PR
[4]  
ALSHAWI H, 1997, 35 A M ASS COMP LING
[5]  
BOOTH T, 1969, 10 A IEEE S SWITCH A
[6]  
Brown P. F., 1993, Computational Linguistics, V19, P263
[7]  
Brown P. F., 1990, Computational Linguistics, V16, P79
[8]   AN EFFICIENT CONTEXT-FREE PARSING ALGORITHM [J].
EARLEY, J .
COMMUNICATIONS OF THE ACM, 1970, 13 (02) :94-&
[9]  
Gale W.A., 1991, SPEECH NATURAL LANGU, P152
[10]   DEPENDENCY THEORY - A FORMALISM AND SOME OBSERVATIONS [J].
HAYS, DG .
LANGUAGE, 1964, 40 (04) :511-525