Iteratively constrained selection of word alignment links using knowledge and statistics

被引:1
作者
Lee, Jonghoon [1 ]
Lee, Sungjin [1 ]
Noh, Hyeongjong [1 ]
Lee, Kyusong [1 ]
Lee, Gary Geunbae [1 ]
机构
[1] Pohang Univ Sci & Technol, Dept Comp Sci & Engn, Pohang, South Korea
关键词
Bilingual resource; Parallel text; Machine translation; Word alignment; Korean-English;
D O I
10.1016/j.knosys.2011.05.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word alignment is a crucial component in applications that use bilingual resources. Statistical methods are widely used because they are portable and allow simple system building. However, pure statistical methods often incorrectly align functional words in the English-Korean language pair due to differences in the typology of the languages and a lack of knowledge. Knowledge is inevitably required to correct errors and to improve word alignment quality. In this paper, we introduce an effective method that uses an iterative process to incorporate knowledge into the word alignment system. The method achieved significant improvements in word alignment and its application: statistical machine translation. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:1120 / 1130
页数:11
相关论文
共 20 条
[1]  
[Anonymous], ACL 2007
[2]  
[Anonymous], JOINT C EMP METH NAT
[3]  
[Anonymous], 1993, Proceedings of the Workshop on Very Large Corpora
[4]  
[Anonymous], C EMP METH NAT LANG
[5]  
[Anonymous], P 3 WORKSH STAT MACH
[6]  
[Anonymous], ANNOTATION STYLE GUI
[7]  
[Anonymous], P 2003 C N AM CHAPT
[8]  
[Anonymous], P 40 ANN M ASS COMP
[9]  
[Anonymous], KNOWLEDGE BASED SYST
[10]  
Brown P. F., 1993, Computational Linguistics, V19, P263