Improving Word Alignment Through Morphological Analysis

被引:0
作者
Vuong Van Bui [1 ]
Thanh Trung Tran [1 ]
Nhat Bich Thi Nguyen [1 ]
Tai Dinh Pham [1 ]
Anh Ngoc Le [1 ]
Cuong Anh Le [1 ]
机构
[1] Univ Engn & Technol, Vietnam Natl Univ, Dept Comp Sci, Hanoi, Vietnam
来源
INTEGRATED UNCERTAINTY IN KNOWLEDGE MODELLING AND DECISION MAKING, IUKM 2015 | 2015年 / 9376卷
关键词
Machine translation; Word alignment; IBM models; Morphological analysis;
D O I
10.1007/978-3-319-25135-6_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word alignment plays a critical role in statistical machine translation systems. The famous word alignment system, IBM models series, currently operates on only surface forms of words regardless of their linguistic features. This deficiency usually leads to many data sparseness problems. Therefore, we present an extension that enables the integration of morphological analysis into the traditional IBM models. Experiments on English-Vietnamese tasks show that the new model produces better results not only in word alignment but also in final translation performance.
引用
收藏
页码:315 / 325
页数:11
相关论文
共 50 条
  • [41] Linguistics-based word alignment for medical translators
    Vanallemeersch, Tom
    Wermuth, Cornelia
    [J]. JOURNAL OF SPECIALISED TRANSLATION, 2008, (09) : 20 - 38
  • [42] Bootstrapping Word Alignment by automatically Generated Bilingual Dictionary
    Zhu, Danqing
    Chang, Baobao
    [J]. IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 311 - 317
  • [43] Constraining a Generative Word Alignment Model with Discriminative Output
    Goh, Chooi-Ling
    Watanabe, Taro
    Yamamoto, Hirofumi
    Sumita, Eiichiro
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (07) : 1976 - 1983
  • [44] WORD ALIGNMENT BASED ON MULTI-GRAIN MODEL
    He, Yanqing
    Zhou, Yu
    Zong, Chengqing
    [J]. 2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 269 - 272
  • [45] A Hybrid Approach for Word Alignment with Statistical Modeling and Chunker
    Srivastava, Jyoti
    Sanyal, Sudip
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT I, 2015, 9041 : 570 - 581
  • [46] Measuring word alignment quality for statistical machine translation
    Fraser, Alexander
    Marcu, Daniel
    [J]. COMPUTATIONAL LINGUISTICS, 2007, 33 (03) : 293 - 303
  • [47] Unsupervised joint monolingual character alignment and word segmentation
    [J]. Teng, Zhiyang (tengzhiyang@ict.ac.cn), 1600, Springer Verlag (8801): : 1 - 12
  • [48] A word alignment model based on multiobjective evolutionary algorithms
    Chen, Yidong
    Shi, Xiaodong
    Zhou, Changle
    Hong, Qingyang
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2009, 57 (11-12) : 1724 - 1729
  • [49] Hapax Legomena: Their Contribution in Number and Efficiency to Word Alignment
    Lardilleux, Adrien
    Lepage, Yves
    [J]. HUMAN LANGUAGE TECHNOLOGY: CHALLENGES OF THE INFORMATION SOCIETY, 2009, 5603 : 440 - 450
  • [50] Unsupervised Joint Monolingual Character Alignment and Word Segmentation
    Teng, Zhiyang
    Xiong, Hao
    Liu, Qun
    [J]. CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2014, 2014, 8801 : 1 - 12