Improving Word Alignment Through Morphological Analysis

Cited by: 0
Authors
Vuong Van Bui [1]
Thanh Trung Tran [1]
Nhat Bich Thi Nguyen [1]
Tai Dinh Pham [1]
Anh Ngoc Le [1]
Cuong Anh Le [1]
Affiliations
[1] Univ Engn & Technol, Vietnam Natl Univ, Dept Comp Sci, Hanoi, Vietnam
Source
INTEGRATED UNCERTAINTY IN KNOWLEDGE MODELLING AND DECISION MAKING, IUKM 2015 | 2015, Vol. 9376
Keywords
Machine translation; Word alignment; IBM models; Morphological analysis
DOI
10.1007/978-3-319-25135-6_30
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Word alignment plays a critical role in statistical machine translation systems. The well-known IBM model series currently operates only on the surface forms of words, regardless of their linguistic features. This deficiency often leads to data sparseness problems. Therefore, we present an extension that integrates morphological analysis into the traditional IBM models. Experiments on English-Vietnamese tasks show that the new model produces better results not only in word alignment but also in final translation performance.
Pages: 315-325
Page count: 11