A bilingual word alignment algorithm of Vietnamese-Chinese based on feature constraint

被引:0
作者
Yuanyuan Mo
Jianyi Guo
Zhengtao Yu
Lin Luo
Shengxiang Gao
机构
[1] Kunming University of Science and Technology,School of Information Engineering and Automation
[2] Kunming University of Science and Technology,Key Lab of Intelligent Information Processing
来源
International Journal of Machine Learning and Cybernetics | 2015年 / 6卷
关键词
Vietnamese; Chinese; Word Alignment; Log-Linear Model;
D O I
暂无
中图分类号
学科分类号
摘要
It is difficult to achieve auto-alignment between Vietnamese and Chinese, because their syntax and structure are quite different. In this case we present a novel method for the Vietnamese-Chinese word alignment which merges a variety of feature constraint models. In this article, an improved model based on the Vietnamese-Chinese progressive structure and offset features of word sequence is described. From this model which is trained by a log-linear model framework, and with parameters trained by the minimum error rate algorithm, the result of the Vietnamese-Chinese auto-alignment is obtained. The basic model of the experiments is IBM Model 3, and as experimental results suggest, this bilingual word alignment method for Vietnamese and Chinese performs well and precision, recall rates are increased by 28.57 and 25.02 %, AER is reduced by 14.25 %.
引用
收藏
页码:537 / 543
页数:6
相关论文
共 18 条
  • [1] Wang XZ(2014)Non-naive bayesian classifiers for classification problems with continuous attributes Cybern IEEE Trans 44 21-39
  • [2] He YL(2014)A new approach to classifier fusion based on upper integral IEEE Trans Cybern 44 620-182
  • [3] Wang DD(2009)Globalisation, networks and translation: a Chinese perspective Perspect Stud Transl 16 169-311
  • [4] Wang XZ(1993)The mathematics of statistical machine translation: parameter estimation Comput Linguist 19 263-51
  • [5] Wang R(2003)A systematic comparison of various statistical alignment models Comput Linguist 29 19-339
  • [6] Feng HM(2010)Discriminative word alignment by linear modeling Comput Linguist 36 303-undefined
  • [7] Wang HC(undefined)undefined undefined undefined undefined-undefined
  • [8] Tang J(undefined)undefined undefined undefined undefined-undefined
  • [9] Gentzler E(undefined)undefined undefined undefined undefined-undefined
  • [10] Brown PF(undefined)undefined undefined undefined undefined-undefined