Improved statistical machine translation model with topic-based paraphrase

被引:0
作者
Su, Jin-Song [1 ]
Dong, Huai-Lin [1 ]
Chen, Yi-Dong [2 ]
Shi, Xiao-Dong [2 ]
Wu, Qing-Qiang [1 ]
机构
[1] School of Software, Xiamen University, Xiamen
[2] Department of Cognitive Science, Xiamen University, Xiamen
来源
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science) | 2014年 / 48卷 / 10期
关键词
Paraphrase; Statistical machine translation; Topic model;
D O I
10.3785/j.issn.1008-973X.2014.10.019
中图分类号
学科分类号
摘要
To deal with the defect of the conventional parallel corpus based paraphrase extraction method which neglects document-level context, the paraphrase extraction and its application in statistical machine translation were improved by introducing the context based on topic model. The problem that how to better learn two kinds of paraphrase probabilities: topic-insensitive and topic-sensitive ones, was mainly analyzed. Both of the two probabilities can be incorporated into the modeling of statistical machine translation by using different methods. The experimental results on various test sets demonstrated the effectiveness of the approach. ©, 2014, Zhejiang University. All right reserved.
引用
收藏
页码:1843 / 1849
页数:6
相关论文
共 27 条
[1]  
Koehn P., Och F.J., Marcu D., Statistical phrase-based translation, Proceedings of HLT-NAACL, pp. 48-54, (2003)
[2]  
Chiang D., Hierarchical phrase-based translation, Computational Linguistics, 33, 2, pp. 201-288, (2007)
[3]  
Scalable inference and training of context-rich syntactic translation models, Proceedings of ACL, pp. 961-968, (2006)
[4]  
Liu Y., Liu Q., Lin S.-X., 2006.Tree-to-string alignment template for statistical machine translation, Proceedings of ACL, pp. 609-616, (2006)
[5]  
Wu D.-K., Stochastic inversion transduction grammars and bilingual parsing of parallel corpora, Computational Linguistics, 23, 3, pp. 377-404, (1997)
[6]  
Xiong D.-Y., Liu Q., Lin S.-X., Maximum entropy based on phrase reordering model for statistical machine translation, Proceedings of ACL, pp. 521-528, (2006)
[7]  
Mitamura T., Nyberg E., Automatic rewriting for controlled language translation, Proceedings of NLPRS, pp. 1-12, (2001)
[8]  
Yamamoto K., Machine translation by interaction between paraphraser and transfer, Proceedings of COLING, pp. 1107-1113, (2002)
[9]  
Zhang Y.J., Yamamoto K., Paraphrasing of Chinese utterances, Proceedings of COLING, pp. 1163-1169, (2002)
[10]  
Shimohata M., Sumita E., Matsumoto Y., Building a paraphrase corpus for speech translation, Proceedings of LREC, pp. 1407-1410, (2004)