Recognition and Segmentation of English Long and Short Sentences Based on Machine Translation

被引:2
作者
Zhang, Tiehu [1 ]
机构
[1] Xian Aeronaut Univ, Sch Foreign Languages, Xian, Shaanxi, Peoples R China
关键词
Machine translation; long sentence; regular match; error-driven method;
D O I
10.3991/ijet.v15i101.10182
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
With the advent of the information age, long sentences which include many words and have more complex structures. The translation of long sentences in English-Chinese machine translation has always been the focus of research. In this study, 400 long sentences were randomly selected from NTCIR-9 patent corpus for testing the recognition and segmentation effects of regular match method and error-driven method, and the accuracy rate of the translation was compared on Baidu Online Translation Platform. The results demonstrated that the regular matching method was effective in recognizing and segmenting long sentences, nevertheless there were many defects; the error-driven method was more effective in recognizing and segmenting long sentences; the former increased by 4.8% of the BLEU value of the translated text on Baidu Online Translation Platform and the latter increased by 12.1%, which showed that the error-driven method was more effective in machine translation.
引用
收藏
页码:152 / 162
页数:11
相关论文
共 15 条
[1]  
[Anonymous], 2016, P 2016 C EMPIRICAL M, DOI [DOI 10.18653/V1/D16-1163, 10.18653/V1/D16-1163.URLhttps:/]
[2]  
Bojar O., 2015, P 10 WORKSH STAT MAC, P1, DOI [DOI 10.18653/V1/W15-3001, 10.18653/]
[3]   Embracing the threat: machine translation as a solution for subtitling [J].
Bywood, Lindsay ;
Georgakopoulou, Panayota ;
Etchegoyhen, Thierry .
PERSPECTIVES-STUDIES IN TRANSLATION THEORY AND PRACTICE, 2017, 25 (03) :492-508
[4]  
Ebrahimi J, 2018, On adversarial examples for character-level neural machine translation
[5]  
Firat O., 2016, P 2016 C N AM CHAPT, P866
[6]  
Germann Ulrich, 2015, Prague Bulletin of Mathematical Linguistics, P39, DOI 10.1515/pralin-2015-0012
[7]  
Jean Sebastien, 2015, P 10 WORKSHOP STAT M, P134
[8]  
Marciano J. P., 2017, METHODS SYSTEMS MULT
[9]  
Luong MT, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P1054
[10]   Statistical Machine Translation from and into Morphologically Rich and Low Resourced Languages [J].
Pushpananda, Randil ;
Weerasinghe, Ruvan ;
Niranjan, Mahesan .
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT I, 2015, 9041 :545-556