Relative Clause based Text Simplification for Improved English to Hindi Translation

被引:0
作者
Saini, Sandeep [1 ]
Sehgal, Umang [2 ]
Sahula, Vineet [3 ]
机构
[1] LNM Inst Informat Technol, Dept Elect & Commun Engn, Jaipur, Rajasthan, India
[2] LNM Inst Informat Technol, Dept Comp Sci & Engn, Jaipur, Rajasthan, India
[3] Malviya Natl Inst Technol, Dept Elect & Commun Engn, Jaipur, Rajasthan, India
来源
2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI) | 2015年
关键词
Machine Translation; Text Simplification; Relative Clause identification; Computational Intelligence;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Language translation is one of the most research and development oriented topic in today's world because of its increasing demand and application. Knowledge of grammar structure of source and target languages is must for translating one language to other. Clauses are an integral part of any language and helps in constructing complex sentences in different contexts. This complication leads to a low score of translation in almost every machine translation engine existing in the world. In this work, we are focusing on relative clause identification and extraction for text simplification. The generated simple sentences are then fed to the existing translation engines for translation. Link Parser based parsing techniques are used to parse the sentence tree. In this work we have focused on achieving better quality of English-Hindi translations. The proposed approach is tested manually on a sufficiently large dataset and shows promising and better translation score than the conventional approaches.
引用
收藏
页码:1479 / 1484
页数:6
相关论文
共 19 条
[1]  
[Anonymous], 2002, P 40 ANN M ASS COMP
[2]  
[Anonymous], 2014, Proceedings of The 3rd Workshop on Predicting and Improving Text Readability for Target Reader Populations, DOI [10.3115/v1/W14-1201, DOI 10.3115/V1/W14-1201]
[3]  
[Anonymous], 2006, PROC 5 INT C LANGUAG
[4]  
Aranzabe M. J., 2012, P NAT LANG PROC IMPR
[5]  
Boguraev B., 1987, Computational Linguistics, V13, P203
[6]  
Brown PeterF., 1993, Computational linguistics, V19, P263311
[7]   Splitting Complex Sentences for Natural Language Processing Applications: Building a Simplified Spanish Corpus [J].
Camacho Collados, Jose .
CORPUS RESOURCES FOR DESCRIPTIVE AND APPLIED STUDIES. CURRENT CHALLENGES AND FUTURE DIRECTIONS: SELECTED PAPERS FROM THE 5TH INTERNATIONAL CONFERENCE ON CORPUS LINGUISTICS (CILC2013), 2013, 95 :464-472
[8]  
Doddington George, 2002, P 2 INT C HUM LANG T
[9]  
Doi Takao, 2003, P WORKSH BUILD US PA
[10]  
Dornescu Iustin, 2014, COLING 2014, V2014