A NOVEL APPROACH TO MACHINE TRANSLATION: A PROPOSED LANGUAGE-INDEPENDENT SYSTEM BASED ON DEDUCTIVE SCHEMES

被引:0
作者
Fakhrahmad, S. M. [1 ]
Rezapour, A. R. [1 ]
Sadreddini, M. H. [1 ]
Jahromi, M. Zolghadri [1 ]
机构
[1] Shiraz Univ, Sch Elect & Elect Engn, Dept Comp Sci & Engn, Shiraz, Iran
关键词
Machine translation; example-based; rule-based; corpora-based; finite automata; grammar induction;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Compared to corpora-based machine translation methods, rule-based methods have deficiencies, which make them unattractive for the researchers of this field. The first problem is that these methods are language dependent. Rule-based methods require the syntactic information about source and target languages. On the other hand, in many cases, especially for proverbs and specific expressions, syntactic rules are no longer useful. In such cases, the use of example-based approaches is inevitable. In this work, we propose and integrate a set of novel schemes to introduce a new translation system, called BORNA. First a grammar induction method based on the Expectation Maximization (EM) algorithm is proposed. After representing the extracted knowledge in the form of a set of nested finite automata, a recursive model is proposed, which uses a combination of rule and example based techniques. In the translation phase, through a hierarchical chunking process, the input sentence is divided into a set of phrases. Each phrase is searched in the corpus of examples. If the phrase is found, it will not be chunked anymore. Otherwise, the phrase is divided into smaller sub-phrases. The simulation results show that BORNA outperforms its counterparts, significantly. Compared to PARS, Frengly and Google translators, BORNA receives the highest Bleu scores for its translations, while it results in the minimum values for different error measures, including PER, TER and WER.
引用
收藏
页码:59 / 72
页数:14
相关论文
共 31 条
[21]  
Hettige B., 2012, INT C ADV ICT EM REG
[22]   Example-based machine translation: a review and commentary [J].
Hutchins, John .
MACHINE TRANSLATION, 2005, 19 (3-4) :197-211
[23]  
Johnson H., 2007, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), P967
[24]  
Nagao M., 1984, 10th International Conference on Computational Linguistics. 22nd Annual Meeting of the Association for Computational Linguistics. Proceedings of Coling 84, P420
[25]   Structural semantic interconnections: A knowledge-based approach to word sense disambiguation [J].
Navigli, R ;
Velardi, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (07) :1075-1086
[26]   BLEU: a method for automatic evaluation of machine translation [J].
Papineni, K ;
Roukos, S ;
Ward, T ;
Zhu, WJ .
40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, :311-318
[27]  
Schwenk H., 2008, Proceedings of the International Workshop on Spoken Language Translation, IWSLT'08, P182
[28]  
Sinha R.M.K., 2003, ANGLAHINDI ENGLISH H, P112
[29]  
Snover M, 2006, P ASS MACH TRANSL AM
[30]  
SUMITA E, 1991, 29TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS : PROCEEDINGS OF THE CONFERENCE, P185