Saudi Learner Translation Corpus: The design and compilation of an English-Arabic learner translation corpus

Cited by: 1
Authors
Al-Harthi, Maha [1]
Alsaif, Amal [2]
Al-Nafjan, Eman [3]
Alshihri, Fatma [4]
Saleh, Mahmoud [5]
Affiliations
[1] Princess Nourah Bint Abdulrahman Univ, Coll Languages, Dept Appl Linguist, Riyadh, Saudi Arabia
[2] Al Imam Mohammad Ibn Saud Islamic Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
[3] King Saud bin Abdulaziz Univ Hlth Sci, Dept English, Riyadh, Saudi Arabia
[4] Princess Nourah Bint Abdulrahman Univ, Coll Languages, Dept Translat, Riyadh, Saudi Arabia
[5] King Saud Univ, Coll Languages & Translat, Dept Translat, Riyadh, Saudi Arabia
DOI
10.1371/journal.pone.0303729
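Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract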
This article introduces the Saudi Learner Translation Corpus (SauLTC), an innovative multi-version English-Arabic parallel corpus featuring part-of-speech tagging. We describe the corpus parameters and compilation process and explain how textual processing and sentence alignment are conducted. The participants include 366 student translators, 48 instructors, and 23 alignment verifiers. The corpus provides access to two target versions of every ST to allow the detection of the changes in the translation and revision processes from the initial to the final draft. The translations were collected over three years, yielding 5,160,386 tokens. The metadata of 23 sentence alignment verifiers were added to the analysis as a unique variable to investigate individual differences in the manual verification process. This unidirectional corpus can be used to identify student translators' strategies and errors in translation and analyze the efficacy of instructors' feedback. Furthermore, it is accessible via an application and a website. It provides translation teachers and researchers with a database that can help develop corpus-based and corpus-driven teaching materials.
Pages: 22