Machine Reading Comprehension Model for Low-Resource Languages and Experimenting on Vietnamese

被引:1
|
作者
Bach Hoang Tien Nguyen [1 ]
Dung Manh Nguyen [1 ]
Trang Thi Thu Nguyen [1 ]
机构
[1] Hanoi Univ Sci & Technol, Sch Informat & Commun Technol, Hanoi, Vietnam
关键词
Low resource languages; Translated datasets; Pre-train layer;
D O I
10.1007/978-3-031-08530-7_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine Reading Comprehension (MRC) is a challenging task in natural language processing. In recent times, many large datasets and good models are public for this task, but most of them are for English only. Building a good MRC dataset always takes much effort, this paper proposes a method, called UtlTran, to improve the MRC quality for low-resource languages. In this method, all available MRC English datasets are collected and translated into the target language with some context-reducing strategies for better results. Tokens of question and context are initialized word representations using a word embedding model. They are then pre-trained with the MRC model with the translated dataset for the specific low-resource language. Finally, a small manual MRC dataset is used to continue fine-tuning the model to get the best results. The experimental results on the Vietnamese language show that the best word embedding model for this task is a multilingual one - XLM-R. Whereas, the best translation strategy is to reduce context by answer positions. The proposed model gives the best quality, i.e. F1 = 88.2% and Exact Match (EM) =71.8%, on the UIT-ViQuAD dataset, compared to the state-of-the-art models.
引用
收藏
页码:370 / 381
页数:12
相关论文
共 50 条
  • [21] Improving a Multi-Source Neural Machine Translation Model with Corpus Extension for Low-Resource Languages
    Choi, Gyu-Hyeon
    Shin, Jong-Hun
    Kim, Young-Kil
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 900 - 904
  • [22] Low-Resource Machine Transliteration Using Recurrent Neural Networks of Asian Languages
    Le, Ngoc Tan
    Sadat, Fatiha
    NAMED ENTITIES, 2018, : 95 - 100
  • [23] Incident-Driven Machine Translation and Name Tagging for Low-resource Languages
    Hermjakob, Ulf
    Li, Qiang
    Marcu, Daniel
    May, Jonathan
    Mielke, Sebastian J.
    Pourdamghani, Nima
    Pust, Michael
    Shi, Xing
    Knight, Kevin
    Levinboim, Tomer
    Murray, Kenton
    Chiang, David
    Zhang, Boliang
    Pan, Xiaoman
    Lu, Di
    Lin, Ying
    Ji, Heng
    MACHINE TRANSLATION, 2018, 32 (1-2) : 59 - 89
  • [24] Multilingual neural machine translation for low-resource languages by twinning important nodes
    Qorbani, Abouzar
    Ramezani, Reza
    Baraani, Ahmad
    Kazemi, Arefeh
    NEUROCOMPUTING, 2025, 634
  • [25] DRA: dynamic routing attention for neural machine translation with low-resource languages
    Wang, Zhenhan
    Song, Ran
    Yu, Zhengtao
    Mao, Cunli
    Gao, Shengxiang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
  • [26] On the scalability of data augmentation techniques for low-resource machine translation between Chinese and Vietnamese
    Vu, Huan
    Bui, Ngoc Dung
    JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2023, 7 (02) : 241 - 253
  • [27] Revisiting Back-Translation for Low-Resource Machine Translation Between Chinese and Vietnamese
    Li, Hongzheng
    Sha, Jiu
    Shi, Can
    IEEE ACCESS, 2020, 8 (08) : 119931 - 119939
  • [28] Model Transfer for Tagging Low-resource Languages using a Bilingual Dictionary
    Fang, Meng
    Cohn, Trevor
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 587 - 593
  • [29] Enabling Medical Translation for Low-Resource Languages
    Musleh, Ahmad
    Durrani, Nadir
    Temnikova, Irina
    Nakov, Preslav
    Vogel, Stephan
    Alsaad, Osama
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT II, 2018, 9624 : 3 - 16
  • [30] Classifying educational materials in low-resource languages
    Sohsah, Gihad N.
    Guzey, Onur
    Tarmanini, Zaina
    2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 431 - 435