XLMRQA: Open-Domain Question Answering on Vietnamese Wikipedia-Based Textual Knowledge Source

Cited by: 1
Authors
Kiet Van Nguyen [1, 2]
Phong Nguyen-Thuan Do [2]
Nhat Duy Nguyen [2]
Tin Van Huynh [1, 2]
Anh Gia-Tuan Nguyen [1, 2]
Ngan Luu-Thuy Nguyen [1, 2]
Affiliations
[1] University of Information Technology, Faculty of Information Science and Engineering, Ho Chi Minh City, Vietnam
[2] Vietnam National University, Ho Chi Minh City, Vietnam
Keywords
Question answering; Transformer; BERT; XLM-R; Transfer learning; Machine reading comprehension
DOI
10.1007/978-3-031-21743-2_30
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Question answering (QA) is a natural language understanding task at the intersection of information retrieval and information extraction. It has attracted considerable attention from the computational linguistics and artificial intelligence research communities in recent years, driven by the rapid development of machine reading comprehension (MRC) models. A reader-based QA system is a high-level search engine that uses MRC techniques to find correct answers to queries or questions in open-domain or domain-specific texts. Most advances in data resources and machine-learning approaches for MRC and QA systems have been made in two resource-rich languages, English and Chinese, whereas research on QA systems for a low-resource language like Vietnamese remains scarce. This paper presents XLMRQA, the first Vietnamese QA system that uses a supervised transformer-based reader over a Wikipedia-based textual knowledge source (using the UIT-ViQuAD corpus). It outperforms two robust QA systems based on deep neural network models, DrQA and BERTserini, by 24.46% and 6.28%, respectively. Based on the results obtained with the three systems, we analyze how question types influence the performance of the QA systems.
Pages: 377-389
Number of pages: 13