Multi-stage transfer learning with BERTology-based language models for question answering system in vietnamese

被引:0
|
作者
Kiet Van Nguyen
Phong Nguyen-Thuan Do
Nhat Duy Nguyen
Anh Gia-Tuan Nguyen
Ngan Luu-Thuy Nguyen
机构
[1] University of Information Technology,
[2] Vietnam National University,undefined
来源
International Journal of Machine Learning and Cybernetics | 2023年 / 14卷
关键词
Question Answering; Machine Reading Comprehension; Transfer Learning; BERT; BERTology; SBERT; BiLSTM; Transformer;
D O I
暂无
中图分类号
学科分类号
摘要
With the fast growth of information science and engineering, a large number of textual data generated are valuable for natural language processing and its applications. Particularly, finding correct answers to natural language questions or queries requires spending tremendous time and effort in human life. While using search engines to discover information, users manually determine the answer to a given question on a range of retrieved texts or documents. Question answering relies heavily on the capability to automatically comprehend questions in human language and extract meaningful answers from a single text. In recent years, such question–answering systems have become increasingly popular using machine reading comprehension techniques. On the other hand, high-resource languages (e.g., English and Chinese) have witnessed tremendous growth in question-answering methodologies based on various knowledge sources. Besides, powerful BERTology-based language models only encode texts with a limited length. The longer texts contain more distractor sentences that affect the QA system performance. Vietnamese has a variety of question words in the same question type. To address these challenges, we propose ViQAS, a new question–answering system with multi-stage transfer learning using language models based on BERTology for a low-resource language such as Vietnamese. Last but not least, our QA system is integrated with Vietnamese characteristics and transformer-based evidence extraction techniques into an effective contextualized language model-based QA system. As a result, our proposed system outperforms our forty retriever-reader QA configurations and seven state-of-the-art QA systems such as DrQA, BERTserini, BERTBM25, XLMRQA, ORQA, COBERT, and NeuralQA on three Vietnamese benchmark question answering datasets.
引用
收藏
页码:1877 / 1902
页数:25
相关论文
共 50 条
  • [1] Multi-stage transfer learning with BERTology-based language models for question answering system in vietnamese
    Nguyen, Kiet Van
    Do, Phong Nguyen-Thuan
    Nguyen, Nhat Duy
    Nguyen, Anh Gia-Tuan
    Nguyen, Ngan Luu-Thuy
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (05) : 1877 - 1902
  • [2] Distant Supervision for Multi-Stage Fine-Tuning in Retrieval-Based Question Answering
    Xie, Yuqing
    Yang, Wei
    Tan, Luchen
    Xiong, Kun
    Yuan, Nicholas Jing
    Huai, Baoxing
    Li, Ming
    Lin, Jimmy
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 2934 - 2940
  • [3] Enhancement of Question Answering System Accuracy via Transfer Learning and BERT
    Duan, Kai
    Du, Shiyu
    Zhang, Yiming
    Lin, Yanru
    Wu, Hongzhuo
    Zhang, Quan
    APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [4] Efficient Question Answering Based on Language Models and Knowledge Graphs
    Li, Fengying
    Huang, Hongfei
    Dong, Rongsheng
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT IV, 2023, 14257 : 340 - 351
  • [5] Tourism scene classification based on multi-stage transfer learning model
    Qi, Tangquan
    Xu, Yong
    Ling, Haibin
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (08) : 4341 - 4352
  • [6] Tourism scene classification based on multi-stage transfer learning model
    Tangquan Qi
    Yong Xu
    Haibin Ling
    Neural Computing and Applications, 2019, 31 : 4341 - 4352
  • [7] Evaluation of an Arabic Chatbot Based on Extractive Question-Answering Transfer Learning and Language Transformers
    Alruqi, Tahani N.
    Alzahrani, Salha M.
    AI, 2023, 4 (03) : 667 - 691
  • [8] Multi-Stage Transfer Learning System with Lightweight Architectures in Medical Image Classification
    Godasu, Rajesh
    El-Gayar, Omar
    Sutrave, Kruttika
    AMCIS 2020 PROCEEDINGS, 2020,
  • [9] DATLMedQA: A Data Augmentation and Transfer Learning Based Solution for Medical Question Answering
    Zhou, Shuohua
    Zhang, Yanping
    APPLIED SCIENCES-BASEL, 2021, 11 (23):
  • [10] Deep learning based question answering system in Bengali
    Mayeesha, Tasmiah Tahsin
    Sarwar, Abdullah Md
    Rahman, Rashedur M.
    JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2021, 5 (02) : 145 - 178