Multi-stage transfer learning with BERTology-based language models for question answering system in vietnamese

被引:0
|
作者
Kiet Van Nguyen
Phong Nguyen-Thuan Do
Nhat Duy Nguyen
Anh Gia-Tuan Nguyen
Ngan Luu-Thuy Nguyen
机构
[1] University of Information Technology,
[2] Vietnam National University,undefined
来源
International Journal of Machine Learning and Cybernetics | 2023年 / 14卷
关键词
Question Answering; Machine Reading Comprehension; Transfer Learning; BERT; BERTology; SBERT; BiLSTM; Transformer;
D O I
暂无
中图分类号
学科分类号
摘要
With the fast growth of information science and engineering, a large number of textual data generated are valuable for natural language processing and its applications. Particularly, finding correct answers to natural language questions or queries requires spending tremendous time and effort in human life. While using search engines to discover information, users manually determine the answer to a given question on a range of retrieved texts or documents. Question answering relies heavily on the capability to automatically comprehend questions in human language and extract meaningful answers from a single text. In recent years, such question–answering systems have become increasingly popular using machine reading comprehension techniques. On the other hand, high-resource languages (e.g., English and Chinese) have witnessed tremendous growth in question-answering methodologies based on various knowledge sources. Besides, powerful BERTology-based language models only encode texts with a limited length. The longer texts contain more distractor sentences that affect the QA system performance. Vietnamese has a variety of question words in the same question type. To address these challenges, we propose ViQAS, a new question–answering system with multi-stage transfer learning using language models based on BERTology for a low-resource language such as Vietnamese. Last but not least, our QA system is integrated with Vietnamese characteristics and transformer-based evidence extraction techniques into an effective contextualized language model-based QA system. As a result, our proposed system outperforms our forty retriever-reader QA configurations and seven state-of-the-art QA systems such as DrQA, BERTserini, BERTBM25, XLMRQA, ORQA, COBERT, and NeuralQA on three Vietnamese benchmark question answering datasets.
引用
收藏
页码:1877 / 1902
页数:25
相关论文
共 50 条
  • [21] Chinese Diabetes Question Classification Using Large Language Models and Transfer Learning
    Ge, Chengze
    Ling, Hongshun
    Quan, Fuliang
    Zeng, Jianping
    HEALTH INFORMATION PROCESSING: EVALUATION TRACK PAPERS, CHIP 2023, 2024, 2080 : 205 - 213
  • [22] A question answering system for assembly process of wind turbines based on multi-modal knowledge graph and large language model
    Hu, Zhiqiang
    Li, Xinyu
    Pan, Xinyu
    Wen, Sijie
    Bao, Jinsong
    JOURNAL OF ENGINEERING DESIGN, 2023,
  • [23] A Multi-Stage Transformer Network for Image Dehazing Based on Contrastive Learning
    Gao F.
    Ji S.
    Guo J.
    Hou J.
    Ouyang C.
    Yang B.
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2023, 57 (01): : 195 - 210
  • [24] Multi-Stage Network Attack Detection Algorithm Based on Gaussian Mixture Hidden Markov Model and Transfer Learning
    Wang, Qian
    Wang, Weilong
    Wang, Yan
    Ren, Jiadong
    Zhang, Bing
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 3470 - 3484
  • [25] SentiMedQAer: A Transfer Learning-Based Sentiment-Aware Model for Biomedical Question Answering
    Zhu, Xian
    Chen, Yuanyuan
    Gu, Yueming
    Xiao, Zhifeng
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [26] Entity-aware answer sentence selection for question answering with transformer-based language models
    Abbasiantaeb, Zahra
    Momtazi, Saeedeh
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2022, 59 (03) : 755 - 777
  • [27] A question answering system based on mineral exploration ontology generation: A deep learning methodology
    Qiu, Qinjun
    Tian, Miao
    Ma, Kai
    Tan, Yong Jian
    Tao, Liufeng
    Xie, Zhong
    ORE GEOLOGY REVIEWS, 2023, 153
  • [28] Entity-aware answer sentence selection for question answering with transformer-based language models
    Zahra Abbasiantaeb
    Saeedeh Momtazi
    Journal of Intelligent Information Systems, 2022, 59 : 755 - 777
  • [29] Evaluating Reasoning in Factoid based Question Answering System by Using Machine Learning Approach
    Pundge, Ajitkumar Meshram
    Mahender, C. Namrata
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION AND ELECTRONICS SYSTEMS (ICCES 2018), 2018, : 821 - 825
  • [30] FarsNewsQA: a deep learning-based question answering system for the Persian news articles
    Kazemi, Arefeh
    Zojaji, Zahra
    Malverdi, Mahdi
    Mozafari, Jamshid
    Ebrahimi, Fatemeh
    Abadani, Negin
    Varasteh, Mohammad Reza
    Nematbakhsh, Mohammad Ali
    INFORMATION RETRIEVAL JOURNAL, 2023, 26 (01):