Multi-stage transfer learning with BERTology-based language models for question answering system in vietnamese

被引:0
|
作者
Kiet Van Nguyen
Phong Nguyen-Thuan Do
Nhat Duy Nguyen
Anh Gia-Tuan Nguyen
Ngan Luu-Thuy Nguyen
机构
[1] University of Information Technology,
[2] Vietnam National University,undefined
来源
International Journal of Machine Learning and Cybernetics | 2023年 / 14卷
关键词
Question Answering; Machine Reading Comprehension; Transfer Learning; BERT; BERTology; SBERT; BiLSTM; Transformer;
D O I
暂无
中图分类号
学科分类号
摘要
With the fast growth of information science and engineering, a large number of textual data generated are valuable for natural language processing and its applications. Particularly, finding correct answers to natural language questions or queries requires spending tremendous time and effort in human life. While using search engines to discover information, users manually determine the answer to a given question on a range of retrieved texts or documents. Question answering relies heavily on the capability to automatically comprehend questions in human language and extract meaningful answers from a single text. In recent years, such question–answering systems have become increasingly popular using machine reading comprehension techniques. On the other hand, high-resource languages (e.g., English and Chinese) have witnessed tremendous growth in question-answering methodologies based on various knowledge sources. Besides, powerful BERTology-based language models only encode texts with a limited length. The longer texts contain more distractor sentences that affect the QA system performance. Vietnamese has a variety of question words in the same question type. To address these challenges, we propose ViQAS, a new question–answering system with multi-stage transfer learning using language models based on BERTology for a low-resource language such as Vietnamese. Last but not least, our QA system is integrated with Vietnamese characteristics and transformer-based evidence extraction techniques into an effective contextualized language model-based QA system. As a result, our proposed system outperforms our forty retriever-reader QA configurations and seven state-of-the-art QA systems such as DrQA, BERTserini, BERTBM25, XLMRQA, ORQA, COBERT, and NeuralQA on three Vietnamese benchmark question answering datasets.
引用
收藏
页码:1877 / 1902
页数:25
相关论文
共 50 条
  • [41] Modeling Extractive Question Answering Using Encoder-Decoder Models with Constrained Decoding and Evaluation-Based Reinforcement Learning
    Li, Shaobo
    Sun, Chengjie
    Liu, Bingquan
    Liu, Yuanchao
    Ji, Zhenzhou
    MATHEMATICS, 2023, 11 (07)
  • [42] Multi-stage transfer learning for lung segmentation using portable X-ray devices for patients with COVID-19
    Vidal, Placido L.
    de Moura, Joaquim
    Novo, Jorge
    Ortega, Marcos
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 173 (173)
  • [43] Breast Cancer Diagnosis in Digital Breast Tomosynthesis: Effects of Training Sample Size on Multi-Stage Transfer Learning Using Deep Neural Nets
    Samala, Ravi K.
    Chan, Heang-Ping
    Hadjiiski, Lubomir
    Helvie, Mark A.
    Richter, Caleb D.
    Cha, Kenny H.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (03) : 686 - 696
  • [44] An improved Wi-Fi sensing-based human activity recognition using multi-stage deep learning model
    Sruthi, P.
    Udgata, Siba K.
    SOFT COMPUTING, 2022, 26 (09) : 4509 - 4518
  • [45] A new multi-source Transfer Learning method based on Two-stage Weighted Fusion
    Huang, Linqing
    Fan, Jinfu
    Zhao, Wangbo
    You, Yang
    KNOWLEDGE-BASED SYSTEMS, 2023, 262
  • [46] Remaining useful life prediction based on transfer multi-stage shrinkage attention temporal convolutional network under variable working conditions
    Li, Wanxiang
    Shang, Zhiwu
    Gao, Maosheng
    Qian, Shiqi
    Feng, Zehua
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2022, 226
  • [47] Multi-Feature Representation Based COVID-19 Risk Stage Evaluation With Transfer Learning
    Kong, Xiangjie
    Li, Ning
    Zhang, Chenwei
    Shen, Guojiang
    Ning, Zhaolong
    Qiu, Tie
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (03): : 1359 - 1375
  • [48] Angular resampling-assisted multi-stage parameter transfer learning method for fault diagnosis from stable to time-varying operating conditions
    Huang, Guoyu
    Lin, Cuiying
    Kong, Yun
    Han, Qinkai
    Zhang, Jie
    Dai, Qiyi
    Li, Xiaowei
    Chen, Ke
    Dong, Mingming
    Chu, Fulei
    MEASUREMENT, 2025, 253
  • [49] Two-stage transfer learning-based nonparametric system identification with Gaussian process regression
    Wang, Shuyu
    Xu, Zuhua
    Chen, Minghao
    Zhao, Jun
    Fang, Jiakun
    Song, Chunyue
    COMPUTERS & CHEMICAL ENGINEERING, 2024, 189
  • [50] Improving the performance of multi-stage HER2 breast cancer detection in hematoxylin-eosin images based on ensemble deep learning
    Pateel, G. P.
    Senapati, Kedarnath
    Pandey, Abhishek Kumar
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100