Multi-task transfer learning for biomedical machine reading comprehension

Cited by: 0
Authors
Guo, Wenyang [1]
Du, Yongping [1]
Zhao, Yiliang [1]
Ren, Keyan [1]
Affiliation
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
Funding
National Key Research and Development Program
Keywords
biomedical machine reading comprehension; multi-task learning; transfer learning; attention; data augmentation;
DOI
10.1504/IJDMB.2020.107878
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Biomedical machine reading comprehension aims to extract the answer to a given question from complex biomedical passages, which requires the machine to have strong natural language comprehension. Recent progress has been made on this task, but it is still severely restricted by insufficient training data owing to its domain-specific nature. To solve this problem, we propose a hierarchical question-aware context learning model trained with a multi-task transfer learning algorithm. The model captures the interaction between the question and the passage layer by layer and uses multi-level embeddings to strengthen its language representation. The multi-task transfer learning algorithm leverages the advantages of different machine reading comprehension tasks to improve model generalisation and robustness, pre-training on multiple large-scale open-domain data sets and fine-tuning on the target-domain training set. Moreover, data augmentation is adopted to create new training samples with varied expressions. The public biomedical data set collected from PubMed and provided by BioASQ is used to evaluate model performance. The results show that our method is superior to the best recent solution and achieves a new state of the art.
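The two-phase training order described in the abstract (pre-training on several open-domain data sets, then fine-tuning on the target-domain set) can be illustrated with a minimal sketch. This is not the authors' implementation: the function `build_schedule`, the task names, the batch counts, and the shuffled interleaving of source tasks are all assumptions made for illustration, and the abstract does not say how batches from different tasks are mixed.

```python
import random
from typing import Dict, List, Tuple


def build_schedule(
    source_batches: Dict[str, int],
    target_name: str,
    target_batches: int,
    finetune_epochs: int = 2,
    seed: int = 0,
) -> List[Tuple[str, str]]:
    """Return (phase, task) steps: mixed pre-training, then fine-tuning.

    Hypothetical helper: names and mixing strategy are illustrative only.
    """
    rng = random.Random(seed)
    # Phase 1: one step per source batch, shuffled so that the
    # open-domain tasks are interleaved rather than run one after another.
    pretrain = [
        ("pretrain", name)
        for name, count in source_batches.items()
        for _ in range(count)
    ]
    rng.shuffle(pretrain)
    # Phase 2: only target-domain batches, repeated for a few epochs.
    finetune = [("finetune", target_name)] * (target_batches * finetune_epochs)
    return pretrain + finetune


# Placeholder task names; the abstract does not name the source data sets.
schedule = build_schedule(
    {"open_domain_a": 3, "open_domain_b": 2},
    target_name="biomedical_target",
    target_batches=2,
)
for phase, task in schedule:
    print(phase, task)
```

In a real training loop each step would draw a batch from the named task and update shared model parameters; the sketch only fixes the order in which tasks are visited.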
Pages: 234-250
Number of pages: 17
Related papers
50 in total
  • [31] Multi-task gradient descent for multi-task learning
    Bai, Lu
    Ong, Yew-Soon
    He, Tiantian
    Gupta, Abhishek
    MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369
  • [32] Driver Drowsiness Detection by Multi-task and Transfer Learning
    Chang, Yuan
    Kameyama, Wataru
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2022, 2022, 12177
  • [33] Multi-task Transfer Learning for Bayesian Network Structures
    Benikhlef, Sarah
    Leray, Philippe
    Raschia, Guillaume
    Ben Messaoud, Montassar
    Sakly, Fayrouz
    SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2021, 2021, 12897 : 217 - 228
  • [34] Episodic memory transfer for multi-task reinforcement learning
    Sorokin, Artyom Y.
    Burtsev, Mikhail S.
    BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, 2018, 26 : 91 - 95
  • [35] BIOMRC: A Dataset for Biomedical Machine Reading Comprehension
    Stavropoulos, Petros
    Pappas, Dimitris
    Androutsopoulos, Ion
    McDonald, Ryan
    19TH SIGBIOMED WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2020), 2020, : 140 - 149
  • [36] An Empirical Study of Multi-Task Learning on BERT for Biomedical Text Mining
    Peng, Yifan
    Chen, Qingyu
    Lu, Zhiyong
    19TH SIGBIOMED WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2020), 2020, : 205 - 214
  • [37] Multi-task learning for few-shot biomedical relation extraction
    Moscato, Vincenzo
    Napolano, Giuseppe
    Postiglione, Marco
    Sperli, Giancarlo
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (11) : 13743 - 13763
  • [38] BioADAPT-MRC: adversarial learning-based domain adaptation improves biomedical machine reading comprehension task
    Mahbub, Maria
    Srinivasan, Sudarshan
    Begoli, Edmon
    Peterson, Gregory D.
    BIOINFORMATICS, 2022, 38 (18) : 4369 - 4379
  • [39] A multi-task learning based approach to biomedical entity relation extraction
    Li, Qingqing
    Yang, Zhihao
    Luo, Ling
    Wang, Lei
    Zhang, Yin
    Lin, Hongfei
    Wang, Jian
    Yang, Liang
    Xu, Kan
    Zhang, Yijia
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 680 - 682
  • [40] Multi-task learning for few-shot biomedical relation extraction
    Vincenzo Moscato
    Giuseppe Napolano
    Marco Postiglione
    Giancarlo Sperlì
    Artificial Intelligence Review, 2023, 56 : 13743 - 13763