MTTLADE: A multi-task transfer learning-based method for adverse drug events extraction

被引:32
作者
El-allaly, Ed-drissiya [1 ,2 ]
Sarrouti, Mourad [3 ]
En-Nahnahi, Noureddine [1 ]
El Alaoui, Said Ouatik [1 ,2 ]
机构
[1] Sidi Mohamed Ben Abdellah Univ, Lab Informat Signals Automat & Cognitivism LISAC, Fac Sci Dhar EL Mehraz, Fes, Morocco
[2] Ibn Tofail Univ, Lab Engn Sci, Natl Sch Appl Sci, Kenitra, Morocco
[3] US Natl Lib Med, NIH, Bethesda, MD USA
关键词
Adverse drug events; Transfer learning; Multi-task learning; Named entity recognition; Relation extraction; Natural language processing; INFORMATION-RETRIEVAL; ENTITY RECOGNITION; CLASSIFICATION;
D O I
10.1016/j.ipm.2020.102473
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extracting mentions of Adverse Drug Events (ADEs) and the potential relationships among them from clinical textual data remains challenging tasks due to the following issues: (1) many ADEs mentions have multiple relations, also known as the multi-head issue, and (2) many ADEs relations contain discontinuous mentions. To deal with these problems, in this paper, we propose a Multi -Task Transfer Learning-based method for ADEs extraction, called MTTLADE. Firstly, the MTTLADE system converts the ADEs extraction task to a dual-task sequence labelling which includes ADEs source mention extraction (ADE-SE) and ADEs attribute-relation extraction (ADEAtt-RE) tasks. The ADE-SE task aims at extracting the source mentions that are likely related to at least one relation, while the ADE-Att-RE task consists in linking the previously identified source mentions to their target attributes and relation types by adopting a unified sequence labelling. Then, it uses the multi-task transfer learning (MTTL) based approach to process the two proposed tasks simultaneously. The MTTL adopts a shared representation obtained from the pre-trained language model learned through transformer architecture and ends up with task specific fine-tuning. This allows the MTTLADE system to yield more generalized representation across the tasks. Finally, MTTLADE produces sequences for each task from the generated model so as to extract ADEs mentions and relations. Experimental evaluations conducted on two datasets provided by the TAC 2017 and n2c2 2018 shared tasks show the effectiveness and generalizability of MTTLADE. The proposed MTTLADE system significantly outperforms the state-of-the-art ones on both datasets. The results also show that combining transfer and multitask learning makes MTTLADE more effective for solving the multi-head issue and extracting intricate ADEs.
引用
收藏
页数:17
相关论文
共 62 条
  • [61] Identifying relations of medications with adverse drug events using recurrent convolutional neural networks and gradient boosting
    Yang, Xi
    Bian, Jiang
    Fang, Ruogu
    Bjarnadottir, Ragnhildur, I
    Hogan, William R.
    Wu, Yonghui
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (01) : 65 - 72
  • [62] Deep ranking based cost-sensitive multi-label learning for distant supervision relation extraction
    Ye, Hai
    Luo, Zhunchen
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)