Negation-based transfer learning for improving biomedical Named Entity Recognition and Relation Extraction

被引:16
作者
Fabregat, Hermenegildo [1 ,3 ]
Duque, Andres [1 ,2 ]
Martinez-Romo, Juan [1 ,2 ]
Araujo, Lourdes [1 ,2 ]
机构
[1] Univ Nacl Educ Distancia UNED ETS Ingn Informat, Juan del Rosal 16, Madrid 28040, Spain
[2] Escuela Nacl San IMIENS, Inst Mixto Invest, Madrid, Spain
[3] Avature Machine Learning, Madrid, Spain
关键词
Transfer learning; Named Entity Recognition; Negation detection; Relation Extraction; CORPUS;
D O I
10.1016/j.jbi.2022.104279
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objectives: Named Entity Recognition (NER) and Relation Extraction (RE) are two of the most studied tasks in biomedical Natural Language Processing (NLP). The detection of specific terms and entities and the relationships between them are key aspects for the development of more complex automatic systems in the biomedical field. In this work, we explore transfer learning techniques for incorporating information about negation into systems performing NER and RE. The main purpose of this research is to analyse to what extent the successful detection of negated entities in separate tasks helps in the detection of biomedical entities and their relationships.Methods: Three neural architectures are proposed in this work, all of them mainly based on Bidirectional Long Short-Term Memory (Bi-LSTM) networks and Conditional Random Fields (CRFs). While the first architecture is devoted to detecting triggers and scopes of negated entities in any domain, two specific models are developed for performing isolated NER tasks and joint NER and RE tasks in the biomedical domain. Then, weights related to negation detection learned by the first architecture are incorporated into those last models. Two different languages, Spanish and English, are taken into account in the experiments.Results: Performance of the biomedical models is analysed both when the weights of the neural networks are randomly initialized, and when weights from the negation detection model are incorporated into them. Improvements of around 3.5% of F-Measure in the English language and more than 7% in the Spanish language are achieved in the NER task, while the NER+RE task increases F-Measure scores by more than 13% for the NER submodel and around 2% for the RE submodel.Conclusions: The obtained results allow us to conclude that negation-based transfer learning techniques are appropriate for performing biomedical NER and RE tasks. These results highlight the importance of detecting negation for improving the identification of biomedical entities and their relationships. The explored techniques show robustness by maintaining consistent results and improvements across different tasks and languages.
引用
收藏
页数:17
相关论文
共 77 条
[1]  
Agerri R., 2018, CEUR WORKSHOP PROC, P25
[2]  
Agerri R, 2014, LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P3823
[3]   BERT-Based Transfer-Learning Approach for Nested Named-Entity Recognition Using Joint Labeling [J].
Agrawal, Ankit ;
Tripathi, Sarsij ;
Vardhan, Manu ;
Sihag, Vikas ;
Choudhary, Gaurav ;
Dragoni, Nicola .
APPLIED SCIENCES-BASEL, 2022, 12 (03)
[4]  
Aronson AR, 2001, J AM MED INFORM ASSN, P17
[5]  
Benikova D, 2014, LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P2524
[6]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[7]  
Bravo A., 2019, IBERLEF SEPLN, P51
[8]   Extraction of semantic biomedical relations from text using conditional random fields [J].
Bundschus, Markus ;
Dejori, Mathaeus ;
Stetter, Martin ;
Tresp, Volker ;
Kriegel, Hans-Peter .
BMC BIOINFORMATICS, 2008, 9 (1)
[9]  
Cardellino Cristian, 2019, Spanish Billion Words Corpus and Embeddings
[10]   Measuring the effect of different types of unsupervised word representations on Medical Named Entity Recognition [J].
Casillas, Arantza ;
Ezeiza, Nerea ;
Goenaga, Takes ;
Perez, Alicia ;
Soto, Xabier .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2019, 129 :100-106