Biomedical relation extraction via knowledge-enhanced reading comprehension

被引:25
作者
Chen, Jing [1 ]
Hu, Baotian [1 ]
Peng, Weihua [2 ]
Chen, Qingcai [1 ,3 ]
Tang, Buzhou [1 ,3 ]
机构
[1] Harbin Inst Technol Shenzhen, Intelligent Comp Res Ctr, Shenzhen, Peoples R China
[2] Baidu Int Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
关键词
Biomedical relation extraction; Reading comprehension; Knowledge attention mechanism;
D O I
10.1186/s12859-021-04534-5
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background In biomedical research, chemical and disease relation extraction from unstructured biomedical literature is an essential task. Effective context understanding and knowledge integration are two main research problems in this task. Most work of relation extraction focuses on classification for entity mention pairs. Inspired by the effectiveness of machine reading comprehension (RC) in the respect of context understanding, solving biomedical relation extraction with the RC framework at both intra-sentential and inter-sentential levels is a new topic worthy to be explored. Except for the unstructured biomedical text, many structured knowledge bases (KBs) provide valuable guidance for biomedical relation extraction. Utilizing knowledge in the RC framework is also worthy to be investigated. We propose a knowledge-enhanced reading comprehension (KRC) framework to leverage reading comprehension and prior knowledge for biomedical relation extraction. First, we generate questions for each relation, which reformulates the relation extraction task to a question answering task. Second, based on the RC framework, we integrate knowledge representation through an efficient knowledge-enhanced attention interaction mechanism to guide the biomedical relation extraction. Results The proposed model was evaluated on the BioCreative V CDR dataset and CHR dataset. Experiments show that our model achieved a competitive document-level F1 of 71.18% and 93.3%, respectively, compared with other methods. Conclusion Result analysis reveals that open-domain reading comprehension data and knowledge representation can help improve biomedical relation extraction in our proposed KRC framework. Our work can encourage more research on bridging reading comprehension and biomedical relation extraction and promote the biomedical relation extraction.
引用
收藏
页数:19
相关论文
共 29 条
[1]  
Bordes A, 2019, NIPS, P2787
[2]   Technical milestone - Medical subject headings used to search the biomedical literature [J].
Coletti, MH ;
Bleich, HL .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2001, 8 (04) :317-323
[3]  
Nguyen DQ, 2018, SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2018), P129
[4]   The Comparative Toxicogenomics Database: update 2019 [J].
Davis, Allan Peter ;
Grondin, Cynthia J. ;
Johnson, Robin J. ;
Sciaky, Daniela ;
McMorran, Roy ;
Wiegers, Jolene ;
Wiegers, Thomas C. ;
Mattingly, Carolyn J. .
NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) :D948-D954
[5]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[6]   Chemical-induced disease relation extraction with various linguistic features [J].
Gu, Jinghang ;
Qian, Longhua ;
Zhou, Guodong .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,
[7]  
Le HQ, 2017, INT CONF KNOWL SYS, P292, DOI 10.1109/KSE.2017.8119474
[8]   Deep learning [J].
LeCun, Yann ;
Bengio, Yoshua ;
Hinton, Geoffrey .
NATURE, 2015, 521 (7553) :436-444
[9]   BioBERT: a pre-trained biomedical language representation model for biomedical text mining [J].
Lee, Jinhyuk ;
Yoon, Wonjin ;
Kim, Sungdong ;
Kim, Donghyeon ;
Kim, Sunkyu ;
So, Chan Ho ;
Kang, Jaewoo .
BIOINFORMATICS, 2020, 36 (04) :1234-1240
[10]  
Levy Omer, 2017, P 2017 SIGNLL C COMP, P333, DOI DOI 10.18653/V1/K17-1034