A Novel Automated Approach to Mutation-Cancer Relation Extraction by Incorporating Heterogeneous Knowledge

被引:2
|
作者
Cao, Jiarun [1 ]
van Veen, Elke M. [2 ,3 ]
Peek, Niels [4 ]
Renehan, Andrew G. [5 ,6 ,7 ]
Ananiadou, Sophia [1 ,7 ]
机构
[1] Univ Manchester, Natl Ctr Text Min, Dept Comp Sci, Manchester M13 9PL, England
[2] Manchester Univ Hosp NHS Fdn Trust, St Marys Hosp, Manchester Ctr Genom Med, Manchester M13 9PL, England
[3] Univ Manchester, Fac Biol Med Hlth, Sch Biol Sci, Div Evolut,Genom Sci, Manchester M13 9PL, England
[4] Univ Manchester, Fac Biol Med Hlth, Sch Hlth Sci, Div Informat Imaging Data Sci,Ctr Hlth Infromat, Manchester M13, England
[5] Univ Manchester, Sch Med Sci, Fac Biol Med Hlth, Div Canc Sci, Manchester M13 9PL, England
[6] Univ Manchester, Manchester Canc Res Ctr, NIHR Manchester Biomed Res Ctr, Manchester M13 9PL, England
[7] Alan Turing Inst, London NW1 2DB, England
基金
英国生物技术与生命科学研究理事会;
关键词
Cancer; Bioinformatics; Genomics; Text mining; Biological system modeling; Tumors; Task analysis; Information extraction; relation extraction; biomedical text mining; deep learning; TMVAR;
D O I
10.1109/JBHI.2022.3220924
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic extraction of relations between gene mutations and cancer entities occurring in the cancer literature using text mining can rapidly provide vital information to support precision cancer medicine. However, mutation-cancer relation extraction is more challenging than general relation extraction from free text, since it is often not possible without cancer-specific background knowledge and thus the model replies on a deeper understanding of complex surrounding tokens. We propose a deep learning model that jointly extracts mutations and their associated cancers. Background knowledge comes from two different knowledge bases which store different types of information about mutations. Given the different ways in which knowledge is stored in these two resources, we propose two separate methods for embedding knowledge, namely sentence-based knowledge integration and attribute-aware knowledge integration. The evaluation demonstrated that our model outperforms a number of baseline models and gains 96.00%, 92.57% and 94.57% F1 scores on three public datasets, EMU BCa, EMU PCa, and BRONCO, thus illustrating the effectiveness of our knowledge integration approach. The auxiliary experiments show that our models can utilize more informative text from the KBs and link the mutations to their corresponding cancer disease although the input text provides insufficient context.
引用
收藏
页码:1096 / 1105
页数:10
相关论文
共 15 条
  • [1] Relation Extraction for Knowledge Base Completion: A Supervised Approach
    Cerezo-Costas, Hector
    Martin-Vicente, Manuela
    SEMANTIC WEB CHALLENGES, SEMWEBEVAL 2018, 2018, 927 : 52 - 66
  • [2] BioKnowPrompt: Incorporating imprecise knowledge into prompt-tuning verbalizer with biomedical text for relation extraction
    Li, Qing
    Wang, Yichen
    You, Tao
    Lu, Yantao
    INFORMATION SCIENCES, 2022, 617 : 346 - 358
  • [3] TEBC-Net: An Effective Relation Extraction Approach for Simple Question Answering over Knowledge Graphs
    Li, Jianbin
    Qu, Ketong
    Yan, Jingchen
    Zhou, Liting
    Cheng, Long
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 154 - 165
  • [4] A Novel Entity Relation Extraction Approach Based on Micro-Blog
    Zheng, Hao
    Du, YaJun
    Wang, SiDa
    Li, ChenXing
    Yang, JianBo
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 401 - 410
  • [5] Automated Neuroanatomical Relation Extraction: A Linguistically Motivated Approach with a PVT Connectivity Graph Case Study
    Gokdeniz, Erinc
    Ozgur, Arzucan
    Canbeyli, Resit
    FRONTIERS IN NEUROINFORMATICS, 2016, 10
  • [6] Enhanced Heterogeneous Graph Attention Network with a Novel Multilabel Focal Loss for Document-Level Relation Extraction
    Chen, Yang
    Shi, Bowen
    ENTROPY, 2024, 26 (03)
  • [7] Design knowledge graph-aided conceptual product design approach based on joint entity and relation extraction
    Huang, Yuexin
    Yu, Suihuai
    Chu, Jianjie
    Su, Zhaojing
    Zhu, Yaokang
    Wang, Hanyu
    Wang, Mengcheng
    Fan, Hao
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (03) : 5333 - 5355
  • [8] Automated tabulation of clinical trial results: A joint entity and relation extraction approach with transformer-based language representations
    Whitton, Jetsun
    Hunter, Anthony
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 144
  • [9] A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction
    Yongsun Shim
    Munhwan Lee
    Pil-Jong Kim
    Hong-Gee Kim
    BMC Bioinformatics, 23
  • [10] A novel approach to predicting the synergy of anti-cancer drug combinations using document-based feature extraction
    Shim, Yongsun
    Lee, Munhwan
    Kim, Pil-Jong
    Kim, Hong-Gee
    BMC BIOINFORMATICS, 2022, 23 (01)