Exploring Self-Supervised Graph Learning in Literature-Based Discovery

被引:1
作者
Ding, Juncheng [1 ]
Jin, Wei [1 ]
机构
[1] Univ North Texas, Dept Comp Sci & Engn, Denton, TX 76203 USA
来源
2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021) | 2021年
基金
美国国家科学基金会;
关键词
text mining; literature-based discovery; bioinformatics; graph neural network; self-supervised learning; CONNECTIONS;
D O I
10.1109/ICHI52183.2021.00021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Literature-Based Discovery (LBD) aims to find associations, as one or more connecting concepts, of two "unrelated" concepts hidden in the literature. The most recent studies fulfilled the task by learning the representations of the concepts and using the representations' similarities to decide whether a connecting concept is a reasonable one. However, these approaches cannot handle "complex" associations with multiple connecting concepts properly. To address this issue, we propose a neural network model LBDSetNet which can assign a "credibility" score to a plausible association with either one or more connecting concepts. By unifying both the literature and the candidate associations as bags of concepts, we can generate "less credible" literature and train LBDSetNet by enforcing it to distinguish the generated and original literature, overcoming the lack of labeled associations. We also propose a new double-margin cost function for better model training by generating "more credible" documents. We experiment to show that our model can find "complex" associations effectively and efficiently. Comparative experiments reveal that the LBDSetNet solution performs significantly better than the previously proposed models on "simple" associations. The double margin cost function also proves its advantage in model training.
引用
收藏
页码:53 / 62
页数:10
相关论文
共 29 条
[1]   T-PAIR: Temporal Node-Pair Embedding for Automatic Biomedical Hypothesis Generation [J].
Akujuobi, Uchenna ;
Spranger, Michael ;
Palaniappan, Sucheendra K. ;
Zhang, Xiangliang .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (06) :2988-3001
[2]   SYMPATHECTOMY FOR ESSENTIAL HYPERTENSION [J].
ALLEN, EV .
CIRCULATION, 1952, 6 (01) :131-140
[3]  
[Anonymous], 2016, P 8 INT C BIOINF COM
[4]   MeSH: a window into full text for document summarization [J].
Bhattacharya, Sanmitra ;
Viet Ha-Thuc ;
Srinivasan, Padmini .
BIOINFORMATICS, 2011, 27 (13) :I120-I128
[5]   A graph-based recovery and decomposition of Swanson's hypothesis using semantic predications [J].
Cameron, Delroy ;
Bodenreider, Olivier ;
Yalamanchili, Hima ;
Tu Danh ;
Vallabhaneni, Sreeram ;
Thirunarayan, Krishnaprasad ;
Sheth, Amit P. ;
Rindflesch, Thomas C. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2013, 46 (02) :238-251
[6]   Beyond triplet loss: a deep quadruplet network for person re-identification [J].
Chen, Weihua ;
Chen, Xiaotang ;
Zhang, Jianguo ;
Huang, Kaiqi .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1320-1329
[7]  
Clement D L, 1980, Verh K Acad Geneeskd Belg, V42, P164
[8]  
Devlin J, 2019, Arxiv, DOI [arXiv:1810.04805, DOI 10.48550/ARXIV.1810.04805]
[9]  
Ding JC, 2019, IEEE INT C BIOINFORM, P362, DOI 10.1109/BIBM47256.2019.8983400
[10]   DIGITAL ARTERY SYMPATHECTOMY [J].
FLATT, AE .
JOURNAL OF HAND SURGERY-AMERICAN VOLUME, 1980, 5 (06) :550-556