Learning the heterogeneous bibliographic information network for literature-based discovery

Cited by: 26
Authors
Sebastian, Yakub [1]
Siew, Eu-Gene [2]
Orimaye, Sylvester Olubolu [1]
Affiliations
[1] Monash Univ Malaysia, Sch Informat Technol, Subang Jaya, Selangor, Malaysia
[2] Monash Univ Malaysia, Sch Business, Subang Jaya, Selangor, Malaysia
Keywords
Literature-based discovery; Heterogeneous bibliographic information network; Link prediction; LINK-PREDICTION; METHODOLOGY; SIMILARITY; SEARCH;
DOI
10.1016/j.knosys.2016.10.015
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents HBIN-LBD, a novel literature-based discovery (LBD) method that exploits the lexicocitation structures within the heterogeneous bibliographic information network (HBIN) graphs. Unlike other existing LBD methods, HBIN-LBD harnesses the metapath features found in HBIN graphs for discovering the latent associations between scientific papers published in otherwise disconnected research areas. Further, this paper investigates the effects of incorporating semantic and topic modeling components into the proposed models. Using time-sliced historical bibliographic data, we demonstrate the performance of our method by reconstructing two LBD hypotheses: the Fish Oil and Raynaud's Syndrome hypothesis and the Migraine and Magnesium hypothesis. The proposed method is capable of predicting the future co-citation links between research papers of these previously disconnected research areas with up to 88.86% accuracy and 0.89 F-measure. (C) 2016 Elsevier B.V. All rights reserved.
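To make the idea of metapath features concrete, here is a minimal sketch of how two papers might be scored by counting the paths that connect them through a shared intermediate node. It is not the authors' implementation: the abstract does not list the metapaths HBIN-LBD uses, so the two shown here (paper-term-paper and paper-cites-paper), the toy data, and the names `paper_terms`, `paper_cites`, `metapath_count`, and `pair_features` are all assumptions made for illustration.

```python
# Hypothetical toy network (illustrative data, not from the paper):
# each paper maps to the terms it contains and to the references it cites.
paper_terms = {
    "p1": {"fish oil", "blood viscosity"},
    "p2": {"raynaud", "blood viscosity"},
    "p3": {"magnesium"},
}
paper_cites = {
    "p1": {"r1", "r2"},
    "p2": {"r2", "r3"},
    "p3": {"r4"},
}


def metapath_count(a, b, neighbours):
    """Count path instances a - x - b that pass through a shared node x."""
    return len(neighbours.get(a, set()) & neighbours.get(b, set()))


def pair_features(a, b):
    """Build an assumed metapath-count feature vector for the pair (a, b)."""
    return {
        # Lexical metapath: both papers contain the same term.
        "paper-term-paper": metapath_count(a, b, paper_terms),
        # Citation metapath: both papers cite the same reference.
        "paper-cites-paper": metapath_count(a, b, paper_cites),
    }


if __name__ == "__main__":
    print(pair_features("p1", "p2"))
    print(pair_features("p1", "p3"))
```

In a link-prediction setting, feature vectors of this kind, computed on an earlier time slice, could be fed to a classifier that predicts whether the pair becomes co-cited in a later slice. The choice of classifier is left open here.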
Pages: 66-79
Page count: 14