A Unified Link Prediction Framework for Predicting Arbitrary Relations in Heterogeneous Academic Networks

被引:7
作者
Lu, Meilian [1 ]
Wei, Xudan [1 ]
Ye, Danna [1 ]
Dai, Yinlong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Heterogeneous academic networks; link prediction; meta-path; random walk; similarity measure; RELEVANCE MEASURE;
D O I
10.1109/ACCESS.2019.2939172
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of the existing link prediction methods for heterogeneous academic networks can only predict one or two specific relation types rather than arbitrary relation types. Although several recently proposed methods have involved multi-relational prediction problems, they do not comprehensively consider the rich semantic or temporal information of heterogeneous academic networks. Considering that researchers may have diverse requirements for different types of academic resources, in this study, we propose a new unified link prediction framework (UniLPF) for arbitrary types of academic relations. First, a weighted and directed heterogeneous academic network containing rich academic objects and relations is constructed. Then, an automatic meta-path searching method is proposed to extract the meta-paths for arbitrary prediction tasks. Two meta-path based object similarity measures combining temporal information and content relevance are also proposed to measure the features of the meta-paths. Finally, a pervasive link prediction model is built, which can be embodied based on an arbitrarily specified prediction task and the corresponding meta-path features. Extensive experiments for predicting various relation types with practical significance are conducted on a large-scale Microsoft Academic dataset. The experimental results demonstrate that our proposed UniLPF framework can predict arbitrary specified academic relations, and outperforms the comparison methods in terms of F-measure, accuracy, AUC and ROC. In addition, the time scalability experiments prove that UniLPF also achieves good performance for predicting the academic relations over time.
引用
收藏
页码:124967 / 124987
页数:21
相关论文
共 46 条
[1]  
Allali O., 2011, IEEE INFOCOM 2011 - IEEE Conference on Computer Communications. Workshops, P936, DOI 10.1109/INFCOMW.2011.5928947
[2]  
[Anonymous], 2004, SIGMOD
[3]  
[Anonymous], 2014, 20 ACM SIGKDD INT C, DOI DOI 10.1145/2623330.2623732
[4]   Measuring Similarity Similarly: LDA and Human Perception [J].
Ben Towne, W. ;
Rose, Carolyn P. ;
Herbsleb, James D. .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2016, 8 (01)
[5]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[6]   Collective Prediction of Multiple Types of Links in Heterogeneous Information Networks [J].
Cao, Bokai ;
Kong, Xiangnan ;
Yu, Philip S. .
2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, :50-59
[7]   PME: Projected Metric Embedding on Heterogeneous Networks for Link Prediction [J].
Chen, Hongxu ;
Yin, Hongzhi ;
Wang, Weiqing ;
Wang, Hao ;
Quoc Viet Hung Nguyen ;
Li, Xue .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :1177-1186
[8]  
Chowdhurry G. G., 1983, INTRO MODERN INFORM
[9]   Multi-Relational Link Prediction in Heterogeneous Information Networks [J].
Davis, Darcy ;
Lichtenwalter, Ryan ;
Chawla, Nitesh V. .
2011 INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2011), 2011, :281-288
[10]  
Gallinari P., 2011, P 20 ACM INT C INF K, P1169