Link Prediction in Microblog Network Using Supervised Learning with Multiple Features

被引:7
作者
Han, Siyao [1 ]
Xu, Yan [1 ]
机构
[1] Beijing Language & Culture Univ, Dept Informat Sci, Beijing, Peoples R China
关键词
Link prediction; microblog; feature extraction; supervised learning;
D O I
10.17706/jcp.11.1.72-82
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Link prediction ( LP) is a fundamental network analysis task. It aims to analyze the existing links and predict the missing or potential relations between users in a social network. It can help users in finding new friends, enhance their loyalties to the web sites and build a healthy social environment. In previous researches, much attention was focused on structure information or node attributes, in order to analyze the global or local properties. Considering the nature of Microblog social network, we proposed a link prediction system combining multiple features from different perspectives, and learn a classifier from these feature subsets to predict the potential links. We train classifiers using SVM, Naive Bayes, and Random Forest and Logistic Regression algorithms and evaluate them using the microblog network dataset. The results show that our features perform better than the traditional features, and the combination of multiple features can achieve highest accuracy.
引用
收藏
页码:72 / 82
页数:11
相关论文
共 25 条
[1]   Friends and neighbors on the Web [J].
Adamic, LA ;
Adar, E .
SOCIAL NETWORKS, 2003, 25 (03) :211-230
[2]  
[Anonymous], PHYS REV
[3]  
Bilgic M, 2007, P 7 IEEE INT C DAT M, P381
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]  
Fire M., 2011, Proceedings of the 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust and IEEE Third International Conference on Social Computing (PASSAT/SocialCom 2011), P73, DOI 10.1109/PASSAT/SocialCom.2011.20
[6]  
Gettor L., 2005, SIGKDD EXPLORATIONS, V7
[7]  
Hasan Mohammad Al, 2006, SDM06 WORKSHOP LINK
[8]  
Huang Z., 2005, P 5 ACM IEEE CS JOIN
[9]  
Jia J., 2013, P 22 INT C WORLD WID, P93
[10]  
Kim J. G., 2014, T MAT PROCESSING, V23, P23