sonLP: Social Network Link Prediction by Principal Component Regression

Cited by: 0
Authors
Bao, Zhifeng [1]
Zeng, Yong [1]
Tay, Y. C. [1]
Affiliations
[1] National University of Singapore, Singapore 117548, Singapore
Source
2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) | 2013
Keywords
link prediction; imbalanced samples
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Social networks are driven by social interaction and therefore dynamic. When modeled as a graph, nodes and links are continually added and deleted, and there is considerable interest in social network analysis on predicting link formation. Current work has not adequately addressed three issues: (1) Most link predictors start with using features from the link topology as input. How do features in other dimensions of the social network data affect link formation? (2) The dynamic nature of social networks implies the features driving link formation are constantly changing. How can a predictor automatically select the features that are important for link formation? (3) Node pairs that are not linked can outnumber links by orders of magnitude, but previous work does not address this imbalance. How can we design a predictor that is robust with respect to link imbalance? This paper presents sonLP, a social network link predictor. It uses principal component analysis to identify features that are important to link prediction, its tradeoff between true and false positives is near optimal for a wide range of link imbalance, and it has optimal time complexity. Experiments with coauthorship prediction in the ACM researcher community also show the importance of using features outside the links' dimension.
Pages: 370-377
Page count: 8