Probability Matrix Factorization for Link Prediction Based on Information Fusion

被引:1
作者
Wang Z. [1 ]
Liang J. [1 ,2 ]
Li R. [1 ,2 ]
机构
[1] School of Computer & Information Technology, Shanxi University, Taiyuan
[2] Key Laboratory of Computation Intelligence & Chinese Information Processing, Shanxi University, Ministry of Education, Taiyuan
来源
Jisuanji Yanjiu yu Fazhan/Computer Research and Development | 2019年 / 56卷 / 02期
基金
中国国家自然科学基金;
关键词
Fusion model; Link prediction; Network data analysis; Probability matrix factorization; Social-information network;
D O I
10.7544/issn1000-1239.2019.20170746
中图分类号
学科分类号
摘要
As one kind of typical network big data, social-information networks such as Weibo and Twitter include both the complex network structure among users and rich microblog/Tweet information published by users. It is notable that most of the existing methods only make use of the network topological information or the non-topological information for link prediction, but there is still a lack of effective methods by fusing the topological information or non-topological information in social-information networks. A link prediction method is proposed from the perspective of users' topic by fusing users' topic similarity in social-information networks. The method goes in accordance with the following sequence: firstly, a topic similarity between users based on users' topic representation is defined, followed by which a topic similarity-based sparse network is constructed; secondly, the information of the following/followed network and the topic similarity-based network are fused into a unified framework of probabilistic matrix factorization, based on which the latent-feature representation of the network nodes and the linking relation parameters are obtained; finally, the linking probability between network nodes is calculated based on the obtained latent-feature representation and linking relation parameters. The proposed approach provides a general modeling strategy fusing multi-network information and a learning-based solution. Link prediction experiments are conducted on four real network datasets, i. e. Twitter and Weibo. The experimental results demonstrate that the proposed method is more effective than others. © 2019, Science Press. All right reserved.
引用
收藏
页码:306 / 318
页数:12
相关论文
共 50 条
[41]  
Lee D.D., Seung H.S., Algorithms for non-negative matrix factorization, Proc of the 7th Int Conf on Neural Information Processing Systems, pp. 556-562, (2000)
[42]  
Salakhutdinov R., Mnih A., Probabilistic matrix factorization, Proc of the 14th Int Conf on Neural Information Processing Systems, pp. 1257-1264, (2007)
[43]  
Aral S., Muchnik L., Sundararajan A., Distinguishing influence-based contagion from homophily-driven diffusion in dynamic networks, Proceedings of the National Academy of Sciences of the United States of America, 106, 51, pp. 21544-21549, (2009)
[44]  
Wu X., Li Y., Li L., Influence analysis of online social networks, Chinese Journal of Computers, 37, 6, pp. 735-752, (2014)
[45]  
Blei D.M., Ng A.Y., Jordan M.I., Latent dirichlet allocation, Journal of Machine Learning Research, 3, pp. 993-1022, (2003)
[46]  
Kullback S., The Kullback-Leibler distance, American Statistician, 41, 4, pp. 340-341, (1987)
[47]  
Zhu S., Yu K., Chi Y., Et al., Combining content and link for classification using matrix factorization, Proc of the 30th Annual Int Conf on Research and Development in Information Retrieval, pp. 487-494, (2007)
[48]  
Zhang J., Liu B., Tang J., Et al., Social influence locality for modeling retweeting behaviors, Proc of the 22nd Int Joint Conf on Artificial Intelligence, pp. 2761-2767, (2013)
[49]  
Pedregosa F., Gramfort A., Michel V., Et al., Scikit-learn: Machine learning in python, Journal of Machine Learning Research, 12, 10, pp. 2825-2830, (2011)
[50]  
Fawcett T., An introduction to ROC analysis, Pattern Recognition Letters, 27, 8, pp. 861-874, (2006)