Community aware random walk for network embedding

被引:71
作者
Keikha, Mohammad Mehdi [1 ,2 ]
Rahgozar, Maseud [1 ]
Asadpour, Masoud [1 ]
机构
[1] Univ Tehran, Coll Engn, Sch Elect & Comp Engn, Tehran, Iran
[2] Univ Sistan & Baluchestan, Zahedan, Iran
关键词
Representation learning; Network embedding; Community detection; Skip-gram model; Link prediction; PREDICTION; GRAPH;
D O I
10.1016/j.knosys.2018.02.028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social network analysis provides meaningful information about behavior of network members that can be used for diverse applications such as classification, link prediction. However, network analysis is computationally expensive because of feature learning for different applications. In recent years, many researches have focused on feature learning methods in social networks. Network embedding represents the network in a lower dimensional representation space with the same properties which presents a compressed representation of the network. In this paper, we introduce a novel algorithm named "CARE" for network embedding that can be used for different types of networks including weighted, directed and complex. Current methods try to preserve local neighborhood information of nodes, whereas the proposed method utilizes local neighborhood and community information of network nodes to cover both local and global structure of social networks. CARE builds customized paths, which are consisted of local and global structure of network nodes, as a basis for network embedding and uses the Skip-gram model to learn representation vector of nodes. Subsequently, stochastic gradient descent is applied to optimize our objective function and learn the final representation of nodes. Our method can be scalable when new nodes are appended to network without information loss. Parallelize generation of customized random walks is also used for speeding up CARE. We evaluate the performance of CARE on multi label classification and link prediction tasks. Experimental results on various networks indicate that the proposed method outperforms others in both Micro and Macro-fl measures for different size of training data. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:47 / 54
页数:8
相关论文
共 37 条
[1]  
Ahmed Amr, 2013, WWW, P37
[2]  
[Anonymous], 2016, P 22 ACM SIGKDD INT
[3]  
[Anonymous], 2015, LINE: Large-scale information network embedding
[4]  
[Anonymous], 2007, Introduction to statistical relational learning
[5]  
[Anonymous], SUPERVISED RANDOM WA
[6]  
[Anonymous], 2006, COMPUT SCI, DOI DOI 10.4018/JDWM.2007070101
[7]  
[Anonymous], 2009, Social computing data repository at ASU
[8]  
[Anonymous], 2011, Large text compression benchmark
[9]  
[Anonymous], 2014, PROC C EMPIRICAL MET
[10]  
[Anonymous], 2013, P WORKSHOP ICLR 2013