An Overlapping Community Detection Algorithm Based on Triangle Reduction Weighted for Large-Scale Complex Network

被引:1
作者
Zhang, Hanning [1 ,2 ,5 ]
Dong, Bo [3 ,4 ,5 ]
Feng, Boqin [1 ,5 ]
Wu, Haiyu [1 ,5 ]
机构
[1] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Shaanxi Prov Key Lab Satellite & Terr Network Tec, Xian 710049, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Continuing Educ, Xian 710049, Peoples R China
[4] Xi An Jiao Tong Univ, Natl Engn Lab Big Data Analyt, Xian 710049, Peoples R China
[5] Xian Network Comp Data Technol Co Ltd, Xian 710049, Peoples R China
来源
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT I | 2020年 / 12452卷
关键词
Triangle reduction; Multi-label propagation; Complex networks; Community detection; GENETIC ALGORITHM; GA;
D O I
10.1007/978-3-030-60245-1_43
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this digital age, dramatically developed internet makes the data of complex networks appear an explosive growth, which aggrandizes the importance of multilevel community detection used in large-scale complex networks. Nowadays, there are so many community detection algorithms that could perform well on accuracy. However, none of them has an expected function to handle the time increasing problem which is caused by the inflated network scale. Hence, we propose An Overlapping Community Detection Algorithm based on Triangle Reduction Weighted for Large-scale Complex Network (TRWLPA). It consists of two main steps: 1) Transforming the original network to a small-scale triangle reduction network. This network could not only dramatically reduce the running time of community detection, but recover the original network structure by the inverse transformation. Moreover, the scale of the triangle reduction network could be controlled by setting the iteration times. 2) Doing the multi-label propagation on the reduced networks where the weight of each node is the number of initial nodes it contains. The experiments illustrate that the TRWLPA algorithm significantly reduces the running time of community detection on Youtube, DBLP, and LiveJournal datasets. Particularly, comparing with the MOSES algorithm, it achieves 98.1% running time reduction on the Youtube dataset. Furthermore, our algorithm performs well on both modularity and Normalized Mutual Information measure (NMI).
引用
收藏
页码:627 / 644
页数:18
相关论文
共 24 条
[1]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[2]  
Chang PC, 2013, PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON THE MODERN DEVELOPMENT OF HUMANITIES AND SOCIAL SCIENCE, P25
[3]   Detecting functional modules in the yeast protein-protein interaction network [J].
Chen, Jingchun ;
Yuan, Bo .
BIOINFORMATICS, 2006, 22 (18) :2283-2290
[4]  
Chien-Cheng Lee, 2010, 2010 International Computer Symposium (ICS 2010), P1, DOI 10.1109/COMPSYM.2010.5685519
[5]  
Ereteo G., 2011, 2011 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies, P324, DOI 10.1109/WI-IAT.2011.98
[6]   Finding overlapping communities in networks by label propagation [J].
Gregory, Steve .
NEW JOURNAL OF PHYSICS, 2010, 12
[7]  
Hu W, 2013, FINDING STAT SIGN CO
[8]  
Jierui Xie, 2012, Advances in Knowledge Discovery and Data Mining. Proceedings 16th Pacific-Asia Conference (PAKDD 2012), P25, DOI 10.1007/978-3-642-30220-6_3
[9]  
[康颖 Kang Ying], 2016, [计算机学报, Chinese Journal of Computers], V39, P169
[10]  
Kernighan B. W., 1970, Bell System Technical Journal, V49, P291