PLDLS: A novel parallel label diffusion and label Selection-based community detection algorithm based on Spark in social networks

被引:23
作者
Roghani, Hamid [1 ]
Bouyer, Asgarali [1 ]
Nourani, Esmaeil [1 ]
机构
[1] Azarbaijan Shahid Madani Univ, Dept Comp Engn, Tabriz, Iran
关键词
Parallel community detection; Label diffusion; Local similarity; Label selection; Spark; Social networks; COMPLEX NETWORKS; PROPAGATION ALGORITHM; LOCAL SIMILARITY; MODULARITY; NODES; ORGANIZATION; GRAPH;
D O I
10.1016/j.eswa.2021.115377
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Parallel and distributed community detection in large-scale complex networks, such as social networks, is a challenging task. Parallel and distributed algorithm with high accuracy and low computational complexity is one of the essential issues in the community detection field. In this paper, we propose a novel fast, and accurate Spark-based parallel label diffusion and label selection-based (PLDLS) community detection algorithm with twostep of label diffusion of core nodes along with a new label selection (propagation) method. We have used multi factor criteria for computing node's importance and adopted a new method for selecting core nodes. In the first phase, utilizing the fact that nodes forming triangles, tend to be in the same community, parallel label diffusion of core nodes is performed to diffuse labels up to two levels. In the second phase, through an iterative and parallel process, the most appropriate labels are assigned to the remaining nodes. PLDLS proposes an improved robust version of LPA by putting aside randomness parameter tuning. Furthermore, we utilize a fast and parallel merge phase to get even more dense and accurate communities. Conducted experiments on real-world and artificial networks, indicates the better accuracy and low execution time of PLDLS in comparison with other examined methods.
引用
收藏
页数:24
相关论文
共 95 条
  • [41] Leskovec J., 2005, P 11 ACM SIGKDD INT, P177
  • [42] Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters
    Leskovec, Jure
    Lang, Kevin J.
    Dasgupta, Anirban
    Mahoney, Michael W.
    [J]. INTERNET MATHEMATICS, 2009, 6 (01) : 29 - 123
  • [43] Detecting community structure via synchronous label propagation
    Li, Shenghong
    Lou, Hao
    Jiang, Wen
    Tang, Junhua
    [J]. NEUROCOMPUTING, 2015, 151 : 1063 - 1075
  • [44] The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations - Can geographic isolation explain this unique trait?
    Lusseau, D
    Schneider, K
    Boisseau, OJ
    Haase, P
    Slooten, E
    Dawson, SM
    [J]. BEHAVIORAL ECOLOGY AND SOCIOBIOLOGY, 2003, 54 (04) : 396 - 405
  • [45] PSPLPA: Probability and similarity based parallel label propagation algorithm on spark
    Ma, Tinghuai
    Yue, Mingliang
    Qu, Jingjing
    Tian, Yuan
    Al-Dhelaan, Abdullah
    Al-Rodhaan, Mznah
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 503 : 366 - 378
  • [46] Finding community structure in networks using the eigenvectors of matrices
    Newman, M. E. J.
    [J]. PHYSICAL REVIEW E, 2006, 74 (03)
  • [47] Modularity and community structure in networks
    Newman, M. E. J.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (23) : 8577 - 8582
  • [48] Why social networks are different from other types of networks
    Newman, MEJ
    Park, J
    [J]. PHYSICAL REVIEW E, 2003, 68 (03) : 8
  • [49] The structure of scientific collaboration networks
    Newman, MEJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (02) : 404 - 409
  • [50] Detecting community structure in complex networks via node similarity
    Pan, Ying
    Li, De-Hua
    Liu, Jian-Guo
    Liang, Jing-Zhang
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2010, 389 (14) : 2849 - 2857