Disjoint and Overlapping Community Detection in Small-World Networks Leveraging Mean Path Length

被引:14
作者
Ghoshal, Arnab Kumar [1 ]
Das, Nabanita [2 ]
Das, Soham [3 ]
机构
[1] Asutosh Coll, Dept Comp Sci, Kolkata 700026, India
[2] Indian Stat Inst Kolkata, Adv Comp & Microelect Unit, Kolkata 700108, India
[3] Microsoft Corp, Redmond, WA 98052 USA
关键词
Convergence; Genetic algorithms; Upper bound; Indexes; Social networking (online); Knowledge engineering; Clustering algorithms; Disjoint community set (DCS); general-purpose graphic-processor units (GP-GPUs); genetic algorithms (GAs); mean path length; overlapping community set (OCS); small-world networks; GENETIC ALGORITHM; MEMBERSHIP; GA;
D O I
10.1109/TCSS.2021.3093038
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent developments on identifying the community structures in real-world networks have established that the community structure may be either disjoint community set (DCS) or overlapping community set (OCS), showing high resemblance with each other. Still, given a network, researchers mostly followed distinct approaches to achieve optimal solutions, either for DCS or for OCS. Moreover, prior knowledge of community structure is needed to select the appropriate class of algorithms, since one cannot produce the optimal solution for the other. In this article, a comprehensive two-phase approach based on genetic algorithm (GA) is proposed that can be applied to any small-world network to generate the DCS and the OCS very fast without any prior knowledge of the community structure. In the first phase, an upper bound on the mean path length of a community is applied, relative to the equivalent Erdos-Renyi (E-R) random graph that expedites the convergence significantly. In the second phase, the search space is reduced considerably, by selecting a smaller subset of boundary nodes of the DCS generated in the first phase, to be manipulated probabilistically. To the best of our knowledge, we are the first to consider the mean path length of the community as a key parameter for finding the good quality communities at the earliest. Experimental study on six synthetic networks and five real-world networks shows that the proposed approach not only outperforms the state-of-the-art algorithms in terms of quality and scalability, but its parallel implementation also improves the speedup significantly.
引用
收藏
页码:406 / 418
页数:13
相关论文
共 57 条
[1]  
[Anonymous], 1961, Acta Math. Acad. Sci. Hungar., DOI DOI 10.1007/BF02066689
[2]   Genetic algorithm-based community detection in large-scale social networks [J].
Behera, Ranjan Kumar ;
Naik, Debadatta ;
Rath, Santanu Kumar ;
Dharavath, Ramesh .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (13) :9649-9665
[3]   A Multi-Objective Genetic Algorithm for overlapping community detection based on edge encoding [J].
Bello-Orgaz, Gema ;
Salcedo-Sanz, Sancho ;
Camacho, David .
INFORMATION SCIENCES, 2018, 462 :290-314
[4]   Fast unfolding of communities in large networks [J].
Blondel, Vincent D. ;
Guillaume, Jean-Loup ;
Lambiotte, Renaud ;
Lefebvre, Etienne .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
[5]  
Bollobas B., 2001, Random Graphs, VSecond
[6]   On modularity clustering [J].
Brandes, Ulrik ;
Delling, Daniel ;
Gaertler, Marco ;
Goerke, Robert ;
Hoefer, Martin ;
Nikoloski, Zoran ;
Wagner, Dorothea .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (02) :172-188
[7]   A node-priority based large-scale overlapping community detection using evolutionary multi-objective optimization [J].
Chai, Zhengyi ;
Liang, Shijiao .
EVOLUTIONARY INTELLIGENCE, 2020, 13 (01) :59-68
[8]  
Chakraborty G., 2017, P INT S NONL THEOR A, P334
[9]   On the Permanence of Vertices in Network Communities [J].
Chakraborty, Tanmoy ;
Srinivasan, Sriram ;
Ganguly, Niloy ;
Mukherjee, Animesh ;
Bhowmick, Sanjukta .
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :1396-1405
[10]   Leveraging disjoint communities for detecting overlapping community structure [J].
Chakraborty, Tanmoy .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2015,