PFCA: An influence-based parallel fuzzy clustering algorithm for large complex networks

Times cited: 1
Authors
Bhatia, Vandana [1]
Rani, Rinkle [1]
Affiliations
[1] Thapar Univ, Dept Comp Sci & Engn, Patiala 147004, Punjab, India
Keywords
big data; complex networks; fuzzy clustering; PageRank; Pregel; community structure; c-means; modularity; MapReduce
DOI
10.1111/exsy.12295
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Clustering reveals the patterns present in a network and thereby yields useful insights. In real-world complex networks, analysing the network structure plays a vital role in clustering. Most existing clustering algorithms identify only disjoint clusters and do not consider the structure of the network; moreover, their results lack consistency and precision. This paper presents PFCA, an efficient parallel fuzzy clustering algorithm for large complex networks, built on Hadoop and Pregel (a parallel processing framework for large graphs). The algorithm first selects candidate cluster heads on the basis of their influence in the network and then determines the number of clusters by analysing the graph structure with the PageRank algorithm. It identifies both disjoint and fuzzy clusters efficiently and computes membership values only for those vertices that belong to more than one cluster. Performance is validated on six real-life networks with up to billions of connections. The experimental results show that the algorithm scales linearly as network size increases, and that it is efficient and achieves high precision compared with other state-of-the-art fuzzy clustering algorithms in terms of F-score and modularity.
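The first stage described in the abstract (picking candidate cluster heads by influence) can be illustrated with a minimal single-machine sketch. It assumes that influence is measured by plain PageRank and that the candidate heads are simply the k highest-ranked vertices. The record does not give the paper's actual influence measure, its rule for determining the number of clusters, or its Pregel implementation, so the names `pagerank`, `candidate_heads`, and `k`, as well as the toy graph, are hypothetical.

```python
from typing import Dict, List

# Adjacency list: vertex -> list of out-neighbours.
# Assumption: every neighbour also appears as a key of the dictionary.
Graph = Dict[str, List[str]]


def pagerank(graph: Graph, damping: float = 0.85,
             tol: float = 1e-10, max_iter: int = 200) -> Dict[str, float]:
    """Power-iteration PageRank; rank of dangling vertices is spread uniformly."""
    nodes = list(graph)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(max_iter):
        # Rank held by vertices without out-links is redistributed to everyone.
        dangling = sum(rank[v] for v in nodes if not graph[v])
        new = {v: (1.0 - damping) / n + damping * dangling / n for v in nodes}
        for v in nodes:
            out = graph[v]
            if out:
                share = damping * rank[v] / len(out)
                for w in out:
                    new[w] += share
        delta = sum(abs(new[v] - rank[v]) for v in nodes)
        rank = new
        if delta < tol:
            break
    return rank


def candidate_heads(graph: Graph, k: int) -> List[str]:
    """Return the k highest-ranked vertices (illustrative selection rule only)."""
    scores = pagerank(graph)
    return sorted(scores, key=lambda v: (-scores[v], v))[:k]


# Toy undirected graph: two triangles joined by the bridge c-d.
example: Graph = {
    "a": ["b", "c"],
    "b": ["a", "c"],
    "c": ["a", "b", "d"],
    "d": ["c", "e", "f"],
    "e": ["d", "f"],
    "f": ["d", "e"],
}

scores = pagerank(example)
heads = candidate_heads(example, 2)
```

On this toy graph the two bridge vertices receive the highest scores, so each triangle gets one candidate head. A Pregel-style version would express the same update as per-vertex message passing across supersteps; that translation is not shown here.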
Pages: 18