PFCA: An influence-based parallel fuzzy clustering algorithm for large complex networks

被引:1
|
作者
Bhatia, Vandana [1 ]
Rani, Rinkle [1 ]
机构
[1] Thapar Univ, Dept Comp Sci & Engn, Patiala 147004, Punjab, India
关键词
big data; complex networks; fuzzy clustering; PageRank; Pregel; COMMUNITY STRUCTURE; C-MEANS; MODULARITY; MAPREDUCE;
D O I
10.1111/exsy.12295
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering helps in understanding the patterns present in networks and thus helps in getting useful insights. In real-world complex networks, analysing the structure of the network plays a vital role in clustering. Most of the existing clustering algorithms identify disjoint clusters, which do not consider the structure of the network. Moreover, the clustering results do not provide consistency and precision. This paper presents an efficient parallel fuzzy clustering algorithm named "PFCA" for large complex networks using Hadoop and Pregel (parallel processing framework for large graphs). The proposed algorithm first selects the candidate cluster heads on the basis of their influence in the network and then determines the number of clusters by analysing the graph structure using PageRank algorithm. The proposed algorithm identifies both disjoint and fuzzy clusters efficiently and finds membership of only those vertices, which are the part of more than one cluster. The performance is validated on 6 real-life networks having up to billions of connections. The experimental results show that the proposed algorithm scales up linearly with the increase in size of network. It is also shown that the proposed algorithm is efficient and has high precision in comparison with the other state-of-art fuzzy clustering algorithms in terms of F score and modularity.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] A Seed Growth Algorithm for Local Clustering in Complex Networks
    Tsai, Feng-Sheng
    Hsu, Sheng-Yi
    Shih, Mau-Hsiang
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (06): : 5878 - 5891
  • [22] LP-LPA: A link influence-based label propagation algorithm for discovering community structures in networks
    Berahmand, Kamal
    Bouyer, Asgarali
    INTERNATIONAL JOURNAL OF MODERN PHYSICS B, 2018, 32 (06):
  • [23] Semi-supervised clustering algorithm for community structure detection in complex networks
    Ma, Xiaoke
    Gao, Lin
    Yong, Xuerong
    Fu, Lidong
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2010, 389 (01) : 187 - 197
  • [24] Fuzzy Clustering in a Complex Network Based on Content Relevance and Link Structures
    Hu, Lun
    Chan, Keith C. C.
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2016, 24 (02) : 456 - 470
  • [25] A Similarity Based Agglomerative Clustering Algorithm in Networks
    Liu, Zhiyuan
    Wang, Xiujuan
    Ma, Yinghong
    NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [26] An Improved K-means Clustering Algorithm for Complex Networks
    Li, Hao
    Wang, Haoxiang
    Chen, Zengxian
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND ELECTRONIC TECHNOLOGY, 2015, 3 : 90 - 93
  • [27] DFuzzy: a deep learning-based fuzzy clustering model for large graphs
    Vandana Bhatia
    Rinkle Rani
    Knowledge and Information Systems, 2018, 57 : 159 - 181
  • [28] Community mining in complex networks - clustering combination based genetic algorithm
    He D.-X.
    Zhou X.
    Wang Z.
    Zhou C.-G.
    Wang Z.
    Jin D.
    Zidonghua Xuebao/Acta Automatica Sinica, 2010, 36 (08): : 1160 - 1170
  • [29] Fuzzy clustering algorithm based on multiple medoids for large-scale data
    Chen A.-G.
    Wang S.-T.
    Kongzhi yu Juece/Control and Decision, 2016, 31 (12): : 2122 - 2130
  • [30] PCPD: A Parallel Crime Pattern Discovery System for Large-Scale Spatiotemporal Data Based on Fuzzy Clustering
    Khin Nandar Win
    Jianguo Chen
    Yuedan Chen
    Philippe Fournier-Viger
    International Journal of Fuzzy Systems, 2019, 21 : 1961 - 1974