Scaling and Quality of Modularity Optimization Methods for Graph Clustering

被引:15
|
作者
Ghosh, Sayan [1 ]
Halappanavar, Mahantesh [1 ]
Tumeo, Antonino [1 ]
Kalyanaraman, Ananth [2 ]
机构
[1] Pacific Northwest Natl Lab, Richland, WA 99352 USA
[2] Washington State Univ, Pullman, WA 99164 USA
关键词
D O I
10.1109/hpec.2019.8916299
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Real-world graphs exhibit structures known as "communities" or "clusters" consisting of a group of vertices with relatively high connectivity between them, as compared to the rest of the vertices in the network. Graph clustering or community detection is a fundamental graph operation used to analyze real-world graphs occurring in the areas of computational biology, cybersecurity, electrical grids, etc. Similar to other graph algorithms, owing to irregular memory accesses and inherently sequential nature, current algorithms for community detection are challenging to parallelize. However, in order to analyze large networks, it is important to develop scalable parallel implementations of graph clustering that are capable of exploiting the architectural features of modern supercomputers. In response to the 2019 Streaming Graph Challenge, we present quality and performance analysis of our distributed-memory community detection using Vite, which is our distributed memory implementation of the popular Louvain method, on the ALCF Theta supercomputer. Clustering methods such as Louvain that rely on modularity maximization are known to suffer from the resolution limit problem, preventing identification of clusters of certain sizes. Hence, we also include quality analysis of our shared-memory implementation of the Fast-tracking Resistance method, in comparison with Louvain on the challenge datasets. Furthermore, we introduce an edge-balanced graph distribution for our distributed memory implementation, that significantly reduces communication, offering up to 80% improvement in the overall execution time. In addition to performance/quality analysis, we also include details on the power/energy consumption, and memory traffic of the distributed-memory clustering implementation using real-world graphs with over a billion edges.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Graph Clustering Based on Modularity Variation Estimations
    Martynov, Nikita
    Khandarova, Olga
    Khandarov, Fedor
    BULLETIN OF IRKUTSK STATE UNIVERSITY-SERIES MATHEMATICS, 2018, 25 : 63 - 78
  • [2] An Immunological Algorithm for Graph Modularity Optimization
    Spampinato, A. G.
    Scollo, R. A.
    Cavallaro, S.
    Pavone, M.
    Cutello, V
    ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS (UKCI 2019), 2020, 1043 : 235 - 247
  • [3] An efficient graph clustering algorithm in signed graph based on modularity maximization
    Zhuo, Kefan
    Yang, Zhuoxuan
    Yan, Guan
    Yu, Kai
    Guo, Wenqiang
    INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2019, 30 (11):
  • [4] Automatic clustering of multispectral imagery by maximization of the graph modularity
    Mercovich, Ryan A.
    Harkin, Anthony
    Messinger, David
    ALGORITHMS AND TECHNOLOGIES FOR MULTISPECTRAL, HYPERSPECTRAL, AND ULTRASPECTRAL IMAGERY XVII, 2011, 8048
  • [5] Modularity-based Sparse Soft Graph Clustering
    Hollocou, Alexandre
    Bonald, Thomas
    Lelarge, Marc
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 323 - 332
  • [6] Distributed Graph Clustering Using Modularity and Map Equation
    Hamann, Michael
    Strasser, Ben
    Wagner, Dorothea
    Zeitz, Tim
    EURO-PAR 2018: PARALLEL PROCESSING, 2018, 11014 : 688 - 702
  • [7] Scaling Graph Clustering with Distributed Sketches
    Priest, Benjamin W.
    Dunton, Alec
    Sanders, Geoffrey
    2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [8] Scaling Fine-grained Modularity Clustering for Massive Graphs
    Shiokawa, Hiroaki
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4597 - 4604
  • [9] Tensor-based Graph Modularity for Text Data Clustering
    Boutalbi, Rafika
    Ait-Saada, Mira
    Iurshina, Anastasiia
    Staab, Steffen
    Nadif, Mohamed
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 2227 - 2231
  • [10] Adaptive Local Modularity Learning for Efficient Multilayer Graph Clustering
    Wu, Danyang
    Wang, Penglei
    Liang, Junjie
    Lu, Jitao
    Xu, Jin
    Wang, Rong
    Nie, Feiping
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 2221 - 2232