Scaling and Quality of Modularity Optimization Methods for Graph Clustering

被引:15
|
作者
Ghosh, Sayan [1 ]
Halappanavar, Mahantesh [1 ]
Tumeo, Antonino [1 ]
Kalyanaraman, Ananth [2 ]
机构
[1] Pacific Northwest Natl Lab, Richland, WA 99352 USA
[2] Washington State Univ, Pullman, WA 99164 USA
关键词
D O I
10.1109/hpec.2019.8916299
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Real-world graphs exhibit structures known as "communities" or "clusters" consisting of a group of vertices with relatively high connectivity between them, as compared to the rest of the vertices in the network. Graph clustering or community detection is a fundamental graph operation used to analyze real-world graphs occurring in the areas of computational biology, cybersecurity, electrical grids, etc. Similar to other graph algorithms, owing to irregular memory accesses and inherently sequential nature, current algorithms for community detection are challenging to parallelize. However, in order to analyze large networks, it is important to develop scalable parallel implementations of graph clustering that are capable of exploiting the architectural features of modern supercomputers. In response to the 2019 Streaming Graph Challenge, we present quality and performance analysis of our distributed-memory community detection using Vite, which is our distributed memory implementation of the popular Louvain method, on the ALCF Theta supercomputer. Clustering methods such as Louvain that rely on modularity maximization are known to suffer from the resolution limit problem, preventing identification of clusters of certain sizes. Hence, we also include quality analysis of our shared-memory implementation of the Fast-tracking Resistance method, in comparison with Louvain on the challenge datasets. Furthermore, we introduce an edge-balanced graph distribution for our distributed memory implementation, that significantly reduces communication, offering up to 80% improvement in the overall execution time. In addition to performance/quality analysis, we also include details on the power/energy consumption, and memory traffic of the distributed-memory clustering implementation using real-world graphs with over a billion edges.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] HOW THE RESULT OF GRAPH CLUSTERING METHODS DEPENDS ON THE CONSTRUCTION OF THE GRAPH
    Maier, Markus
    Von Luxburg, Ulrike
    Hein, Matthias
    ESAIM-PROBABILITY AND STATISTICS, 2013, 17 : 370 - 418
  • [32] Graph schemes, graph series, and modularity
    Bringmann, Kathrin
    Jennings-Shaffer, Chris
    Milas, Antun
    JOURNAL OF COMBINATORIAL THEORY SERIES A, 2023, 197
  • [33] Parallel Modularity Clustering
    Fender, Alexandre
    Emad, Nahid
    Petiton, Serge
    Naumov, Maxim
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 1793 - 1802
  • [34] Parallel Modularity Clustering
    Nvidia Corporation, United States
    不详
    不详
    不详
    Procedia Comput. Sci., (1793-1802):
  • [35] Optimization methods for fuzzy clustering
    Fu, GY
    FUZZY SETS AND SYSTEMS, 1998, 93 (03) : 301 - 309
  • [36] Unsupervised feature selection method based on iterative similarity graph factorization and clustering by modularity
    Oliveira, Marcos de S.
    Queiroz, Sergio R. de M.
    de Carvalho, Francisco de A. T.
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 208
  • [37] Automatic Clustering of Inks in Cultural Heritage Artifacts via Optimal Selection of Graph Modularity
    Cao, Bei Grace
    Messinger, David W.
    ALGORITHMS, TECHNOLOGIES, AND APPLICATIONS FOR MULTISPECTRAL AND HYPERSPECTRAL IMAGING XXVII, 2021, 11727
  • [38] Scaling black holes and modularity
    Chattopadhyaya, Aradhita
    Manschot, Jan
    Mondal, Swapnamay
    JOURNAL OF HIGH ENERGY PHYSICS, 2022, 2022 (03)
  • [39] Scaling black holes and modularity
    Aradhita Chattopadhyaya
    Jan Manschot
    Swapnamay Mondal
    Journal of High Energy Physics, 2022
  • [40] Structural Measures of Clustering Quality on Graph Samples
    Zhang, Jianpeng
    Pei, Yulong
    Fletcher, George H. L.
    Pechenizkiy, Mykola
    PROCEEDINGS OF THE 2016 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING ASONAM 2016, 2016, : 345 - 348