Scaling and Quality of Modularity Optimization Methods for Graph Clustering

被引:15
|
作者
Ghosh, Sayan [1 ]
Halappanavar, Mahantesh [1 ]
Tumeo, Antonino [1 ]
Kalyanaraman, Ananth [2 ]
机构
[1] Pacific Northwest Natl Lab, Richland, WA 99352 USA
[2] Washington State Univ, Pullman, WA 99164 USA
关键词
D O I
10.1109/hpec.2019.8916299
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Real-world graphs exhibit structures known as "communities" or "clusters" consisting of a group of vertices with relatively high connectivity between them, as compared to the rest of the vertices in the network. Graph clustering or community detection is a fundamental graph operation used to analyze real-world graphs occurring in the areas of computational biology, cybersecurity, electrical grids, etc. Similar to other graph algorithms, owing to irregular memory accesses and inherently sequential nature, current algorithms for community detection are challenging to parallelize. However, in order to analyze large networks, it is important to develop scalable parallel implementations of graph clustering that are capable of exploiting the architectural features of modern supercomputers. In response to the 2019 Streaming Graph Challenge, we present quality and performance analysis of our distributed-memory community detection using Vite, which is our distributed memory implementation of the popular Louvain method, on the ALCF Theta supercomputer. Clustering methods such as Louvain that rely on modularity maximization are known to suffer from the resolution limit problem, preventing identification of clusters of certain sizes. Hence, we also include quality analysis of our shared-memory implementation of the Fast-tracking Resistance method, in comparison with Louvain on the challenge datasets. Furthermore, we introduce an edge-balanced graph distribution for our distributed memory implementation, that significantly reduces communication, offering up to 80% improvement in the overall execution time. In addition to performance/quality analysis, we also include details on the power/energy consumption, and memory traffic of the distributed-memory clustering implementation using real-world graphs with over a billion edges.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Multi-threaded modularity based graph clustering using the multilevel paradigm
    LaSalle, Dominique
    Karypis, George
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2015, 76 : 66 - 80
  • [22] Graph modularity maximization as an effective method for co-clustering text data
    Ailem, Melissa
    Role, Francois
    Nadif, Mohamed
    KNOWLEDGE-BASED SYSTEMS, 2016, 109 : 160 - 173
  • [23] Optimizing an organized modularity measure for topographic graph clustering: A deterministic annealing approach
    Rossi, Fabrice
    Villa-Vialaneix, Nathalie
    NEUROCOMPUTING, 2010, 73 (7-9) : 1142 - 1163
  • [24] Spectral methods for graph clustering - A survey
    Nascimento, Maria C. V.
    de Carvalho, Andre C. P. L. F.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2011, 211 (02) : 221 - 231
  • [25] KEY FRAMES EXTRACTION USING GRAPH MODULARITY CLUSTERING FOR EFFICIENT VIDEO SUMMARIZATION
    Gharbi, Hana
    Bahroun, Sahbi
    Massaoudi, Mohamed
    Zagrouba, Ezzeddine
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1502 - 1506
  • [26] Assessing the quality of multilevel graph clustering
    Queyroi, Francois
    Delest, Maylis
    Fedou, Jean-Marc
    Melancon, Guy
    DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 28 (04) : 1107 - 1128
  • [27] Axioms for graph clustering quality functions
    Van Laarhoven, Twan
    Marchiori, Elena
    Journal of Machine Learning Research, 2014, 15 : 193 - 215
  • [28] Axioms for Graph Clustering Quality Functions
    van Laarhoven, Twan
    Marchiori, Elena
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 193 - 215
  • [29] Assessing the quality of multilevel graph clustering
    François Queyroi
    Maylis Delest
    Jean-Marc Fédou
    Guy Melançon
    Data Mining and Knowledge Discovery, 2014, 28 : 1107 - 1128
  • [30] Modularity and Graph Expansion
    Louf, Baptiste
    McDiarmid, Colin
    Skerman, Fiona
    15TH INNOVATIONS IN THEORETICAL COMPUTER SCIENCE CONFERENCE, ITCS 2024, 2024,