GENERALIZING LLOYD'S ALGORITHM FOR GRAPH CLUSTERING

被引:0
作者
Zaman, Tareq [1 ]
Nytko, Nicolas [2 ]
Taghibakhshi, Ali [3 ]
Maclachlan, Scott [1 ]
Olson, Luke [2 ]
West, Matthew [3 ]
机构
[1] Mem Univ Newfoundland, Interdisciplinary Program Sci Comp, St John, NF A1C 5S7, Canada
[2] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
[3] Univ Illinois Urbana & Champaign, Dept Mech Sci & Engn, Urbana, IL 61801 USA
基金
加拿大自然科学与工程研究理事会;
关键词
clustering; aggregation; multigrid; graph partitioning; AGGREGATION; CONVERGENCE;
D O I
10.1137/23M1556800
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Clustering is a commonplace problem in many areas of data science, with applications in biology and bioinformatics, understanding chemical structure, image segmentation, building recommender systems, and many more fields. While there are many different clustering variants (based on given distance or graph structure, probability distributions, or data density), we consider here the problem of clustering nodes in a graph, motivated by the problem of aggregating discrete degrees of freedom in multigrid and domain decomposition methods for solving sparse linear systems. Specifically, we consider the challenge of forming balanced clusters in the graph of a sparse matrix for use in algebraic multigrid, although the algorithm has general applicability. Based on an extension of the Bellman--Ford algorithm, we generalize Lloyd's algorithm for partitioning subsets of Rn to balance the number of nodes in each cluster; this is accompanied by a rebalancing algorithm that reduces the overall energy in the system. The algorithm provides control over the number of clusters and leads to ``well centered"" partitions of the graph. Theoretical results are provided to establish linear complexity and numerical results in the context of algebraic multigrid highlight the benefits of improved clustering.
引用
收藏
页码:A2819 / A2847
页数:29
相关论文
共 50 条
  • [41] Improved analysis of spectral algorithm for clustering
    Mizutani, Tomohiko
    OPTIMIZATION LETTERS, 2021, 15 (04) : 1303 - 1325
  • [42] Improved analysis of spectral algorithm for clustering
    Tomohiko Mizutani
    Optimization Letters, 2021, 15 : 1303 - 1325
  • [43] Validation indices for graph clustering
    Günter, S
    Bunke, H
    PATTERN RECOGNITION LETTERS, 2003, 24 (08) : 1107 - 1113
  • [44] A LOCAL CLUSTERING ALGORITHM FOR MASSIVE GRAPHS AND ITS APPLICATION TO NEARLY LINEAR TIME GRAPH PARTITIONING
    Spielman, Daniel A.
    Teng, Shang-Hua
    SIAM JOURNAL ON COMPUTING, 2013, 42 (01) : 1 - 26
  • [45] A Tree Segmentation Algorithm for Airborne Light Detection and Ranging Data Based on Graph Theory and Clustering
    Seidl, Jakub
    Kacmarik, Michal
    Klimanek, Martin
    FORESTS, 2024, 15 (07):
  • [46] Clustering-Aware Graph Construction: A Joint Learning Perspective
    Jia, Yuheng
    Liu, Hui
    Hou, Junhui
    Kwong, Sam
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2020, 6 : 357 - 370
  • [47] An Efficient Clustering of Wireless Sensor Network by Spectral Graph Partitioning
    Salman, Sonia
    Ali, Husnain Mansoor
    INTELLIGENT TECHNOLOGIES AND APPLICATIONS, INTAP 2018, 2019, 932 : 698 - 710
  • [48] POINTWISE CONVERGENCE OF THE LLOYD I ALGORITHM IN HIGHER DIMENSION
    Pages, Gilles
    Yu, Jun
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2016, 54 (05) : 2354 - 2382
  • [49] HOW THE RESULT OF GRAPH CLUSTERING METHODS DEPENDS ON THE CONSTRUCTION OF THE GRAPH
    Maier, Markus
    Von Luxburg, Ulrike
    Hein, Matthias
    ESAIM-PROBABILITY AND STATISTICS, 2013, 17 : 370 - 418
  • [50] Clustering based on a near neighbor graph and a grid cell graph
    Chen, Xinquan
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2013, 40 (03) : 529 - 554