On Parallel k-Center Clustering

被引:1
作者
Coy, Sam [1 ]
Czumaj, Artur [1 ]
Mishra, Gopinath [1 ]
机构
[1] Univ Warwick, Coventry, W Midlands, England
来源
PROCEEDINGS OF THE 35TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, SPAA 2023 | 2023年
基金
英国工程与自然科学研究理事会;
关键词
MPC; k-center; clustering; parallel computing; MapReduce; MAPREDUCE;
D O I
10.1145/3558481.3591075
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We consider the classic k-center problem in a parallel setting, on the low-local-space Massively Parallel Computation (MPC) model, with local space per machine of O(n(delta)), where delta is an element of(0, 1) is an arbitrary constant. As a central clustering problem, the k-center problem has been studied extensively. Still, until very recently, all parallel MPC algorithms have been requiring Omega(k) or even Omega(kn(delta)) local space per machine. While this setting covers the case of small values of k, for a large number of clusters these algorithms require large local memory, making them poorly scalable. The case of large k, k >= Omega(n(delta)), has been considered recently for the low-local-space MPC model by Bateni et al. (2021), who gave an O(log log n)-round MPC algorithm that produces k(1 + o(1)) centers whose cost has multiplicative approximation of O(log log log n). In this paper we extend the algorithm of Bateni et al. and design a low-local-space MPC algorithm that in O(log log..) rounds returns a clustering with k(1 + o(1)) clusters that is an O(log*n)-approximation for k-center.
引用
收藏
页码:65 / 75
页数:11
相关论文
共 23 条
  • [1] Parallel Algorithms for Geometric Graph Problems
    Andoni, Alexandr
    Nikolov, Aleksandar
    Onak, Krzysztof
    Yaroslavtsev, Grigory
    [J]. STOC'14: PROCEEDINGS OF THE 46TH ANNUAL 2014 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2014, : 574 - 583
  • [2] Bateni M, 2021, AAAI CONF ARTIF INTE, V35, P3941
  • [3] Beame P, 2017, J ACM, V64, DOI 10.1145/3125644
  • [4] Blelloch GE, 2010, SPAA '10: PROCEEDINGS OF THE TWENTY-SECOND ANNUAL SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, P315
  • [5] Solving k-center Clustering (with Outliers) in MapReduce and Streaming, almost as Accurately as Sequentially
    Ceccarello, Matteo
    Pietracaprina, Andrea
    Pucci, Geppino
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (07): : 766 - 778
  • [6] Chen Jianxu, 2016, ADV NEURAL INFORM PR, V29
  • [7] Coy S, 2023, Arxiv, DOI arXiv:2304.05883
  • [8] Round Compression for Parallel Matching Algorithms
    Czumaj, Artur
    Lacki, Jakub
    Madry, Aleksander
    Mitrovic, Slobodan
    Onak, Krzysztof
    Sankowski, Piotr
    [J]. STOC'18: PROCEEDINGS OF THE 50TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2018, : 471 - 484
  • [9] Mapreduce: Simplified data processing on large clusters
    Dean, Jeffrey
    Ghemawat, Sanjay
    [J]. COMMUNICATIONS OF THE ACM, 2008, 51 (01) : 107 - 113
  • [10] A SIMPLE HEURISTIC FOR THE P-CENTER PROBLEM
    DYER, ME
    FRIEZE, AM
    [J]. OPERATIONS RESEARCH LETTERS, 1985, 3 (06) : 285 - 288