MinMax Sampling: A Near-optimal Global Summary for Aggregation in the Wide Area

被引:5
|
作者
Zhao, Yikai [1 ,2 ,3 ]
Zhang, Yinda [1 ,2 ,3 ]
Li, Yuanpeng [1 ,2 ,3 ]
Zhou, Yi [1 ,2 ,3 ]
Chen, Chunhui [1 ,5 ]
Yang, Tong [1 ,2 ,3 ,4 ]
Cui, Bin [1 ,2 ,3 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Peking Univ, Sch Comp Sci, Beijing, Peoples R China
[3] Peking Univ, Natl Engn Lab Big Data Anal Technol & Applicat, Beijing, Peoples R China
[4] Peng Cheng Lab, Shenzhen, Peoples R China
[5] Peking Univ, Sch Software & Microelect, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22) | 2022年
基金
中国国家自然科学基金;
关键词
Sampling; Wide-Area Network; Federated Learning; FREQUENT;
D O I
10.1145/3514221.3526160
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, wide-area data analyses are pervasive with emerging geo-distributed systems. These analyses often need to do the global aggregation in the wide area. Since scarce and variable WAN bandwidth may degrade the aggregation performance, it is highly desired to design a communication scheme for global aggregation in WAN. Unfortunately, no existing algorithm can meet the three design requirements of communication schemes: fast computation, adaptive transmission, and accurate aggregation. In this paper, we propose MinMax Sampling, a fast, adaptive, and accurate communication scheme for global aggregation in WAN. We first focus on the accuracy and design a scheme, namely MinMax(opt), to achieve optimal accuracy. However, MinMax(opt) does not meet the other two requirements: fast computation and adaptive transmission. Based on MinMax(opt), we propose MinMax(adp), which trades little accuracy for the other two requirements. We evaluate MinMax(adp) with three applications: federated learning, distributed state aggregation, and hierarchical aggregation. Our experimental results show that MinMax(adp) is superior to existing algorithms (8.44x better accuracy on average) in all three applications. The source codes of MinMax Sampling are available at Github [1].
引用
收藏
页码:744 / 758
页数:15
相关论文
共 50 条
  • [1] Near-Optimal Regret Bounds for Thompson Sampling
    Agrawal, Shipra
    Goyal, Navin
    JOURNAL OF THE ACM, 2017, 64 (05)
  • [2] Efficient and Near-Optimal Algorithms for Sampling Connected Subgraphs
    Bressan, Marco
    STOC '21: PROCEEDINGS OF THE 53RD ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2021, : 1132 - 1143
  • [3] A minimax near-optimal algorithm for adaptive rejection sampling
    Achddou, Juliette
    Lam-Weil, Joseph
    Carpentier, Alexandra
    Blanchard, Gilles
    ALGORITHMIC LEARNING THEORY, VOL 98, 2019, 98
  • [4] Near-Optimal Random Walk Sampling in Distributed Networks
    Das Sarma, Atish
    Molla, Anisur Rahaman
    Pandurangan, Gopal
    2012 PROCEEDINGS IEEE INFOCOM, 2012, : 2906 - 2910
  • [5] Near-Optimal Graph Signal Sampling by Pareto Optimization
    Luo, Dongqi
    Si, Binqiang
    Zhang, Saite
    Yu, Fan
    Zhu, Jihong
    SENSORS, 2021, 21 (04) : 1 - 13
  • [6] Near-Optimal Dominating Sets via Random Sampling
    Nehez, Martin
    ALGORITHMIC ASPECTS IN INFORMATION AND MANAGEMENT, 2016, 9778 : 162 - 172
  • [7] Near-Optimal Entrywise Sampling of Numerically Sparse Matrices
    Braverman, Vladimir
    Krauthgamer, Robert
    Krishnan, Aditya
    Sapir, Shay
    CONFERENCE ON LEARNING THEORY, VOL 134, 2021, 134 : 759 - 773
  • [8] Sampling-based near-optimal MIMO demodulation algorithms
    Dong, B
    Wang, XD
    Doucet, A
    42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, : 4214 - 4219
  • [9] Finding Near-Optimal Configurations in Product Lines by Random Sampling
    Oh, Jeho
    Batory, Don
    Myers, Margaret
    Siegmund, Norbert
    ESEC/FSE 2017: PROCEEDINGS OF THE 2017 11TH JOINT MEETING ON FOUNDATIONS OF SOFTWARE ENGINEERING, 2017, : 61 - 71
  • [10] Near-optimal Keypoint Sampling for Fast Pathological Lung Segmentation
    Mansoor, Awais
    Bagci, Ulas
    Mollura, Daniel J.
    2014 36TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2014, : 6032 - 6035