Communication-Efficient and Byzantine-Robust Distributed Learning

被引:9
作者
Ghosh, Avishek [1 ]
Maity, Raj Kumar [2 ]
Kadhe, Swanand [1 ]
Mazumdar, Arya [2 ]
Ramchandran, Kannan [1 ]
机构
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[2] UMASS Amherst, Coll Informat & Comp Sci, Amherst, MA USA
来源
2020 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA) | 2020年
基金
美国国家科学基金会;
关键词
D O I
10.1109/ita50056.2020.9245017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We develop a communication-efficient distributed learning algorithm that is robust against Byzantine worker machines. We propose and analyze a distributed gradient-descent algorithm that performs a simple thresholding based on gradient norms to mitigate Byzantine failures. We show the (statistical) error-rate of our algorithm matches that of [YCKB18], which uses more complicated schemes (like coordinate-wise median or trimmed mean) and thus optimal. Furthermore, for communication efficiency, we consider a generic class of delta-approximate compressors from [KRSJ19] that encompasses sign-based compressors and top-k sparsification. Our algorithm uses compressed gradients and gradient norms for aggregation and Byzantine removal respectively. We establish the statistical error rate of the algorithm for arbitrary (convex or non-convex) smooth loss function. We show that, in the regime when the compression factor delta is constant and the dimension of the parameter space is fixed, the rate of convergence is not affected by the compression operation, and hence we effectively get the compression for free. Moreover, we extend the compressed gradient descent algorithm with error feedback proposed in [KRSJ19] for the distributed setting. We have experimentally validated our results and shown good performance in convergence for convex (least-square regression) and non-convex (neural network training) problems.
引用
收藏
页数:28
相关论文
共 50 条
[41]   Communication-Efficient Distributed Cooperative Learning With Compressed Beliefs [J].
Toghani, Mohammad Taha ;
Uribe, Cesar A. .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2022, 9 (03) :1215-1226
[42]   Communication-Efficient Robust Federated Learning with Noisy Labels [J].
Li, Junyi ;
Pei, Jian ;
Huang, Heng .
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, :914-924
[43]   Local Stochastic ADMM for Communication-Efficient Distributed Learning [J].
ben Issaid, Chaouki ;
Elgabli, Anis ;
Bennis, Mehdi .
2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, :1880-1885
[44]   ALS Algorithm for Robust and Communication-Efficient Federated Learning [J].
Hurley, Neil ;
Duriakova, Erika ;
Geraci, James ;
O'Reilly-Morgan, Diarmuid ;
Tragos, Elias ;
Smyth, Barry ;
Lawlor, Aonghus .
PROCEEDINGS OF THE 2024 4TH WORKSHOP ON MACHINE LEARNING AND SYSTEMS, EUROMLSYS 2024, 2024, :56-64
[45]   Communication-Efficient and Resilient Distributed Q-Learning [J].
Xie, Yijing ;
Mou, Shaoshuai ;
Sundaram, Shreyas .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) :3351-3364
[46]   Communication-Efficient Distributed Learning of Discrete Probability Distributions [J].
Diakonikolas, Ilias ;
Grigorescu, Elena ;
Li, Jerry ;
Natarajan, Abhiram ;
Onak, Krzysztof ;
Schmidt, Ludwig .
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[47]   Communication-Efficient and Privacy-Aware Distributed Learning [J].
Gogineni, Vinay Chakravarthi ;
Moradi, Ashkan ;
Venkategowda, Naveen K. D. ;
Werner, Stefan .
IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 :705-720
[48]   Ordered Gradient Approach for Communication-Efficient Distributed Learning [J].
Chen, Yicheng ;
Sadler, Brian M. ;
Blum, Rick S. .
PROCEEDINGS OF THE 21ST IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (IEEE SPAWC2020), 2020,
[49]   SafeML: A Privacy-Preserving Byzantine-Robust Framework for Distributed Machine Learning Training [J].
Mirabi, Meghdad ;
Nikiel, Rene Klaus ;
Binnig, Carsten .
2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, :207-216
[50]   RSA: Byzantine-Robust Stochastic Aggregation Methods for Distributed Learning from Heterogeneous Datasets [J].
Li, Liping ;
Xu, Wei ;
Chen, Tianyi ;
Giannakis, Georgios B. ;
Ling, Qing .
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, :1544-1551