Communication Efficient and Byzantine Tolerant Distributed Learning

被引:0
作者
Ghosh, Avishek [1 ]
Maity, Raj Kumar [2 ]
Kadhe, Swanand [1 ]
Mazumdar, Arya [2 ]
Ramachandran, Kannan [1 ]
机构
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[2] UMASS Amherst, Coll Informat & Comp Sci, Amherst, MA USA
来源
2020 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT) | 2020年
关键词
D O I
10.1109/isit44484.2020.9174391
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We develop a communication-efficient distributed learning algorithm that is robust against Byzantine worker machines. We propose and analyze a distributed gradient-descent algorithm that performs a simple thresholding based on gradient norms to mitigate Byzantine failures. We show the (statistical) error-rate of our algorithm matches that of Yin et al., 2018, which uses more complicated schemes (like coordinate-wise median or trimmed mean). Furthermore, for communication efficiency, we consider a generic class of delta-approximate compressors from Karimireddy et al., 2019, that encompasses sign-based compressors and top-k sparsification. Our algorithm uses compressed gradients and gradient norms for aggregation and Byzantine removal respectively. We establish the statistical error rate of the algorithm for arbitrary (convex or non-convex) smooth loss function. We show that, in certain regime of delta, the rate of convergence is not affected by the compression operation. We have experimentally validated our results and shown good performance in convergence for convex (least-square regression) and non-convex (neural network training) problems.
引用
收藏
页码:2545 / 2550
页数:6
相关论文
共 32 条
[1]  
Alistarh D, 2018, ADV NEUR IN, V31
[2]  
Alistarh Dan., 2017, Communication-efficient stochastic gradient descent
[3]  
[Anonymous], 2019, P MACHINE LEARNING R
[4]  
[Anonymous], 2017, ADV NEURAL INF PROCE
[5]  
[Anonymous], 2017, BYZANTINE TOLERANT M
[6]  
Bernstein J., 2018, ARXIV180204434
[7]  
Bernstein J., 2018, ICLR
[8]  
Bernstein J, 2018, PR MACH LEARN RES, V80
[9]  
Chen Kaifei, 2018, ACM T INTEL SYST TEC, V1, P1, DOI DOI 10.1145/3298981