Analog Gradient Aggregation for Federated Learning Over Wireless Networks: Customized Design and Convergence Analysis

被引:88
作者
Guo, Huayan [1 ]
Liu, An [2 ]
Lau, Vincent K. N. [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Commun Engn, Hong Kong, Peoples R China
[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
关键词
Convergence; Servers; Transceivers; Internet of Things; Wireless communication; Wireless sensor networks; Linear regression; Distributed data aggregation; distributed machine learning; federated learning (FL); Internet of Things (IoT); over-the-air transmission; THE-AIR COMPUTATION; EDGE;
D O I
10.1109/JIOT.2020.3002925
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article investigates the analog gradient aggregation (AGA) solution to overcome the communication bottleneck for wireless federated learning applications by exploiting the idea of analog over-the-air transmission. Despite the various advantages, this special transmission solution also brings new challenges to both transceiver design and learning algorithm design due to the nonstationary local gradients and the time-varying wireless channels in different communication rounds. To address these issues, we propose a novel design of both the transceiver and learning algorithm for the AGA solution. In particular, the parameters in the transceiver are optimized with the consideration of the nonstationarity in the local gradients based on a simple feedback variable. Moreover, a novel learning rate design is proposed for the stochastic gradient descent algorithm, which is adaptive to the quality of the gradient estimation. Theoretical analyses are provided on the convergence rate of the proposed AGA solution. Finally, the effectiveness of the proposed solution is confirmed by two separate experiments based on linear regression and the shallow neural network. The simulation results verify that the proposed solution outperforms various state-of-the-art baseline schemes with a much faster convergence speed.
引用
收藏
页码:197 / 210
页数:14
相关论文
共 33 条
[1]   Federated Learning Over Wireless Fading Channels [J].
Amiri, Mohammad Mohammadi ;
Gunduz, Deniz .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (05) :3546-3557
[2]   Machine Learning at the Wireless Edge: Distributed Stochastic Gradient Descent Over-the-Air [J].
Amiri, Mohammad Mohammadi ;
Gunduz, Deniz .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 (68) :2155-2169
[3]   Collaborative Machine Learning at the Wireless Edge with Blind Transmitters [J].
Amiri, Mohammad Mohammadi ;
Duman, Tolga M. ;
Gunduz, Deniz .
2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
[4]  
[Anonymous], FEDPAQ COMMUNICATION
[5]   Optimization Methods for Large-Scale Machine Learning [J].
Bottou, Leon ;
Curtis, Frank E. ;
Nocedal, Jorge .
SIAM REVIEW, 2018, 60 (02) :223-311
[6]   MSE Tail Analysis for Remote State Estimation of Linear Systems Over Multiantenna Random Access Channels [J].
Cai, Songfu ;
Lau, Vincent K. N. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (05) :2046-2061
[7]   Modulation-Free M2M Communications for Mission-Critical Applications [J].
Cai, Songfu ;
Lau, Vincent K. N. .
IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2018, 4 (02) :248-263
[8]   Over-the-Air Computation for IoT Networks: Computing Multiple Functions With Antenna Arrays [J].
Chen, Li ;
Zhao, Nan ;
Chen, Yunfei ;
Yu, F. Richard ;
Wei, Guo .
IEEE INTERNET OF THINGS JOURNAL, 2018, 5 (06) :5296-5306
[9]   A Uniform-Forcing Transceiver Design for Over-the-Air Function Computation [J].
Chen, Li ;
Qin, Xiaowei ;
Wei, Guo .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2018, 7 (06) :942-945
[10]   Multi-objective genetic algorithm for energy-efficient hybrid flow shop scheduling with lot streaming [J].
Chen, Tzu-Li ;
Cheng, Chen-Yang ;
Chou, Yi-Han .
ANNALS OF OPERATIONS RESEARCH, 2020, 290 (1-2) :813-836