1-Bit Compressive Sensing for Efficient Federated Learning Over the Air

被引:15
作者
Fan, Xin [1 ]
Wang, Yue [2 ]
Huo, Yan [1 ]
Tian, Zhi [2 ]
机构
[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing 100044, Peoples R China
[2] George Mason Univ, Dept Elect & Comp Engn, Fairfax, VA 22030 USA
基金
北京市自然科学基金; 美国国家科学基金会; 中国国家自然科学基金;
关键词
Convergence; Wireless communication; Power control; Optimization; Quantization (signal); Wireless sensor networks; Training; Federated learning; analog aggregation; 1-bit compressive sensing; convergence analysis; joint optimization; STOCHASTIC GRADIENT DESCENT; COMMUNICATION; AGGREGATION; SIGNAL; ADMM;
D O I
10.1109/TWC.2022.3209190
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
For distributed learning among collaborative users, this paper develops and analyzes a communication-efficient scheme for federated learning (FL) over the air, which incorporates 1-bit compressive sensing (CS) into analog-aggregation transmissions. To facilitate design parameter optimization, we analyze the efficacy of the proposed scheme by deriving a closed-form expression for the expected convergence rate. Our theoretical results unveil the tradeoff between convergence performance and communication efficiency as a result of the aggregation errors caused by sparsification, dimension reduction, quantization, signal reconstruction and noise. Then, we formulate a joint optimization problem to mitigate the impact of these aggregation errors through joint optimal design of worker scheduling and power scaling policy. An enumeration-based method is proposed to solve this non-convex problem, which is optimal but becomes computationally infeasible as the number of devices increases. For scalable computing, we resort to the alternating direction method of multipliers (ADMM) technique to develop an efficient implementation that is suitable for large-scale networks. Simulation results show that our proposed 1-bit CS based FL over the air achieves comparable performance to the ideal case where conventional FL without compression and quantification is applied over error-free aggregation, at much reduced communication overhead and transmission latency.
引用
收藏
页码:2139 / 2155
页数:17
相关论文
共 59 条
  • [1] Abdi A, 2020, AAAI CONF ARTIF INTE, V34, P3105
  • [2] Aji A. F., 2017, P C EMP METH NAT LAN, P440, DOI DOI 10.18653/V1/D17-1045
  • [3] Alistarh D, 2017, ADV NEUR IN, V30
  • [4] Federated Learning Over Wireless Fading Channels
    Amiri, Mohammad Mohammadi
    Gunduz, Deniz
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (05) : 3546 - 3557
  • [5] Machine Learning at the Wireless Edge: Distributed Stochastic Gradient Descent Over-the-Air
    Amiri, Mohammad Mohammadi
    Gunduz, Deniz
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 (68) : 2155 - 2169
  • [6] Collaborative Machine Learning at the Wireless Edge with Blind Transmitters
    Amiri, Mohammad Mohammadi
    Duman, Tolga M.
    Gunduz, Deniz
    [J]. 2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [7] Convex Optimization: Algorithms and Complexity
    不详
    [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2015, 8 (3-4): : 232 - +
  • [8] [Anonymous], 2018, P ADV NEUR INF PROC
  • [9] Bernstein J, 2018, PR MACH LEARN RES, V80
  • [10] Bertsekas DP, 1996, NEURO DYNAMIC PROGRA