GWPF: Communication-efficient federated learning with Gradient-Wise Parameter Freezing

Times cited: 0
Authors
Yang, Duo [1]
Gao, Yunqi [1]
Hu, Bing [1]
Jin, A-Long [2]
Wang, Wei [1]
You, Yang [3]
Affiliations
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Univ Hong Kong, Hong Kong, Peoples R China
[3] Natl Univ Singapore, Singapore, Singapore
Keywords
Federated learning; Communication mitigation; Parameter freezing; Frozen period; Thawing strategy
DOI
10.1016/j.comnet.2024.110886
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The communication bottleneck is a critical challenge in federated learning. Parameter freezing, which uses fine-grained parameters as aggregation objects, has emerged as a popular approach, but existing methods suffer from the lack of a thawing strategy, lag and inflexibility in the thawing process, and underutilization of the updates to frozen parameters. To address these challenges, we propose Gradient-Wise Parameter Freezing (GWPF), a mechanism that controls the frozen periods of different parameters through parameter freezing and thawing strategies. GWPF globally freezes parameters with insignificant gradients and excludes them from global updates during the frozen period, reducing communication overhead and accelerating training. The thawing strategy, based on global decisions by the server in collaboration with clients, uses real-time feedback on the locally accumulated gradients of frozen parameters in each round to balance communication savings against model accuracy. We provide a theoretical analysis and a convergence guarantee for non-convex objectives. Extensive experiments confirm that our mechanism achieves a speedup of up to 4.52 times in time-to-accuracy performance and reduces communication overhead by up to 48.73%. It also improves final model accuracy by up to 2.01% compared with APF, the fastest existing method.
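The freeze-and-thaw idea described in the abstract can be illustrated with a minimal sketch. This is not the GWPF algorithm itself: the function names, the per-parameter magnitude test, and the two thresholds (`freeze_threshold`, `thaw_threshold`) are assumptions introduced here for illustration, since the abstract does not specify the actual freezing or thawing criteria.

```python
# Illustrative sketch only: names, thresholds, and the magnitude test are
# assumptions, not the criteria used in the GWPF paper.

def update_frozen_mask(global_grad, accumulated_local_grad, frozen,
                       freeze_threshold, thaw_threshold):
    """Return a new per-parameter frozen mask (True means frozen).

    Active parameters are frozen when their global gradient is small.
    Frozen parameters are thawed when the gradient accumulated locally
    during the frozen period has grown to the thaw threshold or beyond.
    """
    new_frozen = []
    for g, acc, is_frozen in zip(global_grad, accumulated_local_grad, frozen):
        if is_frozen:
            # Stay frozen only while the accumulated gradient remains small.
            new_frozen.append(abs(acc) < thaw_threshold)
        else:
            # Freeze parameters whose gradient is insignificant.
            new_frozen.append(abs(g) < freeze_threshold)
    return new_frozen


def payload_indices(frozen):
    """Indices of parameters that still need to be communicated."""
    return [i for i, is_frozen in enumerate(frozen) if not is_frozen]


mask = update_frozen_mask(
    global_grad=[0.5, 0.001, 0.2, 0.0],
    accumulated_local_grad=[0.0, 0.0, 0.0, 0.3],
    frozen=[False, False, False, True],
    freeze_threshold=0.01,
    thaw_threshold=0.1,
)
to_send = payload_indices(mask)
```

In this toy round, the second parameter is frozen because its gradient is below the assumed freeze threshold, and the fourth is thawed because its accumulated local gradient exceeds the assumed thaw threshold. Only unfrozen parameters would be exchanged, which is where the communication saving would come from.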
Pages: 13
Related papers
65 in total
  • [1] REFL: Resource-Efficient Federated Learning
    Abdelmoniem, Ahmed M.
    Sahu, Atal Narayan
    Canini, Marco
    Fahmy, Suhaib A.
    [J]. PROCEEDINGS OF THE EIGHTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS, EUROSYS 2023, 2023, : 215 - 232
  • [2] Albasyoni A., 2020, CoRR abs/2010.03246
  • [3] [Anonymous], CoRR abs/1212.5701
  • [4] Beutel D.J., 2020, CoRR abs/2007.14390
  • [5] Brown TB, 2020, ADV NEUR IN, V33
  • [6] GraphCS: Graph-based client selection for heterogeneity in federated learning
    Chang, Tao
    Li, Li
    Wu, MeiHan
    Yu, Wei
    Wang, Xiaodong
    Xu, ChengZhong
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 177 : 131 - 143
  • [7] Communication-Efficient Federated Learning with Adaptive Parameter Freezing
    Chen, Chen
    Xu, Hong
    Wang, Wei
    Li, Baochun
    Li, Bo
    Chen, Li
    Zhang, Gong
    [J]. 2021 IEEE 41ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2021), 2021, : 1 - 11
  • [8] Chen C, 2019, IEEE INFOCOM SER, P532, DOI 10.1109/INFOCOM.2019.8737587
  • [9] Cohen G, 2017, IEEE IJCNN, P2921, DOI 10.1109/IJCNN.2017.7966217
  • [10] Collins L, 2021, PR MACH LEARN RES, V139