Delay-Aware Online Resource Allocation for Buffer-Aided Synchronous Federated Learning Over Wireless Networks

被引:0
作者
Liu, Jing [1 ]
Zheng, Jinke [1 ]
Zhang, Jing [1 ]
Xiang, Lin [2 ]
Ng, Derrick Wing Kwan [3 ]
Ge, Xiaohu [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
[2] Tech Univ Darmstadt, Commun Engn Lab, D-64289 Darmstadt, Germany
[3] Univ New South Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2033, Australia
基金
中国国家自然科学基金;
关键词
Data models; Wireless sensor networks; Training; Resource management; Wireless networks; Data centers; Solid modeling; Training data; Servers; Internet of Things; Federated learning; straggler effect; delay; Lyapunov optimization; COMMUNICATION; CHANNEL;
D O I
10.1109/ACCESS.2024.3489657
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Synchronous federated learning (FL) over wireless networks often suffers from the straggler effect, when the time required for training local models and uploading trained parameters varies significantly across heterogeneous wireless devices. This disparity prolongs the duration needed for model aggregation at the data center and slows down the convergence of synchronous FL, posing a significant challenge for FL over wireless networks. In this paper, we propose a novel buffer-aided FL scheme to mitigate the straggler effect. A buffer with sufficiently large storage is deployed at each wireless device to temporarily store the collected training data and adaptively outputs it during local training, according to the computational capabilities and communication data rates of the wireless devices. Consequently, all local models can be synchronously aggregated at the data center to reduce the number of rounds required for model aggregation in FL. To ensure timely information updates, a staleness function is further introduced to characterize the freshness of the data used to train local models. Additionally, the entropic value-at-risk (EVaR) of the data queues is introduced to eliminate the impact of discarded data at the buffers and improve the accuracy of trained local models. We formulate a delay-aware online stochastic optimization problem to minimize the long-term average staleness of all wireless devices for buffer-aided FL. Our problem formulation simultaneously guarantees the stability of data queues at the wireless devices and reduces the risk of data loss. By employing the Lyapunov optimization technique, we transform the problem into instantaneous deterministic optimization subproblems and further solve each subproblem online via utilizing its hidden convexity. Simulation results demonstrate that the proposed buffer-aided synchronous FL scheme can effectively improve the convergence rate of FL and, at the same time, ensure timely synchronization of heterogeneous wireless devices.
引用
收藏
页码:164862 / 164877
页数:16
相关论文
共 46 条
[1]   Entropic Value-at-Risk: A New Coherent Risk Measure [J].
Ahmadi-Javid, A. .
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 155 (03) :1105-1123
[2]   Wireless Quantized Federated Learning: A Joint Computation and Communication Design [J].
Bouzinis, Pavlos S. ;
Diamantoulakis, Panagiotis D. ;
Karagiannidis, George K. .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (05) :2756-2770
[3]   A Joint Learning and Communications Framework for Federated Learning Over Wireless Networks [J].
Chen, Mingzhe ;
Yang, Zhaohui ;
Saad, Walid ;
Yin, Changchuan ;
Poor, H. Vincent ;
Cui, Shuguang .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (01) :269-283
[4]   A MEASURE OF ASYMPTOTIC EFFICIENCY FOR TESTS OF A HYPOTHESIS BASED ON THE SUM OF OBSERVATIONS [J].
CHERNOFF, H .
ANNALS OF MATHEMATICAL STATISTICS, 1952, 23 (04) :493-507
[5]  
Follmer H., 2011, Stochastic finance: an introduction in discrete time, V2nd
[6]   QoS Differential Scheduling of URLLC Under FIFO Service Discipline: A Cross-Layer Approach [J].
Han, Di ;
Chen, Wei .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2020, 9 (09) :1370-1373
[7]  
Hu CH, 2023, IEEE J SEL AREA COMM, V41, P874, DOI [10.1109/JSAC.2023.3242719, 10.1109/ICASSP49357.2023.10096633]
[8]   Scalable Orchestration of Service Function Chains in NFV-Enabled Networks: A Federated Reinforcement Learning Approach [J].
Huang, Haojun ;
Zeng, Cheng ;
Zhao, Yangmin ;
Min, Geyong ;
Zhu, Ying Ying ;
Miao, Wang ;
Hu, Jia .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (08) :2558-2571
[9]  
Husnoo M. A., 2023, P IEEE C COMP COMM W, P1
[10]   Towards Efficient Synchronous Federated Training: A Survey on System Optimization Strategies [J].
Jiang, Zhifeng ;
Wang, Wei ;
Li, Bo ;
Yang, Qiang .
IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (02) :437-454