Federated Learning Over Wireless Channels: Dynamic Resource Allocation and Task Scheduling

被引:12
作者
Chu, Shunfeng [1 ]
Li, Jun [1 ]
Wang, Jianxin [1 ]
Wang, Zhe [2 ]
Ding, Ming [3 ]
Zhang, Yijin [1 ]
Qian, Yuwen [1 ]
Chen, Wen [4 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[3] CSIRO, Data61, Sydney, NSW 2601, Australia
[4] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Federated learning; Markov decision processes; stochastic learning; resource allocation; dynamic programming; DELAY; NETWORKS; FRAMEWORK; POWER;
D O I
10.1109/TCCN.2022.3196009
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
With the development of federated learning (FL), mobile devices (MDs) are able to train their local models with private data and send them to a central server for aggregation, thereby preventing leakage of sensitive raw data. In this paper, we aim to improve the training performance of FL systems in the context of wireless channels and stochastic energy arrivals of each MD. To this purpose, we dynamically optimize MDs' transmission power and training task scheduling. We first model this dynamic programming problem as a constrained Markov decision process (CMDP). Due to high dimensions of the proposed CMDP problem, we propose online stochastic learning methods to simplify the CMDP and design online algorithms to obtain an efficient policy for all MDs. Since there are long-term constraints in our CMDP, we utilize a Lagrange multipliers approach to tackle this issue. Furthermore, we prove the convergence of the proposed online stochastic learning algorithm. Numerical results indicate that the proposed algorithms can achieve better performance than the benchmark algorithms.
引用
收藏
页码:1910 / 1924
页数:15
相关论文
共 47 条
[1]  
Alistarh D, 2017, ADV NEUR IN, V30
[2]   Optimal power and rate control for minimal average delay: The single-user case [J].
Bettesh, Ido ;
Shamai, Shlomo .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (09) :4115-4141
[3]   A Decentralized Optimization Framework for Energy Harvesting Devices [J].
Biason, Alessandro ;
Dey, Subhrakanti ;
Zorzi, Michele .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2018, 17 (11) :2483-2496
[4]   An actor-critic algorithm for constrained Markov decision processes [J].
Borkar, VS .
SYSTEMS & CONTROL LETTERS, 2005, 54 (03) :207-213
[5]   Stochastic approximation with two time scales [J].
Borkar, VS .
SYSTEMS & CONTROL LETTERS, 1997, 29 (05) :291-294
[6]   A Joint Learning and Communications Framework for Federated Learning Over Wireless Networks [J].
Chen, Mingzhe ;
Yang, Zhaohui ;
Saad, Walid ;
Yin, Changchuan ;
Poor, H. Vincent ;
Cui, Shuguang .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (01) :269-283
[7]   A Survey on Delay-Aware Resource Control for Wireless Systems-Large Deviation Theory, Stochastic Lyapunov Drift, and Distributed Stochastic Learning [J].
Cui, Ying ;
Lau, Vincent K. N. ;
Wang, Rui ;
Huang, Huang ;
Zhang, Shunqing .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2012, 58 (03) :1677-1701
[8]   Distributive Stochastic Learning for Delay-Optimal OFDMA Power and Subband Allocation [J].
Cui, Ying ;
Lau, Vincent K. N. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (09) :4848-4858
[9]   Quality of Service Aware Computation Offloading in an Ad-Hoc Mobile Cloud [J].
Duc Van Le ;
Tham, Chen-Khong .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (09) :8890-8904
[10]  
Ha S, 2019, IEEE INT SYMP INFO, P2649, DOI [10.1109/isit.2019.8849507, 10.1109/ISIT.2019.8849507]