Distributed Deep Reinforcement Learning-Based Gradient Quantization for Federated Learning Enabled Vehicle Edge Computing

被引:27
作者
Zhang, Cui [1 ]
Zhang, Wenjun [2 ]
Wu, Qiong [2 ]
Fan, Pingyi [3 ]
Fan, Qiang [4 ]
Wang, Jiangzhou [5 ]
Letaief, Khaled B. [6 ]
机构
[1] Wuxi Inst Technol, Sch Internet Things Engn, Wuxi 214122, Peoples R China
[2] Jiangnan Univ, Sch Internet Things Engn, Wuxi 214122, Peoples R China
[3] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Dept Elect Engn, Beijing 100084, Peoples R China
[4] Qualcomm, San Jose, CA 95110 USA
[5] Univ Kent, Sch Engn, Canterbury CT2 7NT, England
[6] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Quantization (signal); Training; Data models; Computational modeling; Optimization; Internet of Things; Resource management; Distributed deep reinforcement learning (DRL); federated learning (FL); gradient quantization; vehicle edge computing (VEC); RESOURCE-ALLOCATION; COMMUNICATION; NETWORKS; CHUNK;
D O I
10.1109/JIOT.2024.3447036
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Federated learning (FL) can protect the privacy of the vehicles in vehicle edge computing (VEC) to a certain extent through sharing the gradients of vehicles' local models instead of the local data. The gradients of vehicles' local models are usually large for the vehicular artificial intelligence (AI) applications, thus transmitting such large gradients would cause large per-round latency. Gradient quantization has been proposed as one effective approach to reduce the per-round latency in FL enabled VEC through compressing gradients and reducing the number of bits, i.e., the quantization level, to transmit gradients. The selection of quantization level and thresholds determines the quantization error (QE), which further affects the model accuracy and training time. To do so, the total training time and QE become two key metrics for the FL enabled VEC. It is critical to jointly optimize the total training time and QE for the FL enabled VEC. However, the time-varying channel condition causes more challenges to solve this problem. In this article, we propose a distributed deep reinforcement learning (DRL)-based quantization level allocation scheme to optimize the long-term reward in terms of the total training time and QE. Extensive simulations identify the optimal weighted factors between the total training time and QE, and demonstrate the feasibility and effectiveness of the proposed scheme.
引用
收藏
页码:4899 / 4913
页数:15
相关论文
共 48 条
[1]  
Alistarh D, 2017, ADV NEUR IN, V30
[2]  
Chen M., 2021, "IEEE Trans.Wireless Commun., V20, P2471
[3]   Dynamic Aggregation for Heterogeneous Quantization in Federated Learning [J].
Chen, Shengbo ;
Shen, Cong ;
Zhang, Lanxue ;
Tang, Yuanmin .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (10) :6804-6819
[4]  
Cover M., 2006, Elements of Information Theory, P41
[5]  
Honig Robert, 2022, Proceedings of Machine Learning Research, P8852
[6]   ADAPTIVE QUANTIZATION OF MODEL UPDATES FOR COMMUNICATION-EFFICIENT FEDERATED LEARNING [J].
Jhunjhunwala, Divyansh ;
Gadhikar, Advait ;
Joshi, Gauri ;
Eldar, Yonina C. .
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :3110-3114
[7]   Model Pruning Enables Efficient Federated Learning on Edge Devices [J].
Jiang, Yuang ;
Wang, Shiqiang ;
Valls, Victor ;
Ko, Bong Jun ;
Lee, Wei-Han ;
Leung, Kin K. ;
Tassiulas, Leandros .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) :10374-10386
[8]   Intelligent Dual Active Protocol Stack Handover Based on Double DQN Deep Reinforcement Learning for 5G mmWave Networks [J].
Lee, Changsung ;
Jung, Jaewook ;
Chung, Jong-Moon .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (07) :7572-7584
[9]   Mobility-Aware Multi-Hop Task Offloading for Autonomous Driving in Vehicular Edge Computing and Networks [J].
Liu, Lei ;
Zhao, Ming ;
Yu, Miao ;
Jan, Mian Ahmad ;
Lan, Dapeng ;
Taherkordi, Amirhosein .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (02) :2169-2182
[10]   Training time minimization for federated edge learning with optimized gradient quantization and bandwidth allocation [J].
Liu, Peixi ;
Jiang, Jiamo ;
Zhu, Guangxu ;
Cheng, Lei ;
Jiang, Wei ;
Luo, Wu ;
Du, Ying ;
Wang, Zhiqin .
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 23 (08) :1247-1263