Federated Learning on the Road Autonomous Controller Design for Connected and Autonomous Vehicles

被引:63
作者
Zeng, Tengchan [1 ]
Semiari, Omid [2 ]
Chen, Mingzhe [3 ,4 ,5 ]
Saad, Walid [6 ]
Bennis, Mehdi [7 ]
机构
[1] Ford Motor Co, Dearborn, MI 48124 USA
[2] Univ Colorado, Dept Elect & Comp Engn, Colorado Springs, CO 80918 USA
[3] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
[4] Chinese Univ Hong Kong, Shenzhen Res Inst Big Data SRIBD, Shenzhen 518172, Peoples R China
[5] Chinese Univ ofHong Kong, Future Network Intelligence Inst FNii, Shenzhen 518172, Peoples R China
[6] Virginia Tech, Dept Elect & Comp Engn, Arlington, VA 22203 USA
[7] Univ Oulu, Ctr Wireless Commun, Oulu 90570, Finland
基金
芬兰科学院; 美国国家科学基金会;
关键词
Wireless networks; machine learning; control systems; OPTIMIZATION; COMMUNICATION;
D O I
10.1109/TWC.2022.3183996
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The deployment of future intelligent transportation systems is contingent upon seamless and reliable operation of connected and autonomous vehicles (CAVs). One key challenge in developing CAVs is the design of an autonomous controller that can accurately execute near real-time control decisions, such as a quick acceleration when merging to a highway and frequent speed changes in a stop-and-go traffic. However, the use of conventional feedback controllers or traditional learning based controllers, solely trained by each CAV's local data, cannot guarantee a robust controller performance over a wide range of road conditions and traffic dynamics. In this paper, a new federated learning (FL) framework enabled by large-scale wireless connectivity is proposed for designing the autonomous controller of CAVs. In this framework, the learning models used by the controllers are collaboratively trained among a group of CAVs. To capture the varying CAV participation in the FL training process and the diverse local data quality among CAVs, a novel dynamic federated proximal (DFP) algorithm is proposed that accounts for the mobility of CAVs, the wireless fading channels, as well as the unbalanced and non-independent and identically distributed data across CAVs. A rigorous convergence analysis is performed for the proposed algorithm to identify how fast the CAVs converge to using the optimal autonomous controller.In particular, the impacts of varying CAV participation in the FL process and diverse CAV data quality on the convergence of the proposed DFP algorithm are explicitly analyzed. Leveraging this analysis, an incentive mechanism based on contract theory is designed to improve the FL convergence speed. Simulation results using real vehicular data traces show that the proposed DFP-based controller can accurately track the target CAV speed over time and under different traffic scenarios. Moreover, the results show that the proposed DFP algorithm has a much faster convergence compared to popular FL algorithms such as federated averaging (FedAvg) and federated proximal (FedProx). The results also validate the feasibility of the contract-theoretic incentive mechanism and show that the proposed mechanism can improve the convergence speed of the DFP algorithm by 40% compared to the baselines.
引用
收藏
页码:10407 / 10423
页数:17
相关论文
共 38 条
[1]  
Acosta-Marum G., 2006, P INT S WIR PERS MUL, P1
[2]  
Bolton P., 2005, CONTRACT THEORY
[3]   Optimization Methods for Large-Scale Machine Learning [J].
Bottou, Leon ;
Curtis, Frank E. ;
Nocedal, Jorge .
SIAM REVIEW, 2018, 60 (02) :223-311
[4]  
Boyd S., 2004, CONVEX OPTIMIZATION
[5]  
Chen B., 2019, PROC ADV NEURAL INF, P1
[6]   A Joint Learning and Communications Framework for Federated Learning Over Wireless Networks [J].
Chen, Mingzhe ;
Yang, Zhaohui ;
Saad, Walid ;
Yin, Changchuan ;
Poor, H. Vincent ;
Cui, Shuguang .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (01) :269-283
[7]   Wireless Communications for Collaborative Federated Learning [J].
Chen, Mingzhe ;
Poor, H. Vincent ;
Saad, Walid ;
Cui, Shuguang .
IEEE COMMUNICATIONS MAGAZINE, 2020, 58 (12) :48-54
[8]   An improved harmony search algorithm for synchronization of discrete-time chaotic systems [J].
Coelho, Leandro dos Santos ;
de Andrade Bernert, Diego Luis .
CHAOS SOLITONS & FRACTALS, 2009, 41 (05) :2526-2532
[9]  
Ge S.S., 2013, Stable Adaptive Neural Network Control, V13
[10]  
Gu TY, 2015, IEEE INT C INT ROBOT, P250, DOI 10.1109/IROS.2015.7353382