A Selective Federated Reinforcement Learning Strategy for Autonomous Driving

被引:49
作者
Fu, Yuchuan [1 ,2 ]
Li, Changle [1 ,2 ]
Yu, F. Richard [3 ]
Luan, Tom H. [4 ]
Zhang, Yao [5 ]
机构
[1] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Shaanxi, Peoples R China
[2] Xidian Univ, Res Inst Smart Transportat, Xian 710071, Shaanxi, Peoples R China
[3] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON K1S 5B6, Canada
[4] Xidian Univ, Sch Cyber Engn, Xian 710071, Shaanxi, Peoples R China
[5] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous vehicles; Adaptation models; Reinforcement learning; Computational modeling; Data models; Training; Task analysis; Networked autonomous driving; federated learning; knowledge aggregation; WIRELESS NETWORKS; BLOCKCHAIN; MODEL; DESIGN;
D O I
10.1109/TITS.2022.3219644
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Currently, the complex traffic environment challenges the fast and accurate response of a connected autonomous vehicle (CAV). More importantly, it is difficult for different CAVs to collaborate and share knowledge. To remedy that, this paper proposes a selective federated reinforcement learning (SFRL) strategy to achieve online knowledge aggregation strategy to improve the accuracy and environmental adaptability of the autonomous driving model. First, we propose a federated reinforcement learning framework that allows participants to use the knowledge of other CAVs to make corresponding actions, thereby realizing online knowledge transfer and aggregation. Second, we use reinforcement learning to train local driving models of CAVs to cope with collision avoidance tasks. Third, considering the efficiency of federated learning (FL) and the additional communication overhead it brings, we propose a CAVs selection strategy before uploading local models. When selecting CAVs, we consider the reputation of CAVs, the quality of local models, and time overhead, so as to select as many high-quality users as possible while considering resources and time constraints. With above strategic processes, our framework can aggregate and reuse the knowledge learned by CAVs traveling in different environments to assist in driving decisions. Extensive simulation results validate that our proposal can improve model accuracy and learning efficiency while reducing communication overhead.
引用
收藏
页码:1655 / 1668
页数:14
相关论文
共 36 条
[1]  
Abdel-Aziz M. K., 2020, ARXIV
[2]   Interpretable End-to-End Urban Autonomous Driving With Latent Deep Reinforcement Learning [J].
Chen, Jianyu ;
Li, Shengbo Eben ;
Tomizuka, Masayoshi .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) :5068-5078
[3]   Artificial Neural Networks-Based Machine Learning for Wireless Networks: A Tutorial [J].
Chen, Mingzhe ;
Challita, Ursula ;
Saad, Walid ;
Yin, Changchuan ;
Debbah, Merouane .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2019, 21 (04) :3039-3071
[4]   A Vision of C-V2X: Technologies, Field Testing, and Challenges With Chinese Development [J].
Chen, Shanzhi ;
Hu, Jinling ;
Shi, Yan ;
Zhao, Li ;
Li, Wen .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (05) :3872-3881
[5]   LTE-V: A TD-LTE-Based V2X Solution for Future Vehicular Network [J].
Chen, Shanzhi ;
Hu, Jinling ;
Shi, Yan ;
Zhao, Li .
IEEE INTERNET OF THINGS JOURNAL, 2016, 3 (06) :997-1005
[6]   A Survey of Driving Safety With Sensing, Vehicular Communications, and Artificial Intelligence-Based Collision Avoidance [J].
Fu, Yuchuan ;
Li, Changle ;
Yu, Fei Richard ;
Luan, Tom H. ;
Zhang, Yao .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) :6142-6163
[7]   A Decision-Making Strategy for Vehicle Autonomous Braking in Emergency via Deep Reinforcement Learning [J].
Fu, Yuchuan ;
Li, Changle ;
Yu, Fei Richard ;
Luan, Tom H. ;
Zhang, Yao .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (06) :5876-5888
[8]   Vehicular Blockchain-Based Collective Learning for Connected and Autonomous Vehicles [J].
Fu, Yuchuan ;
Yu, Fei Richard ;
Li, Changle ;
Luan, Tom H. ;
Zhang, Yao .
IEEE WIRELESS COMMUNICATIONS, 2020, 27 (02) :197-203
[9]   Graded Warning for Rear-End Collision: An Artificial Intelligence-Aided Algorithm [J].
Fu, Yuchuan ;
Li, Changle ;
Luan, Tom H. ;
Zhang, Yao ;
Yu, Fei Richard .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (02) :565-579
[10]  
Hsieh K, 2017, PROCEEDINGS OF NSDI '17: 14TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION, P629