Recent advances in reinforcement learning-based autonomous driving behavior planning: A survey

被引:20
作者
Wu, Jingda [1 ]
Huang, Chao [2 ]
Huang, Hailong [1 ]
Lv, Chen [3 ]
Wang, Yuntong [4 ]
Wang, Fei-Yue [4 ]
机构
[1] Hong Kong Polytech Univ, Dept Aeronaut & Aviat Engn, Hong Kong, Peoples R China
[2] Hong Kong Polytech Univ, Dept Ind & Syst Engn, Hong Kong, Peoples R China
[3] Nanyang Technol Univ, Sch Mech & Aerosp Engn, Nanyang 639798, Singapore
[4] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
关键词
Autonomous driving; Reinforcement learning; Behavior planning; Decision; Autonomous vehicle; DECISION-MAKING; SAFE; VEHICLES; MODEL; SCENARIOS; POLICIES; EFFICIENT; BARRIER;
D O I
10.1016/j.trc.2024.104654
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Autonomous driving (AD) holds the potential to revolutionize transportation efficiency, but its success hinges on robust behavior planning (BP) mechanisms. Reinforcement learning (RL) emerges as a pivotal tool in crafting these BP strategies. This paper offers a comprehensive review of RL-based BP strategies, spotlighting advancements from 2021 to 2023. We completely organize and distill the relevant literature, emphasizing paradigm shifts in RL-based BP. Introducing a novel categorization, we trace the trajectory of efforts aimed at surmounting practical challenges encountered by autonomous vehicles through innovative RL techniques. To guide readers, we furnish a quantitative analysis that maps the volume and diversity of recent RL configurations, elucidating prevailing trends. Additionally, we delve into the imminent challenges and potential directions for the future of RL-driven BP in AD. These directions encompass addressing safety vulnerabilities, fostering continual learning capabilities, enhancing data efficiency, championing collaborative vehicular cloud networks, integrating large language models, and enhancing ethical considerations.
引用
收藏
页数:28
相关论文
共 225 条
[1]   Self-Learned Autonomous Driving at Unsignalized Intersections: A Hierarchical Reinforced Learning Approach for Feasible Decision-Making [J].
Al-Sharman, Mohammad ;
Dempster, Rowan ;
Daoud, Mohamed A. ;
Nasr, Mahmoud ;
Rayside, Derek ;
Melek, William .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (11) :12345-12356
[2]   Deep Reinforcement Learning With NMPC Assistance Nash Switching for Urban Autonomous Driving [J].
Alighanbari, Sina ;
Azad, Nasser L. .
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (03) :2604-2615
[3]   Learning Robust Control Policies for End-to-End Autonomous Driving From Data-Driven Simulation [J].
Amini, Alexander ;
Gilitschenski, Igor ;
Phillips, Jacob ;
Moseyko, Julia ;
Banerjee, Rohan ;
Karaman, Sertac ;
Rus, Daniela .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :1143-1150
[4]   Multi-Agent Deep Reinforcement Learning to Manage Connected Autonomous Vehicles at Tomorrow's Intersections [J].
Antonio, Guillen-Perez ;
Maria-Dolores, Cano .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (07) :7033-7043
[5]   An End-to-End Curriculum Learning Approach for Autonomous Driving Scenarios [J].
Anzalone, Luca ;
Barra, Paola ;
Barra, Silvio ;
Castiglione, Aniello ;
Nappi, Michele .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) :19817-19826
[6]   Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm [J].
Ashraf, Nesma M. ;
Mostafa, Reham R. ;
Sakr, Rasha H. ;
Rashad, M. Z. .
PLOS ONE, 2021, 16 (06)
[7]  
Bacon PL, 2017, AAAI CONF ARTIF INTE, P1726
[8]  
Badia AP, 2020, Arxiv, DOI [arXiv:2002.06038, 10.48550/arXiv.2002.06038]
[9]  
Baheri A, 2020, IEEE INT VEH SYM, P1550, DOI 10.1109/IV47402.2020.9304744
[10]   Hybrid Reinforcement Learning-Based Eco-Driving Strategy for Connected and Automated Vehicles at Signalized Intersections [J].
Bai, Zhengwei ;
Hao, Peng ;
Shangguan, Wei ;
Cai, Baigen ;
Barth, Matthew J. .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09) :15850-15863