How do active road users act around autonomous vehicles? An inverse reinforcement learning approach

被引:10
作者
Alozi, Abdul Razak [1 ]
Hussein, Mohamed [1 ]
机构
[1] McMaster Univ, Dept Civil Engn, 1280 Main St W, Hamilton, ON L8S 4L8, Canada
关键词
Autonomous vehicles; Active road users; Inverse reinforcement learning; CYCLISTS; FLOW;
D O I
10.1016/j.trc.2024.104572
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
The inevitable impact of autonomous vehicles (AV) on traffic safety is becoming a reality with the progressive deployment of these vehicles in different parts of the world. Still, many questions linger in the minds of road users that will share the road and interact with these AVs on a daily basis. To answer some of these questions, this study utilized recently collected real-world AV data from the United States, with the focus mainly targeting active road users. Specifically, the 1,492 h of recorded trips were processed to extract AV-pedestrian and AV-cyclist interactions of different movement types. The interactions were then investigated to gain a better understanding of the active road users ' behavior, while excluding any interactions that involved intervention from the AVs ' human test drivers. Through deep maximum entropy inverse reinforcement learning (DMEIRL), the reward functions describing the utility of active road users were retrieved and assessed for five interaction scenarios, including parallel, opposing, crossing, and turning (left and right) interactions. In addition, the policies developed as part of the solution were used to simulate the behavior of active road users and validate the resulting conflicts in terms of safety and evasive actions. Overall, the utilized approach demonstrated high accuracy in mimicking the interaction behavior of active road users when encountering an AV, with 81 - 84% accuracy in predicting evasive actions in parallel and opposing interactions and 12 - 17% mean absolute error for safety indicators in crossing and turning interactions. In addition, the resulting reward functions provided reliable insight onto the preferences and considerations of the active road users in these situations. Overall, cyclists tend to be less cautious around AVs compared to pedestrians, while the AVs tend to slow down and leave sufficient distance from all active road users in most cases. Still, the robotic behavior of the AVs, which can sometimes be inconsistent, leads to risky evasive actions by the active road users, which can affect other road users in busy intersections.
引用
收藏
页数:20
相关论文
共 79 条
[1]  
Abbeel P., 2004, Proceedings of the Twenty-First International Conference on Machine Learning, P1
[2]   Enhancing autonomous vehicle hyperawareness in busy traffic environments: A machine learning approach [J].
Alozi, Abdul Razak ;
Hussein, Mohamed .
ACCIDENT ANALYSIS AND PREVENTION, 2024, 198
[3]   Active Road User Interactions With Autonomous Vehicles: Proactive Safety Assessment [J].
Alozi, Abdul Razak ;
Hussein, Mohamed .
TRANSPORTATION RESEARCH RECORD, 2023, 2677 (07) :74-89
[4]   Evaluating the safety of autonomous vehicle-pedestrian interactions: An extreme value theory approach [J].
Alozi, Abdul Razak ;
Hussein, Mohamed .
ANALYTIC METHODS IN ACCIDENT RESEARCH, 2022, 35
[5]   Markov-game modeling of cyclist-pedestrian interactions in shared spaces: A multi-agent adversarial inverse reinforcement learning approach [J].
Alsaleh, Rushdi ;
Sayed, Tarek .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 128
[6]   Microscopic modeling of cyclists interactions with pedestrians in shared spaces: a Gaussian process inverse reinforcement learning approach [J].
Alsaleh, Rushdi ;
Sayed, Tarek .
TRANSPORTMETRICA A-TRANSPORT SCIENCE, 2022, 18 (03) :828-854
[7]   Modeling pedestrian-cyclist interactions in shared space using inverse reinforcement learning [J].
Alsaleh, Rushdi ;
Sayed, Tarek .
TRANSPORTATION RESEARCH PART F-TRAFFIC PSYCHOLOGY AND BEHAVIOUR, 2020, 70 :37-57
[8]   Discrete choice models of pedestrian walking behavior [J].
Antonini, Gianluca ;
Bierlaire, Michel ;
Weber, Mats .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2006, 40 (08) :667-687
[9]   On the use of virtual immersive reality for discrete choice experiments to modelling pedestrian behaviour [J].
Arellana, J. ;
Garzon, L. ;
Estrada, J. ;
Cantillo, V. .
JOURNAL OF CHOICE MODELLING, 2020, 37
[10]   A survey of inverse reinforcement learning: Challenges, methods and progress [J].
Arora, Saurabh ;
Doshi, Prashant .
ARTIFICIAL INTELLIGENCE, 2021, 297 (297)