Simulation of human-vehicle interaction at right-turn unsignalized intersections: A game-theoretic deep maximum entropy inverse reinforcement learning method

被引：0

作者：

Li, Wenli ^{[1
]}

Li, Xianglong ^{[1
]}

Li, Lingxi ^{[2
]}

Tang, Yuanhang ^{[1
]}

Hu, Yuanzhi ^{[1
]}

机构：

[1] Chongqing Univ Technol, Key Lab Adv Mfg Technol Automobile Parts, Minist Educ, 69 Hongguang Ave, Chongqing 400054, Peoples R China

[2] Purdue Univ, Elmore Family Sch Elect & Comp Engn, W Lafayette, IN 47907 USA

来源：

ACCIDENT ANALYSIS AND PREVENTION | 2025年 / 214卷

基金：

中国国家自然科学基金;

关键词：

Human-vehicle interaction; Game theory; Inverse reinforcement learning; Pedestrian simulation; Reward function; SOCIAL FORCE MODEL; PEDESTRIAN BEHAVIOR;

D O I：

10.1016/j.aap.2025.107960

中图分类号：

TB18 [人体工程学];

学科分类号：

1201 ;

摘要：

The safety of pedestrians in urban transportation systems has emerged as a significant research topic. As a vulnerable group within this transportation framework, pedestrians encounter heightened safety risks in complex urban road environments. Protecting this group and safeguarding their rights and interests in urban transportation has garnered attention from academia and industry. The objective of this study is to develop a reliable simulation model that represents pedestrian crossing behavior at unsignalized crosswalks. A data- driven human-vehicle interaction behavior modeling framework is proposed, describing the human-vehicle interaction process at right-turning unsignalized intersections as a standard Markov decision-making process. In this framework, pedestrians are treated as the primary agents, and human-vehicle interactions are described using game theory. The Deep Maximum Entropy Inverse Reinforcement Learning (DMIRL) approach, combined with game theory, is employed to identify a reward function that encapsulates these interactions. The Deep Q-network (DQN) algorithm is then designed to simulate pedestrian crossing behavior based on the derived reward function. Finally, a comparison with a baseline algorithm that does not account for the game dynamics validates the proposed framework's effectiveness and feasibility.

引用

页数：13

共 49 条

[1] Social LSTM: Human Trajectory Prediction in Crowded Spaces [J].