Simulation of human-vehicle interaction at right-turn unsignalized intersections: A game-theoretic deep maximum entropy inverse reinforcement learning method

Cited by: 0
Authors
Li, Wenli [1]
Li, Xianglong [1]
Li, Lingxi [2]
Tang, Yuanhang [1]
Hu, Yuanzhi [1]
Affiliations
[1] Chongqing Univ Technol, Key Lab Adv Mfg Technol Automobile Parts, Minist Educ, 69 Hongguang Ave, Chongqing 400054, Peoples R China
[2] Purdue Univ, Elmore Family Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
Funding
National Natural Science Foundation of China;
Keywords
Human-vehicle interaction; Game theory; Inverse reinforcement learning; Pedestrian simulation; Reward function; SOCIAL FORCE MODEL; PEDESTRIAN BEHAVIOR;
DOI
10.1016/j.aap.2025.107960
Chinese Library Classification (CLC) number
TB18 [Ergonomics];
Discipline classification code
1201;
Abstract
The safety of pedestrians in urban transportation systems has emerged as a significant research topic. As a vulnerable group within this transportation framework, pedestrians encounter heightened safety risks in complex urban road environments. Protecting this group and safeguarding their rights and interests in urban transportation has garnered attention from academia and industry. The objective of this study is to develop a reliable simulation model that represents pedestrian crossing behavior at unsignalized crosswalks. A data-driven human-vehicle interaction behavior modeling framework is proposed, describing the human-vehicle interaction process at right-turn unsignalized intersections as a standard Markov decision process. In this framework, pedestrians are treated as the primary agents, and human-vehicle interactions are described using game theory. The Deep Maximum Entropy Inverse Reinforcement Learning (DMIRL) approach, combined with game theory, is employed to identify a reward function that encapsulates these interactions. A Deep Q-network (DQN) algorithm is then designed to simulate pedestrian crossing behavior based on the derived reward function. Finally, a comparison with a baseline algorithm that does not account for the game dynamics validates the proposed framework's effectiveness and feasibility.
Pages: 13
Related papers
49 in total
  • [1] Social LSTM: Human Trajectory Prediction in Crowded Spaces
    Alahi, Alexandre
    Goel, Kratarth
    Ramanathan, Vignesh
    Robicquet, Alexandre
    Li Fei-Fei
    Savarese, Silvio
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016 : 961 - 971
  • [2] Driver Modeling Through Deep Reinforcement Learning and Behavioral Game Theory
    Albaba, Berat Mert
    Yildiz, Yildiray
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2022, 30 (02) : 885 - 892
  • [3] Do road users play Nash Equilibrium? A comparison between Nash and Logistic stochastic Equilibriums for multiagent modeling of road user interactions in shared spaces
    Alsaleh, Rushdi
    Sayed, Tarek
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 205
  • [4] Markov-game modeling of cyclist-pedestrian interactions in shared spaces: A multi-agent adversarial inverse reinforcement learning approach
    Alsaleh, Rushdi
    Sayed, Tarek
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 128
  • [5] Microscopic modeling of cyclists interactions with pedestrians in shared spaces: a Gaussian process inverse reinforcement learning approach
    Alsaleh, Rushdi
    Sayed, Tarek
    [J]. TRANSPORTMETRICA A-TRANSPORT SCIENCE, 2022, 18 (03) : 828 - 854
  • [6] Basar T., 1999, Dynamic noncooperative game theory, 2nd ed., DOI 10.1137/1.9781611971132
  • [7] Fujiwara-Greve T, 2015, Non-Cooperative Game Theory
  • [8] Foundation Intelligence for Smart Infrastructure Services in Transportation 5.0
    Han, Xu
    Meng, Zonglin
    Xia, Xin
    Liao, Xishun
    He, Brian Yueshuai
    Zheng, Zhaoliang
    Wang, Yutong
    Xiang, Hao
    Zhou, Zewei
    Gao, Letian
    Fan, Lili
    Li, Yuke
    Ma, Jiaqi
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01) : 39 - 47
  • [9] SOCIAL FORCE MODEL FOR PEDESTRIAN DYNAMICS
    HELBING, D
    MOLNAR, P
    [J]. PHYSICAL REVIEW E, 1995, 51 (05) : 4282 - 4286
  • [10] The nash equilibrium: A perspective
    Holt, CA
    Roth, AE
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (12) : 3999 - 4002