Do road users play Nash Equilibrium? A comparison between Nash and Logistic stochastic Equilibriums for multiagent modeling of road user interactions in shared spaces

Cited by: 13

Authors:

Alsaleh, Rushdi [1]

Sayed, Tarek [1]

Affiliations:

[1] Univ British Columbia, Dept Civil Engn, 6250 Appl Sci Lane, Vancouver, BC V6T 1Z4, Canada

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2022 / Vol. 205

Keywords:

Shared space; Microsimulation; Cyclist and pedestrian; Multiagent model; Reward function; Nash Equilibrium; FORCE MODEL; PEDESTRIANS; BEHAVIOR; WALKING;

DOI:

10.1016/j.eswa.2022.117710

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Analyzing and modeling multiagent transportation systems such as cyclist-pedestrian interactions and their evasive-action mechanisms in shared spaces are important for operation and safety evaluations. However, there are limited studies that (1) modeled the multiagent nature of road user interactions and their concurrent sequential decision processes, and (2) investigated the ability of different equilibrium behavioral theories in predicting road user operational-level decisions and evasive-action mechanisms. This study proposes two novel multiagent approaches based on different equilibrium theories for modeling road user interactions: (1) the Multiagent Generative Adversarial Imitation Learning (MAGAIL), which utilizes Nash-Equilibrium (NE) theory that assumes fully rational and optimal road user behavior, and (2) the Multiagent Adversarial Inverse Reinforcement Learning (MAAIRL), which utilizes Logistic-Stochastic-Best-Response-Equilibrium (LSBRE) theory that handles bounded rationality (sub-optimal) behavior. Unlike many of the traditional game-theoretic modeling approaches, which consider single time-step payoff modeling, the proposed approaches depend on Markov-Game that accounts for the stochastic nature of road user interactions and their sequential decision processes. The models recover road user multiagent reward functions and estimate their strategies using Multiagent Deep Reinforcement Learning. Using trajectories from three shared spaces in the USA and Canada, the study compared the proposed approaches' results and determined a behavior-based consistent paradigm to model equilibrium in multiagent transportation systems. The results show that LSBRE-based model predicted road user trajectories and their evasive action mechanisms with higher accuracy compared to the NE-based model.
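The contrast the abstract draws between fully rational play and bounded rationality can be illustrated with a toy example. The sketch below is not the paper's MAGAIL or MAAIRL model and does not use a Markov game; it is a one-shot, two-action matrix game with hypothetical payoffs, in which each agent chooses actions with logistic (softmax) probabilities over expected payoffs. The rationality parameter `lam`, the payoff values, and all function names are assumptions made for illustration only. With `lam = 0` the agents choose uniformly at random; as `lam` grows, play approaches the best response that a Nash-style model assumes.

```python
import math

def softmax(values, lam):
    """Logistic choice probabilities; lam is an assumed rationality parameter."""
    m = max(values)  # subtract the max for numerical stability
    exps = [math.exp(lam * (v - m)) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

def expected_payoffs(payoff, opponent_mix):
    """Expected payoff of each own action against the opponent's mixed strategy."""
    return [sum(p * q for p, q in zip(row, opponent_mix)) for row in payoff]

def logit_response_equilibrium(payoff_a, payoff_b, lam, iters=2000, step=0.1):
    """Damped fixed-point iteration toward a logistic stochastic best response.

    payoff_a[i][j]: payoff to agent A when A plays i and B plays j.
    payoff_b[j][i]: payoff to agent B when B plays j and A plays i.
    """
    mix_a = [1.0 / len(payoff_a)] * len(payoff_a)
    mix_b = [1.0 / len(payoff_b)] * len(payoff_b)
    for _ in range(iters):
        target_a = softmax(expected_payoffs(payoff_a, mix_b), lam)
        target_b = softmax(expected_payoffs(payoff_b, mix_a), lam)
        mix_a = [(1 - step) * x + step * t for x, t in zip(mix_a, target_a)]
        mix_b = [(1 - step) * x + step * t for x, t in zip(mix_b, target_b)]
    return mix_a, mix_b

# Hypothetical payoffs (not from the paper): action 0 = keep course, 1 = evade.
# Evading is strictly better in this toy game, so "both evade" is the only
# pure-strategy Nash equilibrium.
PAYOFF = [[-4.0, 1.0],
          [0.0, 2.0]]

if __name__ == "__main__":
    for lam in (0.0, 1.0, 20.0):
        cyclist, pedestrian = logit_response_equilibrium(PAYOFF, PAYOFF, lam)
        print(f"lam={lam}: P(cyclist evades)={cyclist[1]:.3f}")
```

At intermediate `lam` the agents evade most of the time but still keep course with some probability, which is the kind of sub-optimal behavior a bounded-rationality equilibrium is meant to capture.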


Pages: 20
