Stochastic Dynamic Games in Belief Space

被引：19

作者：

Schwarting, Wilko ^{[1
]}

Pierson, Alyssa ^{[1
]}

Karaman, Sertac ^{[2
]}

Rus, Daniela ^{[1
]}

机构：

[1] MIT, Comp Sci & Artificial Intelligence Lab CSAIL, 77 Massachusetts Ave, Cambridge, MA 02139 USA

[2] MIT, Lab Informat Decis Syst LIDS, 77 Massachusetts Ave, Cambridge, MA 02139 USA

来源：

IEEE TRANSACTIONS ON ROBOTICS | 2021年 / 37卷 / 06期

关键词：

Games; Uncertainty; Robots; Vehicle dynamics; Planning; Nash equilibrium; Approximation algorithms; Game-theoretic planning; motion and path planning; multirobot systems; optimization and optimal control; OPTIMIZATION;

D O I：

10.1109/TRO.2021.3075376

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Information gathering while interacting with other agents under sensing and motion uncertainty is critical in domains such as driving, service robots, racing, or surveillance. The interests of agents may be at odds with others, resulting in a stochastic noncooperative dynamic game. Agents must predict others' future actions without communication, incorporate their actions into these predictions, account for uncertainty and noise in information gathering, and consider what information their actions reveal. Our solution uses local iterative dynamic programming in Gaussian belief space to solve a game-theoretic continuous POMDP. Solving a quadratic game in the backward pass of a game-theoretic belief-space variant of iterative linear-quadratic Gaussian control (iLQG) achieves a runtime polynomial in the number of agents and linear in the planning horizon. Our algorithm yields linear feedback policies for our robot, and predicted feedback policies for other agents. We present three applications: Active surveillance, guiding eyes for a blind agent, and autonomous racing. Agents with game-theoretic belief-space planning win 44% more races than without game theory and 34% more than without belief-space planning.

引用

页码：2157 / 2172

页数：16

共 50 条

[31] Stochastic bankruptcy games
Helga Habis
P. Jean Jacques Herings
International Journal of Game Theory, 2013, 42 : 973 - 988
[32] Purified Policy Space Response Oracles for Symmetric Zero-Sum Games
Shao, Zhengdao
Zhuang, Liansheng
Huang, Yihong
Li, Houqiang
Wang, Shafei
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[33] Random belief equilibrium in normal form games
Friedman, JW
Mezzetti, C
GAMES AND ECONOMIC BEHAVIOR, 2005, 51 (02) : 296 - 323
[34] A survey of static and dynamic potential games
Gonzalez-Sanchez, David
Hernandez-Lerma, Onesimo
SCIENCE CHINA-MATHEMATICS, 2016, 59 (11) : 2075 - 2102
[35] Dynamic Taxes for Polynomial Congestion Games
Bilo, Vittorio
Vinci, Cosimo
ACM TRANSACTIONS ON ECONOMICS AND COMPUTATION, 2019, 7 (03)
[36] Dynamic Policy Decision/Enforcement Security Zoning Through Stochastic Games and Meta Learning
Bello, Yahuza
Hussein, Ahmed Refaey
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2025, 22 (01): : 807 - 821
[37] Agreement and stochastic independence of belief functions
Lo, KC
MATHEMATICAL SOCIAL SCIENCES, 2006, 51 (01) : 1 - 22
[38] Constrained Discounted Stochastic Games
Anna Jaśkiewicz
Andrzej S. Nowak
Applied Mathematics & Optimization, 2022, 85
[39] Stochastic shortest path games
Patek, SD
Bertsekas, DP
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1999, 37 (03) : 804 - 824
[40] Constrained Discounted Stochastic Games
Jaskiewicz, Anna
Nowak, Andrzej S.
APPLIED MATHEMATICS AND OPTIMIZATION, 2022, 85 (02)

← 1 2 3 4 5 →