Dynamic Programming for One-sided Partially Observable Pursuit-evasion Games

被引：1

作者：

Horak, Karel ^{[1
]}

Bosansky, Branislav ^{[1
]}

机构：

[1] Czech Tech Univ, Dept Comp Sci, Fac Elect Engn, Prague, Czech Republic

来源：

ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2 | 2017年

关键词：

Pursuit-evasion Games; One-sided Partial Observability; Infinite Horizon; Value Iteration; Concurrent Moves;

D O I：

10.5220/0006190605030510

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Pursuit-evasion scenarios appear widely in robotics, security domains, and many other real-world situations. We focus on two-player pursuit-evasion games with concurrent moves, infinite horizon, and discounted rewards. We assume that the players have partial observability, however, the evader has an advantage of knowing the current position of pursuer's units. This setting is particularly interesting for security domains where a robust strategy, maximizing the utility in the worst-case scenario, is often desirable. We provide, to the best of our knowledge, the first algorithm that provably converges to the value of a partially observable pursuit-evasion game with infinite horizon. Our algorithm extends well-known value iteration algorithm by exploiting that (1) value functions of our game depend only on the position of the pursuer and the belief he has about the position of the evader, and (2) that these functions are piecewise linear and convex in the belief space.

引用

页码：503 / 510

页数：8

共 12 条

[1] [Anonymous], 2003, IJCAI, DOI DOI 10.5555/1630659.1630806
[2] [Anonymous], 2012, ARXIV12071412
[3] Search and pursuit-evasion in mobile robotics A survey
Chung, Timothy H.
Hollinger, Geoffrey A.
Isler, Volkan
[J]. AUTONOMOUS ROBOTS, 2011, 31 (04) : 299 - 316
[4] Ciesielski K, 2007, BANACH J MATH ANAL, V1, P1
[5] Hansen EA, 2004, PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, P709
[6] Efficient computation of equilibria for extensive two-person games
Koller, D
Megiddo, N
vonStengel, B
[J]. GAMES AND ECONOMIC BEHAVIOR, 1996, 14 (02) : 247 - 259
[7] Some classes of imperfect information finite state-space stochastic games with finite-dimensional solutions
McEneaney, WM
[J]. APPLIED MATHEMATICS AND OPTIMIZATION, 2004, 50 (02) : 87 - 118
[8] STATE-OF-THE-ART - A SURVEY OF PARTIALLY OBSERVABLE MARKOV DECISION-PROCESSES - THEORY, MODELS, AND ALGORITHMS
MONAHAN, GE
[J]. MANAGEMENT SCIENCE, 1982, 28 (01) : 1 - 16
[9] STOCHASTIC GAMES
SHAPLEY, LS
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1953, 39 (10) : 1095 - 1100
[10] OPTIMAL CONTROL OF PARTIALLY OBSERVABLE MARKOV PROCESSES OVER A FINITE HORIZON
SMALLWOOD, RD
SONDIK, EJ
[J]. OPERATIONS RESEARCH, 1973, 21 (05) : 1071 - 1088

← 1 2 →