A Monte Carlo Tree Search approach to finding efficient patrolling schemes on graphs

被引：16

作者：

Karwowski, Jan ^{[1
]}

Mandziuk, Jacek ^{[1
]}

机构：

[1] Warsaw Univ Technol, Fac Math & Informat Sci, Koszykowa 75, PL-00662 Warsaw, Poland

来源：

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH | 2019年 / 277卷 / 01期

关键词：

Game theory; Stackelberg games; MOTS; Security games; SECURITY GAMES; TIME; UCT; INTERDICTION; INFORMATION; EQUILIBRIA; ALGORITHM;

D O I：

10.1016/j.ejor.2019.02.017

中图分类号：

C93 [管理学];

学科分类号：

12 ; 1201 ; 1202 ; 120202 ;

摘要：

In this paper, we propose an evader-defender type of game for modeling multi-step patrolling scenarios on a graph. The game utilizes a specifically designed graph-based setting which captures spatial arrangements of the protected area, for instance industrial premises or warehouses, wherein certain valuable assets are stored. The game is played by two sides: the evader who attempts to steal or destroy the assets and the defender whose aim is to intercept the evader and prevent him/her from accomplishing his/her goal. Real-life specificity of the proposed game assumes information asymmetry between the two sides as the evader can usually observe defender's patrolling schedules prior to making decision of an attack. For this reason, we employ the Stackelberg Game principles to model our game and consequently focus on approximation of Stackelberg Equilibrium during the solution process. To this end we propose a novel approach, called Mixed-UCT, which relies on Upper Confidence Bound applied to Trees algorithm - a variant of Monte Carlo Tree Search. The efficacy of the proposed solution method is experimentally evaluated on randomly generated games played in warehouse-like, industrial environment. The results show that Mixed-UCT is efficient and scales very well for multi-step games with reasonable number of steps, leading to optimal or close-to-optimal strategies. (C) 2019 Elsevier B.V. All rights reserved.

引用

页码：255 / 268

页数：14

共 73 条

[1] A Deployed Quantal Response-Based Patrol Planning System for the U.S. Coast Guard [J].