An Exact Double-Oracle Algorithm for Zero-Sum Extensive-Form Games with Imperfect Information

被引：55

作者：

Bosansky, Branislav ^{[1
]}

Kiekintveld, Christopher ^{[2
]}

Lisy, Viliam ^{[1
]}

Pechoucek, Michal ^{[1
]}

机构：

[1] Czech Tech Univ, Fac Elect Engn, Dept Comp Sci, Agent Technol Ctr, CR-16635 Prague, Czech Republic

[2] Univ Texas El Paso, Dept Comp Sci, El Paso, TX 79968 USA

来源：

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH | 2014年 / 51卷

关键词：

EFFICIENT COMPUTATION; NASH EQUILIBRIA;

D O I：

10.1613/jair.4477

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Developing scalable solution algorithms is one of the central problems in computational game theory. We present an iterative algorithm for computing an exact Nash equilibrium for two-player zero-sum extensive-form games with imperfect information. Our approach combines two key elements: (1) the compact sequence-form representation of extensive-form games and (2) the algorithmic framework of double-oracle methods. The main idea of our algorithm is to restrict the game by allowing the players to play only selected sequences of available actions. After solving the restricted game, new sequences are added by finding best responses to the current solution using fast algorithms. We experimentally evaluate our algorithm on a set of games inspired by patrolling scenarios, board, and card games. The results show significant runtime improvements in games admitting an equilibrium with small support, and substantial improvement in memory use even on games with large support. The improvement in memory use is particularly important because it allows our algorithm to solve much larger game instances than existing linear programming methods. Our main contributions include (1) a generic sequence-form double-oracle algorithm for solving zero-sum extensive-form games; (2) fast methods for maintaining a valid restricted game model when adding new sequences; (3) a search algorithm and pruning methods for computing best-response sequences; (4) theoretical guarantees about the convergence of the algorithm to a Nash equilibrium; (5) experimental analysis of our algorithm on several games, including an approximate version of the algorithm.

引用

页码：829 / 866

页数：38

共 43 条

[1]

[Anonymous], 2009, IJCAI WORKSHOP GEN G

[2] Branch-and-price: Column generation for solving huge integer programs [J].

Barnhart, C ;

Johnson, EL ;

Nemhauser, GL ;

Savelsbergh, MWP ;

Vance, PH .

OPERATIONS RESEARCH, 1998, 46 (03) :316-329

[3] Iterative Algorithm for Solving Two-player Zero-sum Extensive-form Games with Imperfect Information [J].

Bosansky, Branislav ;

Kiekintveld, Christopher ;

Lisy, Viliam ;

Pechoucek, Michal .

20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 :193-+

[4]

Bosansky Branislav., 2013, Proceedings of International Conference on Autonomous Agents and Multiagent Systems (AAMAS), P335

[5] Practical Performance of Refinements of Nash Equilibria in Extensive-Form Zero-Sum Games [J].

Cermak, Jiri ;

Bosansky, Branislav ;

Lisy, Viliam .

21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 :201-+

[6] DECOMPOSITION PRINCIPLE FOR LINEAR-PROGRAMS [J].

DANTZIG, GB ;

WOLFE, P .

OPERATIONS RESEARCH, 1960, 8 (01) :101-111

[7]

Ganzfried S., 2013, COMP POK IMP INF WOR

[8]

Gibson Richard., 2012, Proceedings of the Twenty-Sixth Conference on Artificial Intelligence (AAAI), P1355

[9]

Halvorson E, 2009, 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, P159

[10] Smoothing Techniques for Computing Nash Equilibria of Sequential Games [J].

Hoda, Samid ;

Gilpin, Andrew ;

Pena, Javier ;

Sandholm, Tuomas .

MATHEMATICS OF OPERATIONS RESEARCH, 2010, 35 (02) :494-512

← 1 2 3 4 5 →