Multiagent Monte Carlo Tree Search

被引:0
作者
Zerbel, Nicholas [1 ]
Yliniemi, Logan [2 ]
机构
[1] Oregon State Univ, Corvallis, OR 97331 USA
[2] Amazon Robot, Boston, MA USA
来源
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS | 2019年
关键词
Multiagent Learning; Difference Evaluations; Monte Carlo Tree Search;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Monte Carlo Tree Search (MCTS) is a best-first search which is efficient in large search spaces and is effective at balancing exploration versus exploitation. In this work, we introduce a novel extension for MCTS, called Multiagent Monte Carlo Tree Search (MAMCTS), which pairs MCTS with difference evaluations. We demonstrate the performance of MAMCTS in a cooperative, multiagent path-planning domain called Multiagent Gridworld. We show that MAMCTS using difference evaluations outperforms MAMCTS using local rewards by up to 31.4% and MAMCTS using the global reward by up to 88.9% for a system with 1,000 agents.
引用
收藏
页码:2309 / 2311
页数:3
相关论文
共 13 条
[1]   Efficient evaluation functions for evolving coordination [J].
Agogino, A. ;
Tumer, K. .
EVOLUTIONARY COMPUTATION, 2008, 16 (02) :257-288
[2]  
[Anonymous], 2005, AUTONOMOUS AGENTS AN
[3]   A Survey of Monte Carlo Tree Search Methods [J].
Browne, Cameron B. ;
Powley, Edward ;
Whitehouse, Daniel ;
Lucas, Simon M. ;
Cowling, Peter I. ;
Rohlfshagen, Philipp ;
Tavener, Stephen ;
Perez, Diego ;
Samothrakis, Spyridon ;
Colton, Simon .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) :1-43
[4]  
Chaslot Guillaume, AIIDE, V8
[5]  
Colby M, 2015, IEEE INT C INT ROBOT, P5168, DOI 10.1109/IROS.2015.7354105
[6]  
Devlin S, 2014, AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, P165
[7]  
Gelly S., 2008, AAAI, V8, P1537
[8]  
Ponsen Marc, 2010, AAAI C ART INT
[9]  
Robles D, 2011, IEEE CONF COMPU INTE, P305, DOI 10.1109/CIG.2011.6032021
[10]   A SHOGI PROGRAM BASED ON MONTE-CARLO TREE SEARCH [J].
Sato, Yoshikuni ;
Takahashi, Daisuke ;
Grimbergen, Reijer .
ICGA JOURNAL, 2010, 33 (02) :80-92