Monte Carlo Tree Search with Metaheuristics

被引:0
作者
Mandziuk, Jacek [1 ]
Walczak, Patryk [1 ]
机构
[1] Warsaw Univ Technol, Warsaw, Poland
来源
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2023, PT II | 2023年 / 14126卷
关键词
Monte Carlo Tree Search; Upper Confidence bounds applied to Trees; Two-player games; Metaheuristics; ALGORITHM;
D O I
10.1007/978-3-031-42508-0_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monte Carlo Tree Search/Upper Confidence bounds applied to Trees (MCTS/UCT) is a popular and powerful search technique applicable to many domains, most frequently to searching game trees. Even though the algorithm has been widely researched, there is still room for its improvement, especially when combined with metaheuristics or machine learning methods. In this paper, we revise and experimentally evaluate the idea of enhancing MCTS/UCT with game-specific heuristics that guide the playout (simulation) phase. MCTS/UCT with the proposed guiding mechanism is tested on two popular board games: Othello and Hex. The enhanced method clearly defeats the well-known Alpha-beta pruning algorithm in both games, and for the more complex game (Othello) is highly competitive to the vanilla MCTS/UCT formulation.
引用
收藏
页码:134 / 144
页数:11
相关论文
共 18 条
[1]   MOHEX WINS HEX TOURNAMENT [J].
Arneson, Broderick ;
Hayward, Ryan ;
Henderson, Philip .
ICGA JOURNAL, 2009, 32 (02) :114-116
[2]  
Barratt J, 2019, Arxiv, DOI arXiv:1907.04658
[3]   CADIAPLAYER: A Simulation-Based General Game Player [J].
Bjornsson, Yngvi ;
Finnsson, Hilmar .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2009, 1 (01) :4-15
[4]   FUEGO-An Open-Source Framework for Board Games and Go Engine Based on Monte Carlo Tree Search [J].
Enzenberger, Markus ;
Mueller, Martin ;
Arneson, Broderick ;
Segal, Richard .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2010, 2 (04) :259-270
[5]  
Gelly S., 2007, P 24 INT C MACH LEAR, P273, DOI 10.1145/1273496.1273531
[6]   Monte-Carlo tree search and rapid action value estimation in computer Go [J].
Gelly, Sylvain ;
Silver, David .
ARTIFICIAL INTELLIGENCE, 2011, 175 (11) :1856-1875
[7]   A Monte Carlo Tree Search approach to finding efficient patrolling schemes on graphs [J].
Karwowski, Jan ;
Mandziuk, Jacek .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2019, 277 (01) :255-268
[8]   A New Approach to Security Games [J].
Karwowski, Jan ;
Mandziuk, Jacek .
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT II (ICAISC 2015), 2015, 9120 :402-411
[9]   Bandit based Monte-Carlo planning [J].
Kocsis, Levente ;
Szepesvari, Csaba .
MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 :282-293
[10]   Multi-Population-Based Algorithm with an Exchange of Training Plans Based on Population Evaluation [J].
Lapa, Krystian ;
Cpalka, Krzysztof ;
Kisiel-Dorohinicki, Marek ;
Paszkowski, Jozef ;
Debski, Maciej ;
Le, Van-Hung .
JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2022, 12 (04) :239-253