Incentive Learning in Monte Carlo Tree Search

Cited by: 3

Authors:

Kao, Kuo-Yuan [1]

Wu, I-Chen [2]

Yen, Shi-Jim [3]

Shan, Yi-Chang [2]

Affiliations:

[1] Natl Penghu Univ, Dept Informat Management, Magong City 880, Taiwan

[2] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu 30050, Taiwan

[3] Natl Dong Hwa Univ, Dept Comp Sci & Informat Engn, Hualien 974, Taiwan

Source:

IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES | 2013 / Vol. 5 / No. 4

Funding:

U.S. National Science Foundation;

Keywords:

Artificial intelligence; combinatorial games; computational intelligence; computer games; reinforcement learning;

DOI:

10.1109/TCIAIG.2013.2248086

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Monte Carlo tree search (MCTS) is a search paradigm that has been remarkably successful in computer games like Go. It uses Monte Carlo simulation to evaluate the values of nodes in a search tree. The node values are then used to select the actions during subsequent simulations. The performance of MCTS heavily depends on the quality of its default policy, which guides the simulations beyond the search tree. In this paper, we propose an MCTS improvement, called incentive learning, which learns the default policy online. This new default policy learning scheme is based on ideas from combinatorial game theory, and hence is particularly useful when the underlying game is a sum of games. To illustrate the efficiency of incentive learning, we describe a game named Heap-Go and present experimental results on the game.
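The roles the abstract describes (a search tree whose node values steer selection, plus a default policy that plays out positions beyond the tree) can be illustrated with a minimal sketch. This is not the paper's incentive learning scheme, and the game is not Heap-Go, whose rules the abstract does not give. It assumes a toy single-heap Nim variant (take 1 to 3 stones, the player who takes the last stone wins). All names here (`Node`, `rollout`, `mcts`, `random_policy`, `finishing_policy`) are hypothetical.

```python
import math
import random

def legal_moves(stones):
    # Toy game (assumption): remove 1-3 stones; taking the last stone wins.
    return [m for m in (1, 2, 3) if m <= stones]

def random_policy(stones, rng):
    # Uniform random default policy.
    return rng.choice(legal_moves(stones))

def finishing_policy(stones, rng):
    # Hand-written default policy: take every stone when that wins at once.
    return stones if stones <= 3 else rng.choice(legal_moves(stones))

def rollout(stones, policy, rng):
    # Play out with the default policy; True if the player to move wins.
    turn = 0
    while stones > 0:
        stones -= policy(stones, rng)
        if stones == 0:
            return turn == 0
        turn ^= 1
    return False  # no stones left: the player to move has already lost

class Node:
    def __init__(self, stones, parent=None, move=None):
        self.stones, self.parent, self.move = stones, parent, move
        self.children, self.untried = [], legal_moves(stones)
        self.visits, self.wins = 0, 0.0  # wins for the player who moved here

def uct_select(node, c=1.4):
    # Pick the child that maximises value plus an exploration bonus.
    return max(node.children, key=lambda ch: ch.wins / ch.visits
               + c * math.sqrt(math.log(node.visits) / ch.visits))

def mcts(root_stones, iterations, policy=random_policy, seed=0):
    rng = random.Random(seed)
    root = Node(root_stones)
    for _ in range(iterations):
        node = root
        while not node.untried and node.children:      # selection
            node = uct_select(node)
        if node.untried:                               # expansion
            move = node.untried.pop()
            child = Node(node.stones - move, parent=node, move=move)
            node.children.append(child)
            node = child
        to_move_wins = rollout(node.stones, policy, rng)  # simulation
        reward = 0.0 if to_move_wins else 1.0
        while node is not None:                        # backpropagation
            node.visits += 1
            node.wins += reward
            reward = 1.0 - reward                      # players alternate
            node = node.parent
    return max(root.children, key=lambda ch: ch.visits).move
```

Calling `mcts(5, 2000)` searches a five-stone heap with the random default policy, while `mcts(5, 2000, policy=finishing_policy)` swaps in the hand-written one. The `policy` argument is the hook where an online-learned default policy, such as the one the paper proposes, would be plugged in.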


Pages: 346-352

Page count: 7
