Incentive Learning in Monte Carlo Tree Search

被引:3
作者
Kao, Kuo-Yuan [1 ]
Wu, I-Chen [2 ]
Yen, Shi-Jim [3 ]
Shan, Yi-Chang [2 ]
机构
[1] Natl Penghu Univ, Dept Informat Management, Magong City 880, Taiwan
[2] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu 30050, Taiwan
[3] Natl Dong Hwa Univ, Dept Comp Sci & Informat Engn, Hualien 974, Taiwan
基金
美国国家科学基金会;
关键词
Artificial intelligence; combinatorial games; computational intelligence; computer games; reinforcement learning;
D O I
10.1109/TCIAIG.2013.2248086
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monte Carlo tree search (MCTS) is a search paradigm that has been remarkably successful in computer games like Go. It uses Monte Carlo simulation to evaluate the values of nodes in a search tree. The node values are then used to select the actions during subsequent simulations. The performance of MCTS heavily depends on the quality of its default policy, which guides the simulations beyond the search tree. In this paper, we propose an MCTS improvement, called incentive learning, which learns the default policy online. This new default policy learning scheme is based on ideas from combinatorial game theory, and hence is particularly useful when the underlying game is a sum of games. To illustrate the efficiency of incentive learning, we describe a game named Heap-Go and present experimental results on the game.
引用
收藏
页码:346 / 352
页数:7
相关论文
共 50 条
[11]   Reinforcement learning for active distribution network planning based on Monte Carlo tree search [J].
Zhang, Xi ;
Hua, Weiqi ;
Liu, Youbo ;
Duan, Jiajun ;
Tang, Zhiyuan ;
Liu, Junyong .
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2022, 138
[12]   Integrating Reinforcement Learning and Monte Carlo Tree Search for enhanced neoantigen vaccine design [J].
Lin, Yicheng ;
Ma, Jiakang ;
Yuan, Haozhe ;
Chen, Ziqiang ;
Xu, Xingyu ;
Jiang, Mengping ;
Zhu, Jialiang ;
Meng, Weida ;
Qiu, Wenqing ;
Liu, Yun .
BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)
[13]   Tensor Implementation of Monte-Carlo Tree Search for Model-Based Reinforcement Learning [J].
Balaz, Marek ;
Tarabek, Peter .
APPLIED SCIENCES-BASEL, 2023, 13 (03)
[14]   SpecMCTS: Accelerating Monte Carlo Tree Search Using Speculative Tree Traversal [J].
Kim, Juhwan ;
Kang, Byeongmin ;
Cho, Hyungmin .
IEEE ACCESS, 2021, 9 :142195-142205
[15]   Can Monte-Carlo Tree Search learn to sacrifice? [J].
Nathan Companez ;
Aldeida Aleti .
Journal of Heuristics, 2016, 22 :783-813
[16]   Can Monte-Carlo Tree Search learn to sacrifice? [J].
Companez, Nathan ;
Aleti, Aldeida .
JOURNAL OF HEURISTICS, 2016, 22 (06) :783-813
[17]   Improved Monte Carlo Tree Search for Virtual Network Embedding [J].
Elkael, Maxime ;
Castel-Taleb, Hind ;
Jouaber, Badii ;
Araldo, Andrea ;
Aba, Massinissa Ait .
PROCEEDINGS OF THE IEEE 46TH CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN 2021), 2021, :605-612
[18]   Monte Carlo Tree Search as an intelligent search tool in structural design problems [J].
Rossi, Leonardo ;
Winands, Mark H. M. ;
Butenweg, Christoph .
ENGINEERING WITH COMPUTERS, 2022, 38 (04) :3219-3236
[19]   Monte Carlo Tree Search as an intelligent search tool in structural design problems [J].
Leonardo Rossi ;
Mark H. M. Winands ;
Christoph Butenweg .
Engineering with Computers, 2022, 38 :3219-3236
[20]   Routing optimization with Monte Carlo Tree Search-based multi-agent reinforcement learning [J].
Wang, Qi ;
Hao, Yongsheng .
APPLIED INTELLIGENCE, 2023, 53 (21) :25881-25896