Incentive Learning in Monte Carlo Tree Search

Cited by: 3
Authors
Kao, Kuo-Yuan [1]
Wu, I-Chen [2]
Yen, Shi-Jim [3]
Shan, Yi-Chang [2]
Affiliations
[1] Natl Penghu Univ, Dept Informat Management, Magong City 880, Taiwan
[2] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu 30050, Taiwan
[3] Natl Dong Hwa Univ, Dept Comp Sci & Informat Engn, Hualien 974, Taiwan
Funding
US National Science Foundation;
Keywords
Artificial intelligence; combinatorial games; computational intelligence; computer games; reinforcement learning;
DOI
10.1109/TCIAIG.2013.2248086
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Monte Carlo tree search (MCTS) is a search paradigm that has been remarkably successful in computer games like Go. It uses Monte Carlo simulation to evaluate the values of nodes in a search tree. The node values are then used to select the actions during subsequent simulations. The performance of MCTS heavily depends on the quality of its default policy, which guides the simulations beyond the search tree. In this paper, we propose an MCTS improvement, called incentive learning, which learns the default policy online. This new default policy learning scheme is based on ideas from combinatorial game theory, and hence is particularly useful when the underlying game is a sum of games. To illustrate the efficiency of incentive learning, we describe a game named Heap-Go and present experimental results on the game.
Pages: 346 - 352
Page count: 7
Related papers
50 in total
  • [11] Reinforcement learning for active distribution network planning based on Monte Carlo tree search
    Zhang, Xi
    Hua, Weiqi
    Liu, Youbo
    Duan, Jiajun
    Tang, Zhiyuan
    Liu, Junyong
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2022, 138
  • [12] Integrating Reinforcement Learning and Monte Carlo Tree Search for enhanced neoantigen vaccine design
    Lin, Yicheng
    Ma, Jiakang
    Yuan, Haozhe
    Chen, Ziqiang
    Xu, Xingyu
    Jiang, Mengping
    Zhu, Jialiang
    Meng, Weida
    Qiu, Wenqing
    Liu, Yun
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)
  • [13] Tensor Implementation of Monte-Carlo Tree Search for Model-Based Reinforcement Learning
    Balaz, Marek
    Tarabek, Peter
    APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [14] SpecMCTS: Accelerating Monte Carlo Tree Search Using Speculative Tree Traversal
    Kim, Juhwan
    Kang, Byeongmin
    Cho, Hyungmin
    IEEE ACCESS, 2021, 9 : 142195 - 142205
  • [15] Can Monte-Carlo Tree Search learn to sacrifice?
    Nathan Companez
    Aldeida Aleti
    Journal of Heuristics, 2016, 22 : 783 - 813
  • [16] Monte Carlo Tree Search as an intelligent search tool in structural design problems
    Rossi, Leonardo
    Winands, Mark H. M.
    Butenweg, Christoph
    ENGINEERING WITH COMPUTERS, 2022, 38 (04) : 3219 - 3236
  • [17] Monte Carlo Tree Search as an intelligent search tool in structural design problems
    Leonardo Rossi
    Mark H. M. Winands
    Christoph Butenweg
    Engineering with Computers, 2022, 38 : 3219 - 3236
  • [18] Can Monte-Carlo Tree Search learn to sacrifice?
    Companez, Nathan
    Aleti, Aldeida
    JOURNAL OF HEURISTICS, 2016, 22 (06) : 783 - 813
  • [19] Improved Monte Carlo Tree Search for Virtual Network Embedding
    Elkael, Maxime
    Castel-Taleb, Hind
    Jouaber, Badii
    Araldo, Andrea
    Aba, Massinissa Ait
    PROCEEDINGS OF THE IEEE 46TH CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN 2021), 2021 : 605 - 612
  • [20] Routing optimization with Monte Carlo Tree Search-based multi-agent reinforcement learning
    Wang, Qi
    Hao, Yongsheng
    APPLIED INTELLIGENCE, 2023, 53 (21) : 25881 - 25896