Beyond games: a systematic review of neural Monte Carlo tree search applications

被引:0
作者
Marco Kemmerling
Daniel Lütticke
Robert H. Schmitt
机构
[1] RWTH Aachen University,Information Management in Mechanical Engineering (WZL
来源
Applied Intelligence | 2024年 / 54卷
关键词
Monte carlo tree search; MCTS; Neural monte carlo tree search; Reinforcement learning; Model-based reinforcement learning; Decision-time planning;
D O I
暂无
中图分类号
学科分类号
摘要
The advent of AlphaGo and its successors marked the beginning of a new paradigm in playing games using artificial intelligence. This was achieved by combining Monte Carlo tree search, a planning procedure, and deep learning. While the impact on the domain of games has been undeniable, it is less clear how useful similar approaches are in applications beyond games and how they need to be adapted from the original methodology. We perform a systematic literature review of peer-reviewed articles detailing the application of neural Monte Carlo tree search methods in domains other than games. Our goal is to systematically assess how such methods are structured in practice and if their success can be extended to other domains. We find applications in a variety of domains, many distinct ways of guiding the tree search using learned policy and value functions, and various training methods. Our review maps the current landscape of algorithms in the family of neural monte carlo tree search as they are applied to practical problems, which is a first step towards a more principled way of designing such algorithms for specific problems and their requirements.
引用
收藏
页码:1020 / 1046
页数:26
相关论文
共 363 条
[1]  
Al-Saffar M(2020)Reinforcement learning-based distributed BESS management for mitigating overvoltage issues in systems with high PV penetration IEEE Trans Smart Grid 11 2980-2994
[2]  
Musilek P(2019)Deep neural network and Monte Carlo tree search applied to fluid-structure topology optimization Sci Rep 9 15916-256
[3]  
Audrey G(2002)Finite-time analysis of the multiarmed bandit problem Mach Learn 47 235-43
[4]  
Francesco M(2022)Hierarchical policy with deep-reinforcement learning for nonprehensile multiobject rearrangement Biomim Intell Robot 2 100047-35971
[5]  
Auer P(2012)A survey of Monte Carlo tree search methods IEEE Trans Comput Intell AI Games 4 1-443
[6]  
Cesa-Bianchi N(2020)Dialogue management in conversational agents through psychology of persuasion and machine learning Multimed Tools Appl 79 35949-7024
[7]  
Fischer P(2020)An intelligent task offloading algorithm (iTOA) for UAV edge computing network Digit Commun Netw 6 433-372
[8]  
Bai F(2019)iRAF: A deep reinforcement learning approach for collaborative mobile edge computing IoT networks IEEE Internet Things J 6 7011-7158
[9]  
Meng F(2021)iPAS: A deep Monte Carlo Tree Search-based intelligent pilot-power allocation scheme for massive MIMO system Digit Commun Netw 7 362-278
[10]  
Liu J(2020)Driving maneuvers prediction based autonomous driving control by deep Monte Carlo tree search IEEE Trans Veh Technol 69 7146-9