Hierarchical Monte Carlo Tree Search for Latent Skill Planning

Citations: 0
Authors
Pei, Yue [1]
Affiliation
[1] Univ Pittsburgh, Pittsburgh, PA 15213 USA
Source
2023 2ND ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING, CACML 2023 | 2023
Keywords
deep reinforcement learning; monte carlo tree search; REINFORCEMENT; GO;
DOI
10.1145/3590003.3590005
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Monte Carlo Tree Search (MCTS) still faces exponential growth in complexity on tasks with very long planning horizons, because the number of possible trajectory histories grows exponentially with the horizon. Our study presents the Hierarchical MCTS Latent Skill Planner, a skill-discovery algorithm that automatically identifies skills from intrinsic rewards and integrates them with MCTS, enabling efficient decision-making at a higher level. In the grid-world maze domain, we found that latent skill search outperformed standard MCTS without skills in both efficiency and performance.
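The scaling argument in the abstract can be illustrated with a small arithmetic sketch. This is not the paper's algorithm: it only counts leaves of a full search tree, and the function names, the 4-action grid world, the 20-step horizon, and the fixed 5-step skill duration are all assumptions chosen for illustration.

```python
def primitive_tree_leaves(num_actions: int, horizon: int) -> int:
    """Leaves of a full search tree that branches on every primitive action."""
    return num_actions ** horizon


def skill_tree_leaves(num_skills: int, horizon: int, skill_length: int) -> int:
    """Leaves of a full search tree that branches once per skill.

    Assumes (hypothetically) that every skill runs for exactly
    `skill_length` primitive steps, so the tree only needs
    ceil(horizon / skill_length) decision levels.
    """
    decisions = -(-horizon // skill_length)  # ceiling division
    return num_skills ** decisions


# Assumed setting: 4 moves (up/down/left/right), horizon of 20 steps,
# and 4 hypothetical skills that each last 5 steps.
primitive = primitive_tree_leaves(num_actions=4, horizon=20)
skilled = skill_tree_leaves(num_skills=4, horizon=20, skill_length=5)

print(primitive)  # 4 ** 20
print(skilled)    # 4 ** 4
```

Under these assumptions, planning over skills shrinks the tree depth from 20 levels to 4, which is the kind of reduction that searching at a higher level is meant to provide.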
Pages: 6-12
Page count: 7