Hierarchical Monte Carlo Tree Search for Latent Skill Planning

被引:0
|
作者
Pei, Yue [1 ]
机构
[1] Univ Pittsburgh, Pittsburgh, PA 15213 USA
来源
2023 2ND ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING, CACML 2023 | 2023年
关键词
deep reinforcement learning; monte carlo tree search; REINFORCEMENT; GO;
D O I
10.1145/3590003.3590005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monte Carlo Tree Search (MCTS) continues to confront the issue of exponential complexity growth in certain tasks when the planning horizon is excessively long, causing the trajectory's past to grow exponentially. Our study presents Hierarchical MCTS Latent Skill Planner, an algorithm based on skill discovery that automatically identifies skills based on intrinsic rewards and integrates them with MCTS, enabling efficient decision-making at a higher level. In the grid world maze domain, we found that latent skill search outperformed the standard MCTS approach that do not contain skills in terms of efficiency and performance.
引用
收藏
页码:6 / 12
页数:7
相关论文
共 50 条
  • [1] The hierarchical task network planning method based on Monte Carlo Tree Search
    Shao, Tianhao
    Zhang, Hongjun
    Cheng, Kai
    Zhang, Ke
    Bie, Lin
    KNOWLEDGE-BASED SYSTEMS, 2021, 225
  • [2] Planning spatial networks with Monte Carlo tree search
    Darvariu, Victor-Alexandru
    Hailes, Stephen
    Musolesi, Mirco
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2023, 479 (2269):
  • [3] Decentralized Cooperative Planning for Automated Vehicles with Hierarchical Monte Carlo Tree Search
    Kurzer, Karl
    Zhou, Chenyang
    Zoellner, J. Marius
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 529 - 536
  • [4] Online model adaptation in Monte Carlo tree search planning
    Zuccotto, Maddalena
    Fusa, Edoardo
    Castellini, Alberto
    Farinelli, Alessandro
    OPTIMIZATION AND ENGINEERING, 2024,
  • [5] Monte Carlo Tree Search With Reinforcement Learning for Motion Planning
    Weingertner, Philippe
    Ho, Minnie
    Timofeev, Andrey
    Aubert, Sebastien
    Pita-Gil, Guillermo
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [6] Efficient Object Manipulation Planning with Monte Carlo Tree Search
    Zhu, Huaijiang
    Meduri, Avadesh
    Righetti, Ludovic
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 10628 - 10635
  • [7] Nonasymptotic Analysis of Monte Carlo Tree Search
    Shah, Devavrat
    Xie, Qiaomin
    Xu, Zhi
    OPERATIONS RESEARCH, 2022, 70 (06) : 3234 - 3260
  • [8] Text Matching with Monte Carlo Tree Search
    He, Yixuan
    Tao, Shuchang
    Xu, Jun
    Guo, Jiafeng
    Lan, YanYan
    Cheng, Xueqi
    INFORMATION RETRIEVAL, CCIR 2018, 2018, 11168 : 41 - 52
  • [9] Retrosynthetic planning with experience-guided Monte Carlo tree search
    Hong, Siqi
    Zhuo, Hankz Hankui
    Jin, Kebing
    Shao, Guang
    Zhou, Zhanwen
    COMMUNICATIONS CHEMISTRY, 2023, 6 (01)
  • [10] Monte Carlo Tree Search with Metaheuristics
    Mandziuk, Jacek
    Walczak, Patryk
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2023, PT II, 2023, 14126 : 134 - 144