Monte Carlo tree search control scheme for multibody dynamics applications

被引:0
作者
Tang, Yixuan [1 ]
Orzechowski, Grzegorz [1 ]
Prokop, Ales [2 ]
Mikkola, Aki [1 ]
机构
[1] LUT Univ, Dept Mech Engn, Lappeenranta 53850, Finland
[2] Brno Univ Technol, Fac Mech Engn, Technicka 2896-2, Brno 61669, Czech Republic
关键词
Monte Carlo Tree Search; Multibody dynamics; Reward functions; Parametric analysis; Artificial intelligence control; Inverted pendulum; DOUBLE PENDULUM; SWING-UP; STRATEGY; GAME; GO;
D O I
10.1007/s11071-024-09509-8
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
There is considerable interest in applying reinforcement learning (RL) to improve machine control across multiple industries, and the automotive industry is one of the prime examples. Monte Carlo Tree Search (MCTS) has emerged and proven powerful in decision-making games, even without understanding the rules. In this study, multibody system dynamics (MSD) control is first modeled as a Markov Decision Process and solved with Monte Carlo Tree Search. Based on randomized search space exploration, the MCTS framework builds a selective search tree by repeatedly applying a Monte Carlo rollout at each child node. However, without a library of available choices, deciding among the many possibilities for agent parameters can be intimidating. In addition, the MCTS poses a significant challenge for searching due to the large branching factor. This challenge is typically overcome by appropriate parameter design, search guiding, action reduction, parallelization, and early termination. To address these shortcomings, the overarching goal of this study is to provide needed insight into inverted pendulum controls via vanilla and modified MCTS agents, respectively. A series of reward functions are well-designed according to the control goal, which maps a specific distribution shape of reward bonus and guides the MCTS-based control to maintain the upright position. Numerical examples show that the reward-modified MCTS algorithms significantly improve the control performance and robustness of the default choice of a constant reward that constitutes the vanilla MCTS. The exponentially decaying reward functions perform better than the constant value or polynomial reward functions. Moreover, the exploitation vs. exploration trade-off and discount parameters are carefully tested. The study's results can guide the research of RL-based MSD users.
引用
收藏
页码:8363 / 8391
页数:29
相关论文
共 50 条
  • [41] A Survey of Monte Carlo Tree Search Methods
    Browne, Cameron B.
    Powley, Edward
    Whitehouse, Daniel
    Lucas, Simon M.
    Cowling, Peter I.
    Rohlfshagen, Philipp
    Tavener, Stephen
    Perez, Diego
    Samothrakis, Spyridon
    Colton, Simon
    IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) : 1 - 43
  • [42] Interpretability of rectangle packing solutions with Monte Carlo tree search
    Lopez, Yeray Galan
    Garcia, Cristian Gonzalez
    Diaz, Vicente Garcia
    Valdez, Edward Rolando Nunez
    Gomez, Alberto Gomez
    JOURNAL OF HEURISTICS, 2024, 30 (3-4) : 173 - 198
  • [43] Monte-Carlo tree search as regularized policy optimization
    Grill, Jean-Bastien
    Altche, Florent
    Tang, Yunhao
    Hubert, Thomas
    Valko, Michal
    Antonoglou, Ioannis
    Munos, Remi
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [44] A Monte Carlo tree search approach to learning decision trees
    Nunes, Cecilia
    De Craene, Mathieu
    Langet, Helene
    Camara, Oscar
    Jonsson, Anders
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 429 - 435
  • [45] Multiple Pass Monte Carlo Tree Search
    McGuinness, Cameron
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 1555 - 1561
  • [46] Parallel Monte-Carlo Tree Search for HPC Systems
    Graf, Tobias
    Lorenz, Ulf
    Platzner, Marco
    Schaefers, Lars
    EURO-PAR 2011 PARALLEL PROCESSING, PT 2, 2011, 6853 : 365 - 376
  • [47] An Efficient Optimization of the Monte Carlo Tree Search Algorithm for Amazons
    Zhang, Lijun
    Zou, Han
    Zhu, Yungang
    ALGORITHMS, 2024, 17 (08)
  • [48] Hierarchical Monte Carlo Tree Search for Latent Skill Planning
    Pei, Yue
    2023 2ND ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING, CACML 2023, 2023, : 6 - 12
  • [49] Monte Carlo Tree Search for Love Letter
    Omarov, Tamirlan
    Aslam, Hamna
    Brown, Joseph Alexander
    Reading, Elizabeth
    19TH INTERNATIONAL CONFERENCE ON INTELLIGENT GAMES AND SIMULATION (GAME-ON(R) 2018), 2018, : 10 - 15
  • [50] Classification of Monte Carlo Tree Search Variants
    McGuinness, Cameron
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 357 - 363