Reliability evaluation of reinforcement learning methods for mechanical systems with increasing complexity

被引:4
作者
Manzl, Peter [1 ]
Rogov, Oleg [2 ]
Gerstmayr, Johannes [1 ]
Mikkola, Aki [2 ]
Orzechowski, Grzegorz [2 ]
机构
[1] Univ Innsbruck, Dept Mechatron, Tech Str 13, A-6020 Innsbruck, Tyrol, Austria
[2] LUT Univ, Dept Mech Engn, Yliopistonkatu 34, Lappeenranta 53850, South Karelia, Finland
关键词
Reinforcement learning; Reliability analysis; Inverse pendulum; Machine learning; Dynamical systems; DYNAMICS;
D O I
10.1007/s11044-023-09960-2
中图分类号
O3 [力学];
学科分类号
08 ; 0801 ;
摘要
Reinforcement learning (RL) is one of the emerging fields of artificial intelligence (AI) intended for designing agents that take actions in the physical environment. RL has many vital applications, including robotics and autonomous vehicles. The key characteristic of RL is its ability to learn from experience without requiring direct programming or supervision. To learn, an agent interacts with an environment by acting and observing the resulting states and rewards. In most practical applications, an environment is implemented as a virtual system due to cost, time, and safety concerns. Simultaneously, multibody system dynamics (MSD) is a framework for efficiently and systematically developing virtual systems of arbitrary complexity. MSD is commonly used to create virtual models of robots, vehicles, machinery, and humans. The features of RL and MSD make them perfect companions in building sophisticated, automated, and autonomous mechatronic systems. The research demonstrates the use of RL in controlling multibody systems. While AI methods are used to solve some of the most challenging tasks in engineering, their proper understanding and implementation are demanding. Therefore, we introduce and detail three commonly used RL algorithms to control the inverted N-pendulum on the cart. Single-, double-, and triple-pendulum configurations are investigated, showing the capability of RL methods to handle increasingly complex dynamical systems. We show 2D state space zones where the agent succeeds or fails the stabilization. Despite passing randomized tests during training, blind spots may occur where the agent's policy fails. Results confirm that RL is a versatile, although complex, control engineering approach.
引用
收藏
页数:25
相关论文
共 50 条
  • [41] Opportunities for multiagent systems and multiagent reinforcement learning in traffic control
    Bazzan, Ana L. C.
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) : 342 - 375
  • [42] Settling the Horizon-Dependence of Sample Complexity in Reinforcement Learning
    Li, Yuanzhi
    Wang, Ruosong
    Yang, Lin F.
    2021 IEEE 62ND ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS 2021), 2022, : 965 - 976
  • [43] On the effects of domain size and complexity in empirical distribution of reinforcement learning
    Iwata, K
    Ikeda, K
    Sakai, H
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (01): : 135 - 142
  • [44] Increasing Motor Noise Impairs Reinforcement Learning in Healthy Individuals
    Therrien, Amanda S.
    Wolpert, Daniel M.
    Bastian, Amy J.
    ENEURO, 2018, 5 (03)
  • [45] ArtiSAN: navigating the complexity of material structures with deep reinforcement learning
    Elsborg, Jonas
    Bhowmik, Arghya
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (03):
  • [46] Increasing the Flexibility of Hydropower with Reinforcement Learning on a Digital Twin Platform
    Tubeuf, Carlotta
    Birkelbach, Felix
    Maly, Anton
    Hofmann, Rene
    ENERGIES, 2023, 16 (04)
  • [47] Discrete-to-deep reinforcement learning methods
    Kurniawan, Budi
    Vamplew, Peter
    Papasimeon, Michael
    Dazeley, Richard
    Foale, Cameron
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (03) : 1713 - 1733
  • [48] Principled Methods for Biasing Reinforcement Learning Agents
    Li, Zhi
    Hu, Kun
    Liu, Zengrong
    Yu, Xueli
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT II, 2011, 7003 : 703 - 709
  • [49] Enriching behavioral ecology with reinforcement learning methods
    Frankenhuis, Willem E.
    Panchanathan, Karthik
    Barto, Andrew G.
    BEHAVIOURAL PROCESSES, 2019, 161 : 94 - 100
  • [50] Characterizing reinforcement learning methods through parameterized learning problems
    Kalyanakrishnan, Shivaram
    Stone, Peter
    MACHINE LEARNING, 2011, 84 (1-2) : 205 - 247