Reliability evaluation of reinforcement learning methods for mechanical systems with increasing complexity

被引:4
作者
Manzl, Peter [1 ]
Rogov, Oleg [2 ]
Gerstmayr, Johannes [1 ]
Mikkola, Aki [2 ]
Orzechowski, Grzegorz [2 ]
机构
[1] Univ Innsbruck, Dept Mechatron, Tech Str 13, A-6020 Innsbruck, Tyrol, Austria
[2] LUT Univ, Dept Mech Engn, Yliopistonkatu 34, Lappeenranta 53850, South Karelia, Finland
关键词
Reinforcement learning; Reliability analysis; Inverse pendulum; Machine learning; Dynamical systems; DYNAMICS;
D O I
10.1007/s11044-023-09960-2
中图分类号
O3 [力学];
学科分类号
08 ; 0801 ;
摘要
Reinforcement learning (RL) is one of the emerging fields of artificial intelligence (AI) intended for designing agents that take actions in the physical environment. RL has many vital applications, including robotics and autonomous vehicles. The key characteristic of RL is its ability to learn from experience without requiring direct programming or supervision. To learn, an agent interacts with an environment by acting and observing the resulting states and rewards. In most practical applications, an environment is implemented as a virtual system due to cost, time, and safety concerns. Simultaneously, multibody system dynamics (MSD) is a framework for efficiently and systematically developing virtual systems of arbitrary complexity. MSD is commonly used to create virtual models of robots, vehicles, machinery, and humans. The features of RL and MSD make them perfect companions in building sophisticated, automated, and autonomous mechatronic systems. The research demonstrates the use of RL in controlling multibody systems. While AI methods are used to solve some of the most challenging tasks in engineering, their proper understanding and implementation are demanding. Therefore, we introduce and detail three commonly used RL algorithms to control the inverted N-pendulum on the cart. Single-, double-, and triple-pendulum configurations are investigated, showing the capability of RL methods to handle increasingly complex dynamical systems. We show 2D state space zones where the agent succeeds or fails the stabilization. Despite passing randomized tests during training, blind spots may occur where the agent's policy fails. Results confirm that RL is a versatile, although complex, control engineering approach.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] Symbolic Regression Methods for Reinforcement Learning
    Kubalik, Jiri
    Derner, Erik
    Zegklitz, Jan
    Babuska, Robert
    IEEE ACCESS, 2021, 9 : 139697 - 139711
  • [32] Genetic Programming Methods for Reinforcement Learning
    Babuska, Robert
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'19), 2019, : 2 - 2
  • [33] Bayesian Nonparametric Methods for Partially-Observable Reinforcement Learning
    Doshi-Velez, Finale
    Pfau, David
    Wood, Frank
    Roy, Nicholas
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (02) : 394 - 407
  • [34] Applications of Reinforcement Learning for maintenance of engineering systems: A review
    Marugan, Alberto Pliego
    ADVANCES IN ENGINEERING SOFTWARE, 2023, 183
  • [35] Why Reinforcement Learning in Energy Systems Needs Explanations
    Butt, Hallah Shahid
    Schaefer, Benjamin
    PROCEEDINGS OF THE 2024 WORKSHOP ON EXPLAINABILITY ENGINEERING, EXEN 2024, 2024, : 26 - 30
  • [36] Beyond Reinforcement Learning and Local View in Multiagent Systems
    Bazzan, Ana L. C.
    KUNSTLICHE INTELLIGENZ, 2014, 28 (03): : 179 - 189
  • [37] Nonstrict Hierarchical Reinforcement Learning for Interactive Systems and Robots
    Cuayahuitl, Heriberto
    Kruijff-Korbayova, Ivana
    Dethlefs, Nina
    ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2014, 4 (03)
  • [38] Reachability Verification Based Reliability Assessment for Deep Reinforcement Learning Controlled Robotics and Autonomous Systems
    Dong, Yi
    Zhao, Xingyu
    Wang, Sen
    Huang, Xiaowei
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (04) : 3299 - 3306
  • [39] Test-retest reliability of reinforcement learning parameters
    Schaaf, Jessica V.
    Weidinger, Laura
    Molleman, Lucas
    van den Bos, Wouter
    BEHAVIOR RESEARCH METHODS, 2024, 56 (05) : 4582 - 4599
  • [40] Deep reinforcement learning in production systems: a systematic literature review
    Panzer, Marcel
    Bender, Benedict
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2022, 60 (13) : 4316 - 4341