Reliability evaluation of reinforcement learning methods for mechanical systems with increasing complexity

Cited by: 4

Authors:

Manzl, Peter [1]

Rogov, Oleg [2]

Gerstmayr, Johannes [1]

Mikkola, Aki [2]

Orzechowski, Grzegorz [2]

Affiliations:

[1] Univ Innsbruck, Dept Mechatron, Tech Str 13, A-6020 Innsbruck, Tyrol, Austria

[2] LUT Univ, Dept Mech Engn, Yliopistonkatu 34, Lappeenranta 53850, South Karelia, Finland

Source:

MULTIBODY SYSTEM DYNAMICS | 2023

Keywords:

Reinforcement learning; Reliability analysis; Inverse pendulum; Machine learning; Dynamical systems; DYNAMICS;

DOI:

10.1007/s11044-023-09960-2

Chinese Library Classification:

O3 [Mechanics];

Discipline classification code:

08 ; 0801 ;

Abstract:

Reinforcement learning (RL) is one of the emerging fields of artificial intelligence (AI) intended for designing agents that take actions in the physical environment. RL has many vital applications, including robotics and autonomous vehicles. The key characteristic of RL is its ability to learn from experience without requiring direct programming or supervision. To learn, an agent interacts with an environment by acting and observing the resulting states and rewards. In most practical applications, an environment is implemented as a virtual system due to cost, time, and safety concerns. Simultaneously, multibody system dynamics (MSD) is a framework for efficiently and systematically developing virtual systems of arbitrary complexity. MSD is commonly used to create virtual models of robots, vehicles, machinery, and humans. The features of RL and MSD make them perfect companions in building sophisticated, automated, and autonomous mechatronic systems. The research demonstrates the use of RL in controlling multibody systems. While AI methods are used to solve some of the most challenging tasks in engineering, their proper understanding and implementation are demanding. Therefore, we introduce and detail three commonly used RL algorithms to control the inverted N-pendulum on the cart. Single-, double-, and triple-pendulum configurations are investigated, showing the capability of RL methods to handle increasingly complex dynamical systems. We show 2D state space zones where the agent succeeds or fails the stabilization. Despite passing randomized tests during training, blind spots may occur where the agent's policy fails. Results confirm that RL is a versatile, although complex, control engineering approach.


Pages: 25
