Optimizing Reinforcement Learning Control Model in Furuta Pendulum and Transferring it to Real-World

Cited by: 1
Authors
Hong, Myung Rae [1 ]
Kang, Sanghun [1 ]
Lee, Jingoo [2 ]
Seo, Sungchul [3 ]
Han, Seungyong [1 ]
Koh, Je-Sung [1 ]
Kang, Daeshik [1 ]
Affiliations
[1] Ajou Univ, Dept Mech Engn, Multiscale Bioinspired Technol Lab, Suwon 16499, South Korea
[2] Korea Inst Machinery & Mat, Dept Sustainable Environm Res, Multiscale Bioinspired Technol Lab, Daejeon 34103, South Korea
[3] Seokyeong Univ, Dept Nanochem Biol & Environm Engn, Seoul 02713, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Furuta pendulum; inverted pendulum problem; reward design; reinforcement learning; Sim2Real;
DOI
10.1109/ACCESS.2023.3310405
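Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract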
Reinforcement learning does not require an explicit model of the robot because it learns directly from data, but it faces temporal and spatial constraints when applied in real-world environments. In this research, we trained a controller for the Furuta pendulum balancing problem, which is difficult to model, in a virtual environment (Unity) and transferred it to the real world. The challenge of the balancing problem is to keep the pendulum's end effector in a vertical position. We resolved the temporal and spatial constraints by performing reinforcement learning in the virtual environment. Furthermore, we designed a novel reward function that enabled faster and more stable problem-solving than two existing reward functions. We validated each reward function by applying it to soft actor-critic (SAC) and proximal policy optimization (PPO). The experimental results show that training with the cosine reward function is faster and more stable. Finally, the SAC model trained with the cosine reward function in the virtual environment serves as the optimized controller. Additionally, we evaluated the robustness of this model by transferring it to the real environment.
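The abstract names a cosine reward function but does not give its formula. As a minimal sketch only, one common way to shape such a reward is shown below. The function name `cosine_reward`, the convention that `theta` is the pendulum angle in radians measured from the upright position, and the absence of any scaling or penalty terms are all assumptions made for illustration, not the authors' definition.

```python
import math


def cosine_reward(theta: float) -> float:
    """Hypothetical cosine-shaped reward (an assumption, not the paper's formula).

    theta is assumed to be the pendulum angle in radians, measured from the
    upright position, so the reward is 1.0 when the pendulum is vertical and
    falls smoothly to -1.0 when it hangs straight down.
    """
    return math.cos(theta)


if __name__ == "__main__":
    # Print the reward at a few sample angles between upright and hanging.
    for degrees in (0, 15, 45, 90, 180):
        reward = cosine_reward(math.radians(degrees))
        print(f"{degrees:3d} deg: {reward:+.3f}")
```

A reward of this shape is smooth and symmetric about the upright position, so small deviations are penalized gently and large ones more strongly. Whether the paper's reward includes further terms, such as arm angle or velocity penalties, cannot be determined from the abstract.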
Pages: 95195-95200
Number of pages: 6