Design of a reinforcement learning PID controller

被引:20
|
作者
Guan, Zhe [1 ]
Yamamoto, Toru [2 ]
机构
[1] Hiroshima Univ, Dream Driven CoCreat Res Ctr, KOBELCO Construct Machinery, 1-4-1 Kagamiyama, Higashihiroshima 7398527, Japan
[2] Hiroshima Univ, Acad Sci & Technol, 1-4-1 Kagamiyama, Higashihiroshima 7398527, Japan
关键词
reinforcement learning; PID control; Actor-Critic learning; RBF network; nonlinear system; NETWORKS;
D O I
10.1002/tee.23430
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses a design scheme of a proportional-integral-derivative (PID) controller with a new adaptive updating rule based on reinforcement learning (RL) approach for nonlinear systems. A new design scheme that RL can be used to complement the conventional PID control technology is presented. In the proposed scheme, a single radial basis function (RBF) network is considered to calculate the control policy function of Actor and the value function of Critic simultaneously. Regarding the PID controller structure, the inputs of RBF network are system errors, the difference of output as well as the second-order difference of output, and they are composed of system states. The temporal difference (TD) error in the proposed scheme involves the reinforcement signal, the current and the previous stored value of the value function. The gradient descent method is adopted based on the TD error performance index, then, the updating rules can be yielded. Therefore, the network weights and the kernel function can be calculated in an adaptive way. Finally, the numerical simulations are conducted in nonlinear systems to illustrate the efficiency and robustness of the proposed scheme. (c) 2021 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.
引用
收藏
页码:1354 / 1360
页数:7
相关论文
共 50 条
  • [31] Design of PID controller and fan wing controller for FanWing
    Meng, Lin
    Wang, Shuo
    Chen, Ye
    Gao, Yang
    AIP ADVANCES, 2021, 11 (09)
  • [32] LQR controller design for affine LPV systems using reinforcement learning
    Ranjbarpur, Hosein
    Atrianfar, Hajar
    Menhaj, Mohammad Bagher
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2024, 55 (09) : 1807 - 1819
  • [33] Genetic reinforcement learning through symbiotic evolution for fuzzy controller design
    Juang, CF
    Lin, CT
    1998 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AT THE IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE - PROCEEDINGS, VOL 1-2, 1998, : 1281 - 1285
  • [34] Controller Design of Tracking WMR System Based on Deep Reinforcement Learning
    Lee, Chin-Tan
    Sung, Wen-Tsai
    ELECTRONICS, 2022, 11 (06)
  • [35] Design of Reinforcement Learning based PI controller for nonlinear Multivariable System
    Kumar, Kranthi P.
    Detroja, Ketan P.
    2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023,
  • [36] Tiltrotors Position Tracking Controller Design Using Deep Reinforcement Learning
    Huo, Yujia
    Li, Yiping
    Feng, Xisheng
    2019 5TH INTERNATIONAL CONFERENCE ON MECHANICAL AND AERONAUTICAL ENGINEERING (ICMAE 2019), 2020, 751
  • [37] Controller Design for Electrical Drives by Deep Reinforcement Learning: A Proof of Concept
    Schenke, Maximilian
    Kirchgaessner, Wilhelm
    Wallscheid, Oliver
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (07) : 4650 - 4658
  • [38] Genetic reinforcement learning through symbiotic evolution for fuzzy controller design
    Juang, CF
    Lin, JY
    Lin, CT
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2000, 30 (02): : 290 - 302
  • [39] A practically implementable reinforcement learning-based process controller design
    Hassanpour, Hesam
    Wang, Xiaonian
    Corbett, Brandon
    Mhaskar, Prashant
    AICHE JOURNAL, 2024, 70 (01)
  • [40] Reinforcement-Learning-Based Controller Design for Nonaffine Nonlinear Systems
    Yang, Xiong
    Liu, Derong
    Wei, Qinglai
    ADVANCES IN NEURAL NETWORKS - ISNN 2014, 2014, 8866 : 51 - 58