Design of a reinforcement learning PID controller

被引:20
|
作者
Guan, Zhe [1 ]
Yamamoto, Toru [2 ]
机构
[1] Hiroshima Univ, Dream Driven CoCreat Res Ctr, KOBELCO Construct Machinery, 1-4-1 Kagamiyama, Higashihiroshima 7398527, Japan
[2] Hiroshima Univ, Acad Sci & Technol, 1-4-1 Kagamiyama, Higashihiroshima 7398527, Japan
关键词
reinforcement learning; PID control; Actor-Critic learning; RBF network; nonlinear system; NETWORKS;
D O I
10.1002/tee.23430
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses a design scheme of a proportional-integral-derivative (PID) controller with a new adaptive updating rule based on reinforcement learning (RL) approach for nonlinear systems. A new design scheme that RL can be used to complement the conventional PID control technology is presented. In the proposed scheme, a single radial basis function (RBF) network is considered to calculate the control policy function of Actor and the value function of Critic simultaneously. Regarding the PID controller structure, the inputs of RBF network are system errors, the difference of output as well as the second-order difference of output, and they are composed of system states. The temporal difference (TD) error in the proposed scheme involves the reinforcement signal, the current and the previous stored value of the value function. The gradient descent method is adopted based on the TD error performance index, then, the updating rules can be yielded. Therefore, the network weights and the kernel function can be calculated in an adaptive way. Finally, the numerical simulations are conducted in nonlinear systems to illustrate the efficiency and robustness of the proposed scheme. (c) 2021 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.
引用
收藏
页码:1354 / 1360
页数:7
相关论文
共 50 条
  • [41] A study of controller design using an on-line evolutionary Reinforcement Learning
    Kondo, T
    Ito, K
    8TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING, VOLS 1-3, PROCEEDING, 2001, : 665 - 670
  • [42] Design and implementation of an adaptive PID controller using single neuron learning algorithm
    Sheng, Q
    Zhuang, XY
    Wang, CH
    Gao, XZ
    Liu, ZL
    PROCEEDINGS OF THE 4TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-4, 2002, : 2279 - 2283
  • [43] A Reinforcement Learning PID Compound Protocol Design Method for Ultrasonic Motor Control
    Wang, Qiang
    Yang, Yunlong
    Wu, Xiongjun
    Wang, Jinqi
    Liu, Guanyu
    2024 AUSTRALIAN & NEW ZEALAND CONTROL CONFERENCE, ANZCC, 2024, : 125 - 130
  • [44] PID controller design for robust performance
    Ho, MT
    Lin, CY
    PROCEEDINGS OF THE 41ST IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 2002, : 1062 - 1067
  • [45] Fuzzy intervention in PID controller design
    Gharieb, W
    Nagib, G
    ISIE 2001: IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS PROCEEDINGS, VOLS I-III, 2001, : 1639 - 1643
  • [46] PID controller design with an H∞ criterion
    Han, Sangjin
    Keel, Lee H.
    Bhattacharyya, Shankar P.
    IFAC PAPERSONLINE, 2018, 51 (04): : 400 - 405
  • [47] On the design of process fuzzy PID controller
    Ben Brahim, Mohamed Amine
    Abdelkrim, Afef
    Benrejeb, Mohamed
    2014 INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2014, : 483 - 487
  • [48] PID controller design based on memductor
    Sanchez-Lopez, C.
    Carbajal-Gomez, V. H.
    Carrasco-Aguilar, M. A.
    Morales-Lopez, F. E.
    AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2019, 101 : 9 - 14
  • [49] Robust PID controller design for hydroturbines
    Natarajan, K
    IEEE TRANSACTIONS ON ENERGY CONVERSION, 2005, 20 (03) : 661 - 667
  • [50] Robust Interval PID Controller Design
    Osusky, Jakub
    Kozakova, Alena
    Rosinova, Danica
    PROCEEDINGS OF THE 2020 30TH INTERNATIONAL CONFERENCE CYBERNETICS & INFORMATICS (K&I '20), 2020,