Data-Driven Model-Free Tracking Reinforcement Learning Control with VRFT-based Adaptive Actor-Critic

被引:35
作者
Radac, Mircea-Bogdan [1 ]
Precup, Radu-Emil [1 ]
机构
[1] Politehn Univ Timisoara, Dept Automat & Appl Informat, Timisoara 300006, Romania
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 09期
关键词
adaptive actor-critic; model-free control; data-driven control; reinforcement learning; approximate dynamic programming; output reference model tracking; multi-input multi-output systems; vertical tank systems; Virtual Reference Feedback Tuning; CONJUGATE-GRADIENT ALGORITHM; TRAJECTORY TRACKING; NEURAL-NETWORKS; CONTROL DESIGN; SYSTEMS; FEEDBACK; ITERATION;
D O I
10.3390/app9091807
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper proposes a neural network (NN)-based control scheme in an Adaptive Actor-Critic (AAC) learning framework designed for output reference model tracking, as a representative deep-learning application. The control learning scheme is model-free with respect to the process model. AAC designs usually require an initial controller to start the learning process; however, systematic guidelines for choosing the initial controller are not offered in the literature, especially in a model-free manner. Virtual Reference Feedback Tuning (VRFT) is proposed for obtaining an initially stabilizing NN nonlinear state-feedback controller, designed from input-state-output data collected from the process in open-loop setting. The solution offers systematic design guidelines for initial controller design. The resulting suboptimal state-feedback controller is next improved under the AAC learning framework by online adaptation of a critic NN and a controller NN. The mixed VRFT-AAC approach is validated on a multi-input multi-output nonlinear constrained coupled vertical two-tank system. Discussions on the control system behavior are offered together with comparisons with similar approaches.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] MULTI-STEP ACTOR-CRITIC FRAMEWORK FOR REINFORCEMENT LEARNING IN CONTINUOUS CONTROL
    Huang T.
    Chen G.
    Journal of Applied and Numerical Optimization, 2023, 5 (02): : 189 - 200
  • [32] Development and Validation of Active Roll Control based on Actor-critic Neural Network Reinforcement Learning
    Bahr, Matthias
    Reicherts, Sebastian
    Sieberg, Philipp
    Morss, Luca
    Schramm, Dieter
    SIMULTECH: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON SIMULATION AND MODELING METHODOLOGIES, TECHNOLOGIES AND APPLICATIONS, 2019, 2019, : 36 - 46
  • [33] Virtual Reference Feedback Tuning of MIMO Data-Driven Model-Free Adaptive Control Algorithms
    Roman, Raul-Cristian
    Radac, Mircea-Bogdan
    Precup, Radu-Emil
    Petriu, Emil M.
    TECHNOLOGICAL INNOVATION FOR CYBER-PHYSICAL SYSTEMS, 2016, 470 : 253 - 260
  • [34] Data-Driven Model-Free Adaptive Control of Z-Source Inverters
    Asadi, Yasin
    Ahmadi, Amirhossein
    Mohammadi, Sasan
    Amani, Ali Moradi
    Marzband, Mousa
    Mohammadi-Ivatloo, Behnam
    SENSORS, 2021, 21 (22)
  • [35] Data-Driven Based Model-Free Adaptive Optimal Control Method for Hypersonic Morphing Vehicle
    Bao, Cunyu
    Wang, Peng
    Tang, Guojian
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (04) : 3713 - 3725
  • [36] Data-Driven Virtual Reference Feedback Tuning and Reinforcement Q-learning for Model-Free Position Control of an Aerodynamic System
    Radac, Mircea-Bogdan
    Precup, Radu-Emil
    Roman, Raul-Cristian
    2016 24TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2016, : 1126 - 1132
  • [37] Multilayer adaptive critic design with digital twin for data-driven optimal tracking control and industrial applications
    Wang, Ding
    Ma, Hongyu
    Qiao, Junfei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [38] Data-driven tracking control design with reinforcement learning involving a wastewater treatment application
    Wang, Ding
    Li, Xin
    Hu, Lingzhi
    Qiao, Junfei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [39] Actor-critic learning based PID control for robotic manipulators
    Nohooji, Hamed Rahimi
    Zaraki, Abolfazl
    Voos, Holger
    APPLIED SOFT COMPUTING, 2024, 151
  • [40] Heterogeneous trading strategies with adaptive fuzzy Actor-Critic reinforcement learning: A behavioral approach
    Bekiros, Stelios D.
    JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2010, 34 (06) : 1153 - 1170