Data-Driven Model-Free Tracking Reinforcement Learning Control with VRFT-based Adaptive Actor-Critic

Cited by: 35

Authors:

Radac, Mircea-Bogdan [1]

Precup, Radu-Emil [1]

Affiliations:

[1] Politehn Univ Timisoara, Dept Automat & Appl Informat, Timisoara 300006, Romania

Source:

APPLIED SCIENCES-BASEL | 2019 / Vol. 9 / Issue 09

Keywords:

adaptive actor-critic; model-free control; data-driven control; reinforcement learning; approximate dynamic programming; output reference model tracking; multi-input multi-output systems; vertical tank systems; Virtual Reference Feedback Tuning; CONJUGATE-GRADIENT ALGORITHM; TRAJECTORY TRACKING; NEURAL-NETWORKS; CONTROL DESIGN; SYSTEMS; FEEDBACK; ITERATION;

DOI:

10.3390/app9091807

Chinese Library Classification (CLC) number:

O6 [Chemistry];

Discipline classification code:

0703 ;

Abstract:

This paper proposes a neural network (NN)-based control scheme in an Adaptive Actor-Critic (AAC) learning framework designed for output reference model tracking, as a representative deep-learning application. The control learning scheme is model-free with respect to the process model. AAC designs usually require an initial controller to start the learning process; however, systematic guidelines for choosing the initial controller are not offered in the literature, especially in a model-free manner. Virtual Reference Feedback Tuning (VRFT) is proposed for obtaining an initially stabilizing NN nonlinear state-feedback controller, designed from input-state-output data collected from the process in an open-loop setting. The solution offers systematic design guidelines for initial controller design. The resulting suboptimal state-feedback controller is next improved under the AAC learning framework by online adaptation of a critic NN and a controller NN. The mixed VRFT-AAC approach is validated on a multi-input multi-output nonlinear constrained coupled vertical two-tank system. Discussions on the control system behavior are offered together with comparisons with similar approaches.
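
The VRFT step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a hypothetical scalar first-order plant, an assumed first-order reference model, and a linear two-gain controller in place of the NN state-feedback controller, and it omits the AAC refinement stage entirely. All names (`A`, `B`, `A_M`, `simulate_open_loop`, `vrft_fit`) are illustrative.

```python
import random

# Hypothetical first-order plant y(k+1) = A*y(k) + B*u(k). It is used only to
# generate data; the VRFT fit below never reads A or B.
A, B = 0.9, 0.1
# Assumed reference model y_m(k+1) = A_M*y_m(k) + (1 - A_M)*r(k).
A_M = 0.8


def simulate_open_loop(u_seq, y0=0.0):
    """Collect open-loop outputs y(0..N) for the input sequence u(0..N-1)."""
    y_seq = [y0]
    for u in u_seq:
        y_seq.append(A * y_seq[-1] + B * u)
    return y_seq


def vrft_fit(u_seq, y_seq, a_m):
    """Least-squares fit of u(k) ~ theta_r * r_v(k) + theta_y * y(k)."""
    s_rr = s_ry = s_yy = s_ru = s_yu = 0.0
    for k, u in enumerate(u_seq):
        y_k, y_next = y_seq[k], y_seq[k + 1]
        # Virtual reference: the r(k) for which the reference model would
        # reproduce the measured step from y(k) to y(k+1).
        r_v = (y_next - a_m * y_k) / (1.0 - a_m)
        s_rr += r_v * r_v
        s_ry += r_v * y_k
        s_yy += y_k * y_k
        s_ru += r_v * u
        s_yu += y_k * u
    # Solve the 2x2 normal equations with Cramer's rule.
    det = s_rr * s_yy - s_ry * s_ry
    theta_r = (s_ru * s_yy - s_ry * s_yu) / det
    theta_y = (s_rr * s_yu - s_ry * s_ru) / det
    return theta_r, theta_y


def closed_loop_step(theta_r, theta_y, steps=30):
    """Unit-step response of the plant under u = theta_r*r + theta_y*y."""
    y, out = 0.0, []
    for _ in range(steps):
        u = theta_r * 1.0 + theta_y * y
        y = A * y + B * u
        out.append(y)
    return out


def reference_model_step(steps=30):
    """Unit-step response of the assumed reference model."""
    y, out = 0.0, []
    for _ in range(steps):
        y = A_M * y + (1.0 - A_M) * 1.0
        out.append(y)
    return out


rng = random.Random(0)
u_data = [rng.uniform(-1.0, 1.0) for _ in range(200)]  # open-loop excitation
y_data = simulate_open_loop(u_data)
theta_r, theta_y = vrft_fit(u_data, y_data, A_M)

y_cl = closed_loop_step(theta_r, theta_y)
y_ref = reference_model_step()
max_gap = max(abs(p - q) for p, q in zip(y_cl, y_ref))
print(theta_r, theta_y, max_gap)
```

In this noise-free toy case the fitted gains should come out close to (1 - A_M)/B = 2 and (A_M - A)/B = -1, so the closed loop matches the reference model. With measurement noise or a nonlinear process, as in the two-tank system named in the abstract, the fit would only be approximate, which is the situation the abstract describes as a suboptimal controller later improved by AAC learning.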


Pages: 24

50 items in total

[31] MULTI-STEP ACTOR-CRITIC FRAMEWORK FOR REINFORCEMENT LEARNING IN CONTINUOUS CONTROL
Huang T.
Chen G.
Journal of Applied and Numerical Optimization, 2023, 5 (02) : 189 - 200
[32] Development and Validation of Active Roll Control based on Actor-critic Neural Network Reinforcement Learning
Bahr, Matthias
Reicherts, Sebastian
Sieberg, Philipp
Morss, Luca
Schramm, Dieter
SIMULTECH: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON SIMULATION AND MODELING METHODOLOGIES, TECHNOLOGIES AND APPLICATIONS, 2019 : 36 - 46
[33] Virtual Reference Feedback Tuning of MIMO Data-Driven Model-Free Adaptive Control Algorithms
Roman, Raul-Cristian
Radac, Mircea-Bogdan
Precup, Radu-Emil
Petriu, Emil M.
TECHNOLOGICAL INNOVATION FOR CYBER-PHYSICAL SYSTEMS, 2016, 470 : 253 - 260
[34] Data-Driven Model-Free Adaptive Control of Z-Source Inverters
Asadi, Yasin
Ahmadi, Amirhossein
Mohammadi, Sasan
Amani, Ali Moradi
Marzband, Mousa
Mohammadi-Ivatloo, Behnam
SENSORS, 2021, 21 (22)
[35] Data-Driven Based Model-Free Adaptive Optimal Control Method for Hypersonic Morphing Vehicle
Bao, Cunyu
Wang, Peng
Tang, Guojian
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (04) : 3713 - 3725
[36] Data-Driven Virtual Reference Feedback Tuning and Reinforcement Q-learning for Model-Free Position Control of an Aerodynamic System
Radac, Mircea-Bogdan
Precup, Radu-Emil
Roman, Raul-Cristian
2016 24TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2016 : 1126 - 1132
[37] Multilayer adaptive critic design with digital twin for data-driven optimal tracking control and industrial applications
Wang, Ding
Ma, Hongyu
Qiao, Junfei
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
[38] Data-driven tracking control design with reinforcement learning involving a wastewater treatment application
Wang, Ding
Li, Xin
Hu, Lingzhi
Qiao, Junfei
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
[39] Actor-critic learning based PID control for robotic manipulators
Nohooji, Hamed Rahimi
Zaraki, Abolfazl
Voos, Holger
APPLIED SOFT COMPUTING, 2024, 151
[40] Heterogeneous trading strategies with adaptive fuzzy Actor-Critic reinforcement learning: A behavioral approach
Bekiros, Stelios D.
JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2010, 34 (06) : 1153 - 1170
