Data-Driven Model-Free Tracking Reinforcement Learning Control with VRFT-based Adaptive Actor-Critic

被引：35

作者：

Radac, Mircea-Bogdan ^{[1
]}

Precup, Radu-Emil ^{[1
]}

机构：

[1] Politehn Univ Timisoara, Dept Automat & Appl Informat, Timisoara 300006, Romania

来源：

APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 09期

关键词：

adaptive actor-critic; model-free control; data-driven control; reinforcement learning; approximate dynamic programming; output reference model tracking; multi-input multi-output systems; vertical tank systems; Virtual Reference Feedback Tuning; CONJUGATE-GRADIENT ALGORITHM; TRAJECTORY TRACKING; NEURAL-NETWORKS; CONTROL DESIGN; SYSTEMS; FEEDBACK; ITERATION;

D O I：

10.3390/app9091807

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

This paper proposes a neural network (NN)-based control scheme in an Adaptive Actor-Critic (AAC) learning framework designed for output reference model tracking, as a representative deep-learning application. The control learning scheme is model-free with respect to the process model. AAC designs usually require an initial controller to start the learning process; however, systematic guidelines for choosing the initial controller are not offered in the literature, especially in a model-free manner. Virtual Reference Feedback Tuning (VRFT) is proposed for obtaining an initially stabilizing NN nonlinear state-feedback controller, designed from input-state-output data collected from the process in open-loop setting. The solution offers systematic design guidelines for initial controller design. The resulting suboptimal state-feedback controller is next improved under the AAC learning framework by online adaptation of a critic NN and a controller NN. The mixed VRFT-AAC approach is validated on a multi-input multi-output nonlinear constrained coupled vertical two-tank system. Discussions on the control system behavior are offered together with comparisons with similar approaches.

引用

页数：24

共 50 条

[21] Data-driven model-free slip control of anti-lock braking systems using reinforcement Q-learning
Radac, Mircea-Bogdan
Precup, Radu-Emil
NEUROCOMPUTING, 2018, 275 : 317 - 329
[22] Evaluating Correctness of Reinforcement Learning based on Actor-Critic Algorithm
Kim, Youngjae
Hussain, Manzoor
Suh, Jae-Won
Hong, Jang-Eui
2022 THIRTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN), 2022, : 320 - 325
[23] USING ACTOR-CRITIC REINFORCEMENT LEARNING FOR CONTROL AND FLIGHT FORMATION OF QUADROTORS
Torres, Edgar
Xu, Lei
Sardarmehni, Tohid
PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5, 2022,
[24] A Novel Enhanced Data-Driven Model-Free Adaptive Control Scheme for Path Tracking of Autonomous Vehicles
Liu, Shida
Lin, Guang
Ji, Honghai
Jin, Shangtai
Hou, Zhongsheng
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (01) : 579 - 590
[25] Optimal behaviour prediction using a primitive-based data-driven model-free iterative learning control approach
Radac, Mircea-Bogdan
Precup, Radu-Emil
COMPUTERS IN INDUSTRY, 2015, 74 : 95 - 109
[26] A Data-Driven Model-Reference Adaptive Control Approach Based on Reinforcement Learning
Abouheaf, Mohammed
Gueaieb, Wail
Spinello, Davide
Al-Sharhan, Salah
2021 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS (ROSE 2021), 2021,
[27] Expert knowledge data-driven based actor-critic reinforcement learning framework to solve computationally expensive unit commitment problems with uncertain wind energy
Liang, Huijun
Lin, Chenhao
Pang, Aokang
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2024, 159
[28] Model-free data driven control for trajectory tracking of an amplified piezoelectric actuator
Shafiq, Muhammad
Saleem, Ashraf
Mesbah, Mostefa
SENSORS AND ACTUATORS A-PHYSICAL, 2018, 279 : 27 - 35
[29] Data-Driven Model-Free Adaptive Control based on a Novel Double Successive Projection Algorithm
Liu, Shida
Hou, Zhongsheng
Li, Zhenxuan
2016 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2016,
[30] Lazy-Learning-Based Data-Driven Model-Free Adaptive Predictive Control for a Class of Discrete-Time Nonlinear Systems
Hou, Zhongsheng
Liu, Shida
Tian, Taotao
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (08) : 1914 - 1928

← 1 2 3 4 5 →