Data-Driven Model-Free Tracking Reinforcement Learning Control with VRFT-based Adaptive Actor-Critic

被引:35
作者
Radac, Mircea-Bogdan [1 ]
Precup, Radu-Emil [1 ]
机构
[1] Politehn Univ Timisoara, Dept Automat & Appl Informat, Timisoara 300006, Romania
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 09期
关键词
adaptive actor-critic; model-free control; data-driven control; reinforcement learning; approximate dynamic programming; output reference model tracking; multi-input multi-output systems; vertical tank systems; Virtual Reference Feedback Tuning; CONJUGATE-GRADIENT ALGORITHM; TRAJECTORY TRACKING; NEURAL-NETWORKS; CONTROL DESIGN; SYSTEMS; FEEDBACK; ITERATION;
D O I
10.3390/app9091807
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper proposes a neural network (NN)-based control scheme in an Adaptive Actor-Critic (AAC) learning framework designed for output reference model tracking, as a representative deep-learning application. The control learning scheme is model-free with respect to the process model. AAC designs usually require an initial controller to start the learning process; however, systematic guidelines for choosing the initial controller are not offered in the literature, especially in a model-free manner. Virtual Reference Feedback Tuning (VRFT) is proposed for obtaining an initially stabilizing NN nonlinear state-feedback controller, designed from input-state-output data collected from the process in open-loop setting. The solution offers systematic design guidelines for initial controller design. The resulting suboptimal state-feedback controller is next improved under the AAC learning framework by online adaptation of a critic NN and a controller NN. The mixed VRFT-AAC approach is validated on a multi-input multi-output nonlinear constrained coupled vertical two-tank system. Discussions on the control system behavior are offered together with comparisons with similar approaches.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Data-driven model-free slip control of anti-lock braking systems using reinforcement Q-learning
    Radac, Mircea-Bogdan
    Precup, Radu-Emil
    NEUROCOMPUTING, 2018, 275 : 317 - 329
  • [22] Evaluating Correctness of Reinforcement Learning based on Actor-Critic Algorithm
    Kim, Youngjae
    Hussain, Manzoor
    Suh, Jae-Won
    Hong, Jang-Eui
    2022 THIRTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN), 2022, : 320 - 325
  • [23] USING ACTOR-CRITIC REINFORCEMENT LEARNING FOR CONTROL AND FLIGHT FORMATION OF QUADROTORS
    Torres, Edgar
    Xu, Lei
    Sardarmehni, Tohid
    PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5, 2022,
  • [24] A Novel Enhanced Data-Driven Model-Free Adaptive Control Scheme for Path Tracking of Autonomous Vehicles
    Liu, Shida
    Lin, Guang
    Ji, Honghai
    Jin, Shangtai
    Hou, Zhongsheng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (01) : 579 - 590
  • [25] Optimal behaviour prediction using a primitive-based data-driven model-free iterative learning control approach
    Radac, Mircea-Bogdan
    Precup, Radu-Emil
    COMPUTERS IN INDUSTRY, 2015, 74 : 95 - 109
  • [26] A Data-Driven Model-Reference Adaptive Control Approach Based on Reinforcement Learning
    Abouheaf, Mohammed
    Gueaieb, Wail
    Spinello, Davide
    Al-Sharhan, Salah
    2021 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS (ROSE 2021), 2021,
  • [27] Expert knowledge data-driven based actor-critic reinforcement learning framework to solve computationally expensive unit commitment problems with uncertain wind energy
    Liang, Huijun
    Lin, Chenhao
    Pang, Aokang
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2024, 159
  • [28] Model-free data driven control for trajectory tracking of an amplified piezoelectric actuator
    Shafiq, Muhammad
    Saleem, Ashraf
    Mesbah, Mostefa
    SENSORS AND ACTUATORS A-PHYSICAL, 2018, 279 : 27 - 35
  • [29] Data-Driven Model-Free Adaptive Control based on a Novel Double Successive Projection Algorithm
    Liu, Shida
    Hou, Zhongsheng
    Li, Zhenxuan
    2016 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2016,
  • [30] Lazy-Learning-Based Data-Driven Model-Free Adaptive Predictive Control for a Class of Discrete-Time Nonlinear Systems
    Hou, Zhongsheng
    Liu, Shida
    Tian, Taotao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (08) : 1914 - 1928