Model-Free control performance improvement using virtual reference feedback tuning and reinforcement Q-learning

被引:49
|
作者
Radac, Mircea-Bogdan [1 ]
Precup, Radu-Emil [1 ,2 ]
Roman, Raul-Cristian [1 ]
机构
[1] Politehn Univ Timisoara, Dept Automat & Appl Informat, Timisoara, Romania
[2] Edith Cowan Univ, Sch Engn, Joondalup, WA, Australia
关键词
Aerodynamic system; data-driven control; model-free control; position control; reinforcement Q-learning; virtual reference feedback tuning; CONTROL DESIGN; EXPERIMENTAL VALIDATION; TRAJECTORY TRACKING; SEARCH ALGORITHM; VRFT APPROACH; SYSTEMS; OPTIMIZATION; TORQUE;
D O I
10.1080/00207721.2016.1236423
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes the combination of two model-free controller tuning techniques, namely linear virtual reference feedback tuning (VRFT) and nonlinear state-feedback Q-learning, referred to as a newmixed VRFT-Q learning approach. VRFT is first used to find stabilising feedback controller using input-output experimental data from the process in a model reference tracking setting. Reinforcement Q-learning is next applied in the same setting using input-state experimental data collected under perturbed VRFT to ensure good exploration. The Q-learning controller learned with a batch fitted Q iteration algorithm uses two neural networks, one for the Q-function estimator and one for the controller, respectively. The VRFT-Q learning approach is validated on position control of a two-degrees-of-motion open-loop stable multi input-multi output (MIMO) aerodynamic system (AS). Extensive simulations for the two independent control channels of theMIMO AS show that the Q-learning controllers clearly improve performance over the VRFT controllers.
引用
收藏
页码:1071 / 1083
页数:13
相关论文
共 50 条
  • [41] A Model-Free Solution for Stackelberg Games Using Reinforcement Learning and Projection Approaches
    Abouheaf, Mohammed
    Gueaieb, Wail
    Miah, Suruz
    Abdelhameed, Esam H.
    2024 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS, ROSE 2024, 2024,
  • [42] Online Model-Free Reinforcement Learning for Output Feedback Tracking Control of a Class of Discrete-Time Systems With Input Saturation
    Al-Mahasneh, Ahmad Jobran
    Anavatti, Sreenatha G.
    Garratt, Matthew A.
    IEEE ACCESS, 2022, 10 : 104966 - 104979
  • [43] Model-Free Reinforcement-Learning-Based Control Methodology for Power Electronic Converters
    Alfred, Dajr
    Czarkowski, Dariusz
    Teng, Jiaxin
    2021 13TH ANNUAL IEEE GREEN TECHNOLOGIES CONFERENCE GREENTECH 2021, 2021, : 81 - 88
  • [44] Parameter tuning technique for a model-free vibration control system based on a virtual controlled object
    Yonezawa, Ansei
    Yonezawa, Heisei
    Kajiwara, Itsuro
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 165
  • [45] An Adaptive Model-Free Control Method for Metro Train Based on Deep Reinforcement Learning
    Lai, Wenzhu
    Chen, Dewang
    Huang, Yunhu
    Huang, Benzun
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 263 - 273
  • [46] Data-Driven Model-Free Adaptive Attitude Control Approach for Launch Vehicle With Virtual Reference Feedback Parameters Dining Method
    Duan, Li
    Hou, Zhongsheng
    Yu, Xian
    Jin, Shangtai
    Lu, Kunfeng
    IEEE ACCESS, 2019, 7 : 54106 - 54116
  • [47] Model-Free H∞ Prescribed Performance Control of Adaptive Cruise Control Systems via Policy Learning
    Zhao, Jun
    Jia, Bingyi
    Zhao, Ziliang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024,
  • [48] Data-Driven Model-Free Model-Reference Nonlinear Virtual State-Feedback Control from Input-Output Data
    Radac, Mircea-Bogdan
    Precup, Radu-Emil
    Hedrea, Elena-Lorena
    Mituletu, Ion-Cornel
    2018 26TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2018, : 332 - 338
  • [49] Model-free closed-loop wind farm control using reinforcement learning with recursive least squares
    Liew, Jaime
    Gocmen, Tuhfe
    Lio, Wai Hou
    Larsen, Gunner Chr.
    WIND ENERGY, 2024, 27 (11) : 1173 - 1187
  • [50] A model-free control method for big time delay system based on improved iterative feedback tuning
    Ai, Wei
    Zhu, Xuefeng
    Peng, Tao
    2009 IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1-3, 2009, : 264 - 269