Model-Free control performance improvement using virtual reference feedback tuning and reinforcement Q-learning

被引：49

作者：

Radac, Mircea-Bogdan ^{[1
]}

Precup, Radu-Emil ^{[1
,2
]}

Roman, Raul-Cristian ^{[1
]}

机构：

[1] Politehn Univ Timisoara, Dept Automat & Appl Informat, Timisoara, Romania

[2] Edith Cowan Univ, Sch Engn, Joondalup, WA, Australia

来源：

INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE | 2017年 / 48卷 / 05期

关键词：

Aerodynamic system; data-driven control; model-free control; position control; reinforcement Q-learning; virtual reference feedback tuning; CONTROL DESIGN; EXPERIMENTAL VALIDATION; TRAJECTORY TRACKING; SEARCH ALGORITHM; VRFT APPROACH; SYSTEMS; OPTIMIZATION; TORQUE;

D O I：

10.1080/00207721.2016.1236423

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper proposes the combination of two model-free controller tuning techniques, namely linear virtual reference feedback tuning (VRFT) and nonlinear state-feedback Q-learning, referred to as a newmixed VRFT-Q learning approach. VRFT is first used to find stabilising feedback controller using input-output experimental data from the process in a model reference tracking setting. Reinforcement Q-learning is next applied in the same setting using input-state experimental data collected under perturbed VRFT to ensure good exploration. The Q-learning controller learned with a batch fitted Q iteration algorithm uses two neural networks, one for the Q-function estimator and one for the controller, respectively. The VRFT-Q learning approach is validated on position control of a two-degrees-of-motion open-loop stable multi input-multi output (MIMO) aerodynamic system (AS). Extensive simulations for the two independent control channels of theMIMO AS show that the Q-learning controllers clearly improve performance over the VRFT controllers.

引用

页码：1071 / 1083

页数：13

共 50 条

[41] A Model-Free Solution for Stackelberg Games Using Reinforcement Learning and Projection Approaches
Abouheaf, Mohammed
Gueaieb, Wail
Miah, Suruz
Abdelhameed, Esam H.
2024 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS, ROSE 2024, 2024,
[42] Online Model-Free Reinforcement Learning for Output Feedback Tracking Control of a Class of Discrete-Time Systems With Input Saturation
Al-Mahasneh, Ahmad Jobran
Anavatti, Sreenatha G.
Garratt, Matthew A.
IEEE ACCESS, 2022, 10 : 104966 - 104979
[43] Model-Free Reinforcement-Learning-Based Control Methodology for Power Electronic Converters
Alfred, Dajr
Czarkowski, Dariusz
Teng, Jiaxin
2021 13TH ANNUAL IEEE GREEN TECHNOLOGIES CONFERENCE GREENTECH 2021, 2021, : 81 - 88
[44] Parameter tuning technique for a model-free vibration control system based on a virtual controlled object
Yonezawa, Ansei
Yonezawa, Heisei
Kajiwara, Itsuro
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 165
[45] An Adaptive Model-Free Control Method for Metro Train Based on Deep Reinforcement Learning
Lai, Wenzhu
Chen, Dewang
Huang, Yunhu
Huang, Benzun
ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 263 - 273
[46] Data-Driven Model-Free Adaptive Attitude Control Approach for Launch Vehicle With Virtual Reference Feedback Parameters Dining Method
Duan, Li
Hou, Zhongsheng
Yu, Xian
Jin, Shangtai
Lu, Kunfeng
IEEE ACCESS, 2019, 7 : 54106 - 54116
[47] Model-Free H∞ Prescribed Performance Control of Adaptive Cruise Control Systems via Policy Learning
Zhao, Jun
Jia, Bingyi
Zhao, Ziliang
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024,
[48] Data-Driven Model-Free Model-Reference Nonlinear Virtual State-Feedback Control from Input-Output Data
Radac, Mircea-Bogdan
Precup, Radu-Emil
Hedrea, Elena-Lorena
Mituletu, Ion-Cornel
2018 26TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2018, : 332 - 338
[49] Model-free closed-loop wind farm control using reinforcement learning with recursive least squares
Liew, Jaime
Gocmen, Tuhfe
Lio, Wai Hou
Larsen, Gunner Chr.
WIND ENERGY, 2024, 27 (11) : 1173 - 1187
[50] A model-free control method for big time delay system based on improved iterative feedback tuning
Ai, Wei
Zhu, Xuefeng
Peng, Tao
2009 IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1-3, 2009, : 264 - 269

← 1 2 3 4 5 →