Toward Data-Driven Optimal Control: A Systematic Review of the Landscape

被引：33

作者：

Prag, Krupa ^{[1
]}

Woolway, Matthew ^{[2
]}

Celik, Turgay ^{[3
,4
]}

机构：

[1] Univ Witwatersrand, Sch Comp Sci & Appl Math, ZA-200 Johannesburg, South Africa

[2] Univ Johannesburg, Fac Engn & Built Environm, ZA-2006 Johannesburg, South Africa

[3] Univ Witwatersrand, Sch Elect & Informat Engn, ZA-2000 Johannesburg, South Africa

[4] Univ Witwatersrand, Wits Inst Data Sci, ZA-2000 Johannesburg, South Africa

来源：

IEEE ACCESS | 2022年 / 10卷

关键词：

Control systems; Mathematical models; Adaptation models; Data models; Adaptive control; Optimal control; Tuning; Data-driven control; adaptive control; model-free; model-based; model predictive control; optimal control; learning-based control; systematic review; MODEL-PREDICTIVE CONTROL; ITERATIVE LEARNING CONTROL; DISCRETE-TIME-SYSTEMS; FUZZY-LOGIC; ADAPTIVE-CONTROL; NEURAL-NETWORK; CONTROL DESIGN; ROBOT CONTROL; MPC; MANAGEMENT;

D O I：

10.1109/ACCESS.2022.3160709

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This literature review extends and contributes to research on the development of data-driven optimal control. Previous reviews have documented the development of model-based and data-driven control in isolation and have not critically reviewed reinforcement learning approaches for adaptive data-driven optimal control frameworks. The presented review discusses the development of model-based to model-free adaptive controllers, highlighting the use of data in control frameworks. In data-driven control frameworks, reinforcement learning methods may be used to derive the optimal policy for dynamical systems. Attractive characteristics of these methods include not requiring a mathematical model of complex systems, their inherent adaptive control capabilities, being an unsupervised learning technique and their decision-making abilities, which are both an advantage and motivation behind this approach. This review considers previous reviews on these topics, including recent work on data-driven control methods. In addition, this review shows the use of data to derive system dynamics, determine the control policy using feedback information, and tune fixed controllers. Furthermore, the review summarises various data-driven methods and their corresponding characteristics. Finally, the review provides a taxonomy, a timeline and a concise narrative of the development of model-based to model-free data-driven adaptive control and underlines the limitations of these techniques due to the lack of theoretical analysis. Areas of further work include theoretical analysis on stability and robustness for data-driven control systems, explainability of black-box policy learning techniques and an evaluation of the impact of the extension of system simulators to include digital twins.

引用

页码：32190 / 32212

页数：23

共 237 条

[1] Towards Intelligent Power Electronics-Dominated Grid via Machine Learning Techniques [J].

Abu-Rub, Omar H. ;

Fard, Amin Y. ;

Umar, Muhammad Farooq ;

Hosseinzadehtaher, Mohsen ;

Shadmands, Mohammad B. .

IEEE POWER ELECTRONICS MAGAZINE, 2021, 8 (01) :28-38

[2] The omnipresence of case-based reasoning in science and application [J].

Aha, DW .

KNOWLEDGE-BASED SYSTEMS, 1998, 11 (5-6) :261-273

[3] Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control [J].

Al-Tamimi, Asma ;

Lewis, Frank L. ;

Abu-Khalaf, Murad .

AUTOMATICA, 2007, 43 (03) :473-481

[4] Iterative learning control for discrete-time systems with exponential rate of convergence [J].

Amann, N ;

Owens, DH ;

Rogers, E .

IEE PROCEEDINGS-CONTROL THEORY AND APPLICATIONS, 1996, 143 (02) :217-224

[5]

Andrei N., 2006, Studies in Informatics and Control, V15, P51

[6]

[Anonymous], 2017, P INT C LEARN REPR

[7]

[Anonymous], 2013, Exploring Deep and Recurrent Architectures for Optimal Control

[8]

[Anonymous], 1999, ser. Prentice Hall information and system sciences series

[9]

[Anonymous], 1995, INTRO KALMAN FILTER

[10]

[Anonymous], 1980, Em Joint Automatic Control Conference, DOI DOI 10.1109/JACC.1980.4232010

← 1 2 3 4 5 6 7 8 9 10 →