Toward Data-Driven Optimal Control: A Systematic Review of the Landscape

被引:28
作者
Prag, Krupa [1 ]
Woolway, Matthew [2 ]
Celik, Turgay [3 ,4 ]
机构
[1] Univ Witwatersrand, Sch Comp Sci & Appl Math, ZA-200 Johannesburg, South Africa
[2] Univ Johannesburg, Fac Engn & Built Environm, ZA-2006 Johannesburg, South Africa
[3] Univ Witwatersrand, Sch Elect & Informat Engn, ZA-2000 Johannesburg, South Africa
[4] Univ Witwatersrand, Wits Inst Data Sci, ZA-2000 Johannesburg, South Africa
关键词
Control systems; Mathematical models; Adaptation models; Data models; Adaptive control; Optimal control; Tuning; Data-driven control; adaptive control; model-free; model-based; model predictive control; optimal control; learning-based control; systematic review; MODEL-PREDICTIVE CONTROL; ITERATIVE LEARNING CONTROL; DISCRETE-TIME-SYSTEMS; FUZZY-LOGIC; ADAPTIVE-CONTROL; NEURAL-NETWORK; CONTROL DESIGN; ROBOT CONTROL; MPC; MANAGEMENT;
D O I
10.1109/ACCESS.2022.3160709
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This literature review extends and contributes to research on the development of data-driven optimal control. Previous reviews have documented the development of model-based and data-driven control in isolation and have not critically reviewed reinforcement learning approaches for adaptive data-driven optimal control frameworks. The presented review discusses the development of model-based to model-free adaptive controllers, highlighting the use of data in control frameworks. In data-driven control frameworks, reinforcement learning methods may be used to derive the optimal policy for dynamical systems. Attractive characteristics of these methods include not requiring a mathematical model of complex systems, their inherent adaptive control capabilities, being an unsupervised learning technique and their decision-making abilities, which are both an advantage and motivation behind this approach. This review considers previous reviews on these topics, including recent work on data-driven control methods. In addition, this review shows the use of data to derive system dynamics, determine the control policy using feedback information, and tune fixed controllers. Furthermore, the review summarises various data-driven methods and their corresponding characteristics. Finally, the review provides a taxonomy, a timeline and a concise narrative of the development of model-based to model-free data-driven adaptive control and underlines the limitations of these techniques due to the lack of theoretical analysis. Areas of further work include theoretical analysis on stability and robustness for data-driven control systems, explainability of black-box policy learning techniques and an evaluation of the impact of the extension of system simulators to include digital twins.
引用
收藏
页码:32190 / 32212
页数:23
相关论文
共 237 条
  • [1] Towards Intelligent Power Electronics-Dominated Grid via Machine Learning Techniques
    Abu-Rub, Omar H.
    Fard, Amin Y.
    Umar, Muhammad Farooq
    Hosseinzadehtaher, Mohsen
    Shadmands, Mohammad B.
    [J]. IEEE POWER ELECTRONICS MAGAZINE, 2021, 8 (01): : 28 - 38
  • [2] The omnipresence of case-based reasoning in science and application
    Aha, DW
    [J]. KNOWLEDGE-BASED SYSTEMS, 1998, 11 (5-6) : 261 - 273
  • [3] Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
    Al-Tamimi, Asma
    Lewis, Frank L.
    Abu-Khalaf, Murad
    [J]. AUTOMATICA, 2007, 43 (03) : 473 - 481
  • [4] Iterative learning control for discrete-time systems with exponential rate of convergence
    Amann, N
    Owens, DH
    Rogers, E
    [J]. IEE PROCEEDINGS-CONTROL THEORY AND APPLICATIONS, 1996, 143 (02): : 217 - 224
  • [5] Andrei N., 2006, Studies in Informatics and Control, V15, P51
  • [6] [Anonymous], 1990, Neural Networks for Control
  • [7] Control and Machine Intelligence for System Autonomy
    Antsaklis, Panos J.
    Rahnama, Arash
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2018, 91 (01) : 23 - 34
  • [8] An Energy Management System Design Using Fuzzy Logic Control: Smoothing the Grid Power Profile of a Residential Electro-Thermal Microgrid
    Arcos-Aviles, Diego
    Pascual, Julio
    Guinjoan, Francesc
    Marroyo, Luis
    Garcia-Gutierrez, Gabriel
    Gordillo-Orquera, Rodolfo
    Llanos-Proano, Jacqueline
    Sanchis, Pablo
    Motoasca, T. Emilia
    [J]. IEEE ACCESS, 2021, 9 : 25172 - 25188
  • [9] BETTERING OPERATION OF ROBOTS BY LEARNING
    ARIMOTO, S
    KAWAMURA, S
    MIYAZAKI, F
    [J]. JOURNAL OF ROBOTIC SYSTEMS, 1984, 1 (02): : 123 - 140
  • [10] Ariyur K. B., 2003, REAL TIME OPTIMIZATI