A survey of adaptive optimal control theory

被引:2
作者
Pei, Xiaoxuan [1 ]
Li, Kewen [1 ]
Li, Yongming [1 ]
机构
[1] Liaoning Univ Technol, Coll Sci, Jinzhou 121001, Peoples R China
基金
中国国家自然科学基金;
关键词
optimal control; ADP; backstepping design; neural networks; application; APPROXIMATE OPTIMAL-CONTROL; FINITE-TIME STABILITY; NONLINEAR-SYSTEMS; ALGORITHM; DESIGN;
D O I
10.3934/mbe.2022561
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper makes a survey about the recent development of optimal control based on adaptive dynamic programming (ADP). First of all, based on DP algorithm and reinforcement learning (RL) algorithm, the origin and development of the optimization idea and its application in the control field are introduced. The second part introduces achievements in the optimal control direction, then we classify and summarize the research results of optimization method, constraint problem, structure design in control algorithm and practical engineering process based on optimal control. Finally, the possible future research topics are discussed. Through a comprehensive and complete investigation of its application in many existing fields, this survey fully demonstrates that the optimal control algorithms via ADP with critic-actor neural network (NN) structure, which also have a broad application prospect, and some developed optimal control design algorithms have been applied to practical engineering fields.
引用
收藏
页码:12058 / 12072
页数:15
相关论文
共 50 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[3]   A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].
Bhasin, S. ;
Kamalapurkar, R. ;
Johnson, M. ;
Vamvoudakis, K. G. ;
Lewis, F. L. ;
Dixon, W. E. .
AUTOMATICA, 2013, 49 (01) :82-92
[4]   Reinforcement Learning-Based Fixed-Time Trajectory Tracking Control for Uncertain Robotic Manipulators With Input Saturation [J].
Cao, Shengjie ;
Sun, Liang ;
Jiang, Jingjing ;
Zuo, Zongyu .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) :4584-4595
[5]   Sensor fault estimation for hydraulic servo actuator based on sliding mode observer [J].
Djordjevic, Vladimir ;
Dubonjic, Ljubisa ;
Morato, Marcelo Menezes ;
Prsic, Dragan ;
Stojanovic, Vladimir .
MATHEMATICAL MODELLING AND CONTROL, 2022, 2 (01) :34-43
[6]   Locally optimal and robust backstepping design [J].
Ezal, K ;
Pan, ZG ;
Kokotovic, PV .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2000, 45 (02) :260-271
[7]   Inverse optimality in robust stabilization [J].
Freeman, RA ;
Kokotovic, PV .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1996, 34 (04) :1365-1391
[8]   Fixed-time control of delayed neural networks with impulsive perturbations [J].
Hu, Jingting ;
Sui, Guixia ;
Lv, Xiaoxiao ;
Li, Xiaodi .
NONLINEAR ANALYSIS-MODELLING AND CONTROL, 2018, 23 (06) :904-920
[9]   Input-to-state stability of delayed systems with bounded-delay impulses [J].
Jiang, Bangxin ;
Lou, Yijun ;
Lu, Jianquan .
MATHEMATICAL MODELLING AND CONTROL, 2022, 2 (02) :44-54
[10]  
Kalman R. E., 1964, J BASIC ENG-T ASME, DOI [DOI 10.1115/1.3653115, 10.1115/1.3653115]