The intelligent critic framework for advanced optimal control

被引:137
作者
Wang, Ding [1 ,2 ,3 ,4 ]
Ha, Mingming [5 ]
Zhao, Mingming [1 ,2 ,3 ,4 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[3] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
[4] Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
[5] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Advanced optimal control; Dynamic systems; Intelligent critic; HORIZON OPTIMAL-CONTROL; TIME NONLINEAR-SYSTEMS; OPTIMAL TRACKING CONTROL; VALUE-ITERATION; FEEDBACK-CONTROL; ROBUST-CONTROL; ALGORITHMS; ADP; MODELS; GAME;
D O I
10.1007/s10462-021-10118-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The idea of optimization can be regarded as an important basis of many disciplines and hence is extremely useful for a large number of research fields, particularly for artificial-intelligence-based advanced control design. Due to the difficulty of solving optimal control problems for general nonlinear systems, it is necessary to establish a kind of novel learning strategies with intelligent components. Besides, the rapid development of computer and networked techniques promotes the research on optimal control within discrete-time domain. In this paper, the bases, the derivation, and recent progresses of critic intelligence for discrete-time advanced optimal control design are presented with an emphasis on the iterative framework. Among them, the so-called critic intelligence methodology is highlighted, which integrates learning approximators and the reinforcement formulation.
引用
收藏
页码:1 / 22
页数:22
相关论文
共 106 条
[11]   H∞ Codesign for Uncertain Nonlinear Control Systems Based on Policy Iteration Method [J].
Fan, Quan-Yong ;
Wang, Dongsheng ;
Xu, Bin .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (10) :10101-10110
[12]   Adaptive Actor-Critic Design-Based Integral Sliding-Mode Control for Partially Unknown Nonlinear Systems With Input Disturbances [J].
Fan, Quan-Yong ;
Yang, Guang-Hong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (01) :165-177
[13]   Adaptive Optimal Output Regulation of Time-Delay Systems via Measurement Feedback [J].
Gao, Weinan ;
Jiang, Zhong-Ping .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (03) :938-945
[14]   Adaptive Dynamic Programming and Adaptive Optimal Output Regulation of Linear Systems [J].
Gao, Weinan ;
Jiang, Zhong-Ping .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2016, 61 (12) :4164-4169
[15]  
Ha M, 2022, IEEE T CYBERN
[16]  
Ha M., 2021, SYST CONTROL LETT, V147, P1
[17]   Neural-network-based discounted optimal control via an integrated value iteration with accuracy guarantee [J].
Ha, Mingming ;
Wang, Ding ;
Liu, Derong .
NEURAL NETWORKS, 2021, 144 :176-186
[18]   Event-Triggered Adaptive Critic Control Design for Discrete-Time Constrained Nonlinear Systems [J].
Ha, Mingming ;
Wang, Ding ;
Liu, Derong .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (09) :3158-3168
[19]   A Self-Organizing Sliding-Mode Controller for Wastewater Treatment Processes [J].
Han, Honggui ;
Wu, Xiaolong ;
Qiao, Junfei .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2019, 27 (04) :1480-1491
[20]  
HAN X, 2021, IN PRESS