The intelligent critic framework for advanced optimal control

被引：137

作者：

Wang, Ding ^{[1
,2
,3
,4
]}

Ha, Mingming ^{[5
]}

Zhao, Mingming ^{[1
,2
,3
,4
]}

机构：

[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

[2] Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China

[3] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China

[4] Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China

[5] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China

来源：

ARTIFICIAL INTELLIGENCE REVIEW | 2022年 / 55卷 / 01期

基金：

北京市自然科学基金; 中国国家自然科学基金;

关键词：

Advanced optimal control; Dynamic systems; Intelligent critic; HORIZON OPTIMAL-CONTROL; TIME NONLINEAR-SYSTEMS; OPTIMAL TRACKING CONTROL; VALUE-ITERATION; FEEDBACK-CONTROL; ROBUST-CONTROL; ALGORITHMS; ADP; MODELS; GAME;

D O I：

10.1007/s10462-021-10118-9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The idea of optimization can be regarded as an important basis of many disciplines and hence is extremely useful for a large number of research fields, particularly for artificial-intelligence-based advanced control design. Due to the difficulty of solving optimal control problems for general nonlinear systems, it is necessary to establish a kind of novel learning strategies with intelligent components. Besides, the rapid development of computer and networked techniques promotes the research on optimal control within discrete-time domain. In this paper, the bases, the derivation, and recent progresses of critic intelligence for discrete-time advanced optimal control design are presented with an emphasis on the iterative framework. Among them, the so-called critic intelligence methodology is highlighted, which integrates learning approximators and the reinforcement formulation.

引用

页码：1 / 22

页数：22

共 106 条

[11] H∞ Codesign for Uncertain Nonlinear Control Systems Based on Policy Iteration Method [J].

Fan, Quan-Yong ;

Wang, Dongsheng ;

Xu, Bin .

IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (10) :10101-10110

[12] Adaptive Actor-Critic Design-Based Integral Sliding-Mode Control for Partially Unknown Nonlinear Systems With Input Disturbances [J].

Fan, Quan-Yong ;

Yang, Guang-Hong .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (01) :165-177

[13] Adaptive Optimal Output Regulation of Time-Delay Systems via Measurement Feedback [J].

Gao, Weinan ;

Jiang, Zhong-Ping .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (03) :938-945

[14] Adaptive Dynamic Programming and Adaptive Optimal Output Regulation of Linear Systems [J].

Gao, Weinan ;

Jiang, Zhong-Ping .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2016, 61 (12) :4164-4169

[15]

Ha M, 2022, IEEE T CYBERN

[16]

Ha M., 2021, SYST CONTROL LETT, V147, P1

[17] Neural-network-based discounted optimal control via an integrated value iteration with accuracy guarantee [J].

Ha, Mingming ;

Wang, Ding ;

Liu, Derong .

NEURAL NETWORKS, 2021, 144 :176-186

[18] Event-Triggered Adaptive Critic Control Design for Discrete-Time Constrained Nonlinear Systems [J].

Ha, Mingming ;

Wang, Ding ;

Liu, Derong .

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (09) :3158-3168

[19] A Self-Organizing Sliding-Mode Controller for Wastewater Treatment Processes [J].

Han, Honggui ;

Wu, Xiaolong ;

Qiao, Junfei .

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2019, 27 (04) :1480-1491

[20]

HAN X, 2021, IN PRESS

← 1 2 3 4 5 6 7 8 9 10 →