Advanced Policy Learning Near-Optimal Regulation

被引：0

作者：

Ding Wang ^{[1
,2
]}

Xiangnan Zhong ^{[1
,3
]}

机构：

[1] IEEE

[2] the Faculty of Information Technology, Beijing University of Technology, and also with the Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology

[3] the Department of Electrical Engineering, University of North Texas

来源：

IEEE/CAA Journal of Automatica Sinica | 2019年 / 6卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Adaptive critic algorithm; learning control; neural approximation; nonaffine dynamics; optimal regulation;

D O I：

暂无

中图分类号：

O232 [最优控制];

学科分类号：

070105 ; 0711 ; 071101 ; 0811 ; 081101 ;

摘要：

Designing advanced design techniques for feedback stabilization and optimization of complex systems is important to the modern control field. In this paper, a near-optimal regulation method for general nonaffine dynamics is developed with the help of policy learning. For addressing the nonaffine nonlinearity, a pre-compensator is constructed, so that the augmented system can be formulated as affine-like form. Different cost functions are defined for original and transformed controlled plants and then their relationship is analyzed in detail. Additionally, an adaptive critic algorithm involving stability guarantee is employed to solve the augmented optimal control problem. At last, several case studies are conducted for verifying the stability, robustness, and optimality of a torsional pendulum plant with suitable cost.

引用

页码：743 / 749

页数：7

共 50 条

[41] Input and plant parameter optimization via learning optimal control of Hamiltonian systems
Satoh, Satoshi
Fujimoto, Kenji
Saeki, Masami
IFAC PAPERSONLINE, 2015, 48 (13): : 57 - 62
[42] A computationally efficient norm optimal iterative learning control approach for LTV systems
Sun, Heqing
Alleyne, Andrew G.
AUTOMATICA, 2014, 50 (01) : 141 - 148
[43] Residual-gradient-based neural reinforcement learning for the optimal control of an acrobot
Xu, X
He, HG
PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL, 2002, : 758 - 763
[44] Control of golf swing robot by learning - Generation of optimal trajectory for real system
Ming, A
Luan, N
Kajitani, M
ENABLING SOCIETY WITH INFORMATION TECHNOLOGY, 2002, : 143 - 152
[45] A hierarchical reinforcement learning approach for optimal path tracking of wheeled mobile robots
Lei Zuo
Xin Xu
Chunming Liu
Zhenhua Huang
Neural Computing and Applications, 2013, 23 : 1873 - 1883
[46] Study on the Contribution of Energy Saving and Emission Reducing of Wujiang seven cascade reservoirs' optimal regulation
Zhang Yunfeng
Zhang Zezhong
Qi Qingqing
2011 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND CONTROL (ICECC), 2011, : 4541 - 4543
[47] Optimal Regulation Strategy of Electric Vehicle Charging and Discharging Based on Dynamic Regional Dispatching Price
Yu, Shaohua
Du, Zhaobin
Chen, Lidan
FRONTIERS IN ENERGY RESEARCH, 2022, 10
[48] An optimal regulation method for distribution network cluster considering heterogeneous characteristics of distributed photovoltaic resource
Lin, Da
Li, Junhao
Ni, Chouwei
Yang, Chunzhi
Chen, Zhe
Tu, Chunming
2024 THE 7TH INTERNATIONAL CONFERENCE ON ENERGY, ELECTRICAL AND POWER ENGINEERING, CEEPE 2024, 2024, : 1405 - 1411
[49] A policy iteration approach to online optimal control of continuous-time constrained-input systems
Modares, Hamidreza
Sistani, Mohammad-Bagher Naghibi
Lewis, Frank L.
ISA TRANSACTIONS, 2013, 52 (05) : 611 - 621
[50] Robust Policy Learning Control of Nonlinear Plants With Case Studies for a Power System Application
Wang, Ding
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (03) : 1733 - 1741

← 1 2 3 4 5 →