Advanced Policy Learning Near-Optimal Regulation

被引:0
作者
Ding Wang [1 ,2 ]
Xiangnan Zhong [1 ,3 ]
机构
[1] IEEE
[2] the Faculty of Information Technology, Beijing University of Technology, and also with the Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology
[3] the Department of Electrical Engineering, University of North Texas
基金
中国国家自然科学基金;
关键词
Adaptive critic algorithm; learning control; neural approximation; nonaffine dynamics; optimal regulation;
D O I
暂无
中图分类号
O232 [最优控制];
学科分类号
070105 ; 0711 ; 071101 ; 0811 ; 081101 ;
摘要
Designing advanced design techniques for feedback stabilization and optimization of complex systems is important to the modern control field. In this paper, a near-optimal regulation method for general nonaffine dynamics is developed with the help of policy learning. For addressing the nonaffine nonlinearity, a pre-compensator is constructed, so that the augmented system can be formulated as affine-like form. Different cost functions are defined for original and transformed controlled plants and then their relationship is analyzed in detail. Additionally, an adaptive critic algorithm involving stability guarantee is employed to solve the augmented optimal control problem. At last, several case studies are conducted for verifying the stability, robustness, and optimality of a torsional pendulum plant with suitable cost.
引用
收藏
页码:743 / 749
页数:7
相关论文
共 50 条
  • [41] Input and plant parameter optimization via learning optimal control of Hamiltonian systems
    Satoh, Satoshi
    Fujimoto, Kenji
    Saeki, Masami
    IFAC PAPERSONLINE, 2015, 48 (13): : 57 - 62
  • [42] A computationally efficient norm optimal iterative learning control approach for LTV systems
    Sun, Heqing
    Alleyne, Andrew G.
    AUTOMATICA, 2014, 50 (01) : 141 - 148
  • [43] Residual-gradient-based neural reinforcement learning for the optimal control of an acrobot
    Xu, X
    He, HG
    PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL, 2002, : 758 - 763
  • [44] Control of golf swing robot by learning - Generation of optimal trajectory for real system
    Ming, A
    Luan, N
    Kajitani, M
    ENABLING SOCIETY WITH INFORMATION TECHNOLOGY, 2002, : 143 - 152
  • [45] A hierarchical reinforcement learning approach for optimal path tracking of wheeled mobile robots
    Lei Zuo
    Xin Xu
    Chunming Liu
    Zhenhua Huang
    Neural Computing and Applications, 2013, 23 : 1873 - 1883
  • [46] Study on the Contribution of Energy Saving and Emission Reducing of Wujiang seven cascade reservoirs' optimal regulation
    Zhang Yunfeng
    Zhang Zezhong
    Qi Qingqing
    2011 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND CONTROL (ICECC), 2011, : 4541 - 4543
  • [47] Optimal Regulation Strategy of Electric Vehicle Charging and Discharging Based on Dynamic Regional Dispatching Price
    Yu, Shaohua
    Du, Zhaobin
    Chen, Lidan
    FRONTIERS IN ENERGY RESEARCH, 2022, 10
  • [48] An optimal regulation method for distribution network cluster considering heterogeneous characteristics of distributed photovoltaic resource
    Lin, Da
    Li, Junhao
    Ni, Chouwei
    Yang, Chunzhi
    Chen, Zhe
    Tu, Chunming
    2024 THE 7TH INTERNATIONAL CONFERENCE ON ENERGY, ELECTRICAL AND POWER ENGINEERING, CEEPE 2024, 2024, : 1405 - 1411
  • [49] A policy iteration approach to online optimal control of continuous-time constrained-input systems
    Modares, Hamidreza
    Sistani, Mohammad-Bagher Naghibi
    Lewis, Frank L.
    ISA TRANSACTIONS, 2013, 52 (05) : 611 - 621
  • [50] Robust Policy Learning Control of Nonlinear Plants With Case Studies for a Power System Application
    Wang, Ding
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (03) : 1733 - 1741