An adaptive dynamic programming-based algorithm for infinite-horizon linear quadratic stochastic optimal control problems

被引:0
作者
Heng Zhang
机构
[1] Shandong University,School of Control Science and Engineering
来源
Journal of Applied Mathematics and Computing | 2023年 / 69卷
关键词
Linear quadratic stochastic optimal control; Policy iteration; Model-free; Adaptive dynamic programming; 93E03; 93E20;
D O I
暂无
中图分类号
学科分类号
摘要
This paper develops a novel adaptive dynamic programming (ADP)-based model-free policy iteration (PI) algorithm to solve an infinite-horizon continuous-time linear quadratic stochastic (LQS) optimal control problem, where the diffusion term in system dynamics contains both control and state variables. First, we apply Ito’s lemma and take expectations to describe a relationship among the state trajectory, the control input and the matrices to be solved. Then, without needing the information of all system coefficient matrices, the ADP-based model-free algorithm is developed to approximate the optimal control from the collected data. Moreover, we give the convergence analysis under some mild conditions. Finally, a numerical example and an illustrative application are served to show that the proposed algorithm is effective.
引用
收藏
页码:2741 / 2760
页数:19
相关论文
共 50 条
[21]   Optimal Control for Fractional Stochastic Differential Systems via Adaptive Dynamic Programming [J].
Liu, Lei ;
Liu, Chen ;
Zhu, Runyu .
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, :1238-1243
[22]   Adaptive Dynamic Programming-based Fault Tolerant Control for a Quadrotor UAV Attitude Stabilization [J].
Yang, Shaoyu ;
Li, Bo ;
Liu, Hui ;
Zhang, Haichao .
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, :2612-2617
[23]   Infinite time linear quadratic stackelberg game problem for unknown stochastic discrete-time systems via adaptive dynamic programming approach [J].
Liu, Xikui ;
Liu, Ruirui ;
Li, Yan .
ASIAN JOURNAL OF CONTROL, 2021, 23 (02) :937-948
[24]   Adaptive Dynamic Programming-Based Spacecraft Attitude Control Under a Tube-Based Framework [J].
Li, Shiyi ;
Liu, Kerun ;
Liu, Ming .
ELECTRONICS, 2024, 13 (22)
[25]   Adaptive dynamic programming for finite-horizon optimal control of linear time-varying discrete-time systems [J].
Pang, Bo ;
Bian, Tao ;
Jiang, Zhong-Ping .
CONTROL THEORY AND TECHNOLOGY, 2019, 17 (01) :73-84
[26]   Adaptive dynamic programming for finite-horizon optimal control of linear time-varying discrete-time systems [J].
Bo Pang ;
Tao Bian ;
Zhong-Ping Jiang .
Control Theory and Technology, 2019, 17 :73-84
[27]   Adaptive Dynamic Programming for Finite-Horizon Optimal Tracking Control of a Class of Nonlinear Systems [J].
Wang Ding ;
Liu Derong ;
Wei Qinglai .
2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, :2450-2455
[28]   Adaptive Dynamic Programming-Based Fault-Tolerant Position-Force Control of Constrained Reconfigurable Manipulators [J].
Ma, Bing ;
Dong, Bo ;
Zhou, Fan ;
Li, Yuanchun .
IEEE ACCESS, 2020, 8 :183286-183299
[29]   Extended adaptive optimal control of linear systems with unknown dynamics using adaptive dynamic programming [J].
Gan, Minggang ;
Zhao, Jingang ;
Zhang, Chi .
ASIAN JOURNAL OF CONTROL, 2021, 23 (02) :1097-1106
[30]   Adaptive dynamic programming-based feature tracking control of visual servoing manipulators with unknown dynamics [J].
Xiaolin Ren ;
Hongwen Li .
Complex & Intelligent Systems, 2022, 8 :255-269