An adaptive dynamic programming-based algorithm for infinite-horizon linear quadratic stochastic optimal control problems

被引:0
|
作者
Heng Zhang
机构
[1] Shandong University,School of Control Science and Engineering
来源
Journal of Applied Mathematics and Computing | 2023年 / 69卷
关键词
Linear quadratic stochastic optimal control; Policy iteration; Model-free; Adaptive dynamic programming; 93E03; 93E20;
D O I
暂无
中图分类号
学科分类号
摘要
This paper develops a novel adaptive dynamic programming (ADP)-based model-free policy iteration (PI) algorithm to solve an infinite-horizon continuous-time linear quadratic stochastic (LQS) optimal control problem, where the diffusion term in system dynamics contains both control and state variables. First, we apply Ito’s lemma and take expectations to describe a relationship among the state trajectory, the control input and the matrices to be solved. Then, without needing the information of all system coefficient matrices, the ADP-based model-free algorithm is developed to approximate the optimal control from the collected data. Moreover, we give the convergence analysis under some mild conditions. Finally, a numerical example and an illustrative application are served to show that the proposed algorithm is effective.
引用
收藏
页码:2741 / 2760
页数:19
相关论文
共 50 条