Robust value iteration for optimal control of discrete-time linear systems

被引:0
作者
Lai, Jing [1 ]
Xiong, Junlin [2 ]
机构
[1] Hefei Univ Technol, Sch Elect Engn & Automat, Hefei 230009, Peoples R China
[2] Univ Sci & Technol China, Dept Automat, Hefei 230026, Peoples R China
基金
中国国家自然科学基金;
关键词
Value iteration; Robust analysis; Reinforcement learning; Stochastic systems; ADAPTIVE OPTIMAL-CONTROL; CONE;
D O I
10.1016/j.automatica.2025.112121
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates properties of value iteration in the presence of deviations, starting from a benchmark control problem for discrete-time linear systems. Using properties of invariant metrics, value iteration for the considered control problem is demonstrated to be robust to small deviations. Specifically, value iteration enjoys a non-asymptotic convergence property when the deviations keep small in the execution, and generates solutions that converge to a small neighborhood of the optimal ones. As an extension, an optimistic model-free value iteration is proposed for systems suffering from additive noise of zero mean with the estimation error analysis and convergence analysis. The proposed results are illustrated through numerical simulations. (c) 2025 Published by Elsevier Ltd.
引用
收藏
页数:8
相关论文
共 32 条