Discounted Near-Optimal Control of Affine Systems via a Progressive Cost Evolution Formulation

被引:1
作者
Wang, Ding [1 ,2 ]
Wu, Junlong [1 ,2 ]
Hu, Lingzhi [1 ,2 ]
Qiao, Junfei [1 ,2 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
[2] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Iterative methods; Cost function; Optimal control; Convergence; Nonlinear systems; Costs; Stability criteria; Adaptive critic; attraction region; combined value iteration; evolving control sequence; intelligent control; nonlinear systems; VALUE-ITERATION; STABILITY ANALYSIS;
D O I
10.1109/TCSII.2022.3224289
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this essay, we develop a combined value iteration (CVI) framework to address discounted optimal control problems for discrete-time affine nonlinear systems. First, generated by novel value iteration (NVI), the admissibility is investigated for the iterative control policy. Note that the relaxation factor leads to the adjustable convergence speed. Second, the constraint condition for the discount factor is established to guarantee the admissibility of the iterative control policy derived from stabilizing value iteration (SVI). In addition, the monotonicity is discussed for the iterative cost function sequence. More importantly, CVI is constructed based on NVI and SVI. Third, produced by CVI, the system stability under the evolving control sequence is ensured by the introduction of the attraction region. In the end, a numerical example is involved to confirm the related theoretical results.
引用
收藏
页码:1535 / 1539
页数:5
相关论文
共 20 条