Discounted Near-Optimal Control of Affine Systems via a Progressive Cost Evolution Formulation

被引：1

作者：

Wang, Ding ^{[1
,2
]}

Wu, Junlong ^{[1
,2
]}

Hu, Lingzhi ^{[1
,2
]}

Qiao, Junfei ^{[1
,2
]}

机构：

[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China

[2] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS | 2023年 / 70卷 / 04期

基金：

北京市自然科学基金; 中国国家自然科学基金;

关键词：

Iterative methods; Cost function; Optimal control; Convergence; Nonlinear systems; Costs; Stability criteria; Adaptive critic; attraction region; combined value iteration; evolving control sequence; intelligent control; nonlinear systems; VALUE-ITERATION; STABILITY ANALYSIS;

D O I：

10.1109/TCSII.2022.3224289

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this essay, we develop a combined value iteration (CVI) framework to address discounted optimal control problems for discrete-time affine nonlinear systems. First, generated by novel value iteration (NVI), the admissibility is investigated for the iterative control policy. Note that the relaxation factor leads to the adjustable convergence speed. Second, the constraint condition for the discount factor is established to guarantee the admissibility of the iterative control policy derived from stabilizing value iteration (SVI). In addition, the monotonicity is discussed for the iterative cost function sequence. More importantly, CVI is constructed based on NVI and SVI. Third, produced by CVI, the system stability under the evolving control sequence is ensured by the introduction of the attraction region. In the end, a numerical example is involved to confirm the related theoretical results.

引用

页码：1535 / 1539

页数：5

共 20 条

[1] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
Al-Tamimi, Asma
Lewis, Frank L.
Abu-Khalaf, Murad
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 943 - 949
[2] Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update
Dierks, Travis
Jagannathan, Sarangapani
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (07) : 1118 - 1129
[3] Nonlinear Robust Optimal Control via Adaptive Dynamic Programming of Permanent-Magnet Linear Synchronous Motor Drive for Uncertain Two-Axis Motion Control System
El-Sousy, Fayez F. M.
Abuhasel, Khaled Ali
[J]. IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2020, 56 (02) : 1940 - 1952
[4] Ha M., 2021, SYST CONTROL LETT, V147, P1
[5] Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control
Ha, Mingming
Wang, Ding
Liu, Derong
[J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (07) : 1262 - 1272
[6] A Novel Value Iteration Scheme With Adjustable Convergence Rate
Ha, Mingming
Wang, Ding
Liu, Derong
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7430 - 7442
[7] Offline and Online Adaptive Critic Control Designs With Stability Guarantee Through Value Iteration
Ha, Mingming
Wang, Ding
Liu, Derong
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (12) : 13262 - 13274
[8] Stability Analysis of Optimal Adaptive Control Under Value Iteration Using a Stabilizing Initial Policy
Heydari, Ali
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (09) : 4522 - 4527
[9] Optimal control for discrete-time affine non-linear systems using general value iteration
Li, H.
Liu, D.
[J]. IET CONTROL THEORY AND APPLICATIONS, 2012, 6 (18) : 2725 - 2736
[10] Observer-Based Neuro-Adaptive Optimized Control of Strict-Feedback Nonlinear Systems With State Constraints
Li, Yongming
Liu, Yanjun
Tong, Shaocheng
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (07) : 3131 - 3145

← 1 2 →