On policy iteration-based discounted optimal control

被引:2
作者
Dong, Botao [1 ]
Huang, Longyang [1 ]
Ma, Xiwen [1 ]
Zhang, Weidong [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai, Peoples R China
[2] Hainan Univ, Sch Informat & Commun Engn, Haikou, Hainan, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
continuous-time linear systems; data-driven control; discounted optimal control; policy iteration; ADAPTIVE OPTIMAL-CONTROL; TIME LINEAR-SYSTEMS; TRACKING CONTROL;
D O I
10.1002/rnc.7245
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article investigates the policy iteration (PI) method for the discounted optimal control (DOC) problem of continuous-time linear systems. We show the properties and convergence of the PI method. The theory analysis shows that the convergence of PI can be ensured without requiring the initial admissible control gain. The convergence rate of the PI method is provided. An iteration-termination criterion is established for detecting the stability of the closed-loop system under the control gain obtained by executing PI. Two kinds of data-driven implementations are constructed without using prior information of the system dynamics. A simulation example is presented to validated the properties of the PI method.
引用
收藏
页码:4926 / 4942
页数:17
相关论文
共 38 条
[1]  
Bellman R., 1957, DYNAMIC PROGRAMMING
[2]  
Bertsekas D.P., 2012, DYNAMIC PROGRAMMING, VII
[3]   Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design [J].
Bian, Tao ;
Jiang, Zhong-Ping .
AUTOMATICA, 2016, 71 :348-360
[4]   Homotopic policy iteration-based learning design for unknown linear continuous-time systemsx2729; [J].
Chen, Ci ;
Lewis, Frank L. ;
Li, Bo .
AUTOMATICA, 2022, 138
[5]   Event-triggered Hoo consensus for uncertain nonlinear systems using integral sliding mode based adaptive dynamic programming [J].
Chen, Zitao ;
Chen, Kairui ;
Chen, Si-Zhe ;
Zhang, Yun .
NEURAL NETWORKS, 2022, 156 (258-270) :258-270
[6]   Stabilization with discounted optimal control [J].
Gaitsgory, Vladimir ;
Gruene, Lars ;
Thatcher, Neil .
SYSTEMS & CONTROL LETTERS, 2015, 82 :91-98
[7]   Finite-Horizon Discounted Optimal Control: Stability and Performance [J].
Granzotto, Mathieu ;
Postoyan, Romain ;
Busoniu, Lucian ;
Nesic, Dragan ;
Daafouz, Jamal .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (02) :550-565
[8]   Robust approximate optimal control for air-breathing hypersonic vehicle [J].
Han, Xiao ;
Liu, Lei ;
Fan, Huijin ;
Cheng, Zhongtao .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (07) :4117-4140
[9]   Adaptive Optimal Control for a Class of Nonlinear Systems: The Online Policy Iteration Approach [J].
He, Shuping ;
Fang, Haiyang ;
Zhang, Maoguang ;
Liu, Fei ;
Ding, Zhengtao .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) :549-558
[10]   RESCue: A State-Disaggregated NFV System with Resilience, Elasticity, and State Consistency [J].
Jiang, Hao ;
Chang, Hyunseok ;
Mukherjee, Sarit ;
Van der Merwe, Jacobus .
2023 IEEE 9TH INTERNATIONAL CONFERENCE ON NETWORK SOFTWARIZATION, NETSOFT, 2023, :1-9