Explorized policy iteration for continuous-time linear systems

被引：0

作者：

Chun, Tae Yoon

Choi, Yoon Ho

Park, Jin Bae

机构：

来源：

Transactions of the Korean Institute of Electrical Engineers | 2012年 / 61卷 / 03期

关键词：

Adaptive optimal control; Exploration; LQR; Persistency of excitation; Policy iteration;

D O I：

10.5370/KIEE.2012.61.3.451

中图分类号：

学科分类号：

摘要：

This paper addresses the problem that policy iteration (PI) for continuous-time (CT) systems requires explorations of the state space which is known as persistency of excitation in adaptive control community, and as a result, proposes a PI scheme explorized by an additional probing signal to solve the addressed problem. The proposed PI method efficiently finds in online fashion the related CT linear quadratic (LQ) optimal control without knowing the system matrix A, and guarantees the stability and convergence to the LQ optimal control, which is proven in this paper in the presence of the probing signal. A design method for the probing signal is also presented to balance the exploration of the state space and the control performance. Finally, several simulation results are provided to verify the effectiveness of the proposed explorized PI method.

引用

页码：451 / 458

页数：7

共 50 条

[1] Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems
Lee, Jae Young
Park, Jin Bae
Choi, Yoon Ho
AUTOMATICA, 2012, 48 (11) : 2850 - 2859
[2] Robust Policy Iteration for Continuous-Time Linear Quadratic Regulation
Pang, Bo
Bian, Tao
Jiang, Zhong-Ping
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (01) : 504 - 511
[3] Adaptive optimal control for continuous-time linear systems based on policy iteration
Vrabie, D.
Pastravanu, O.
Abu-Khalaf, M.
Lewis, F. L.
AUTOMATICA, 2009, 45 (02) : 477 - 484
[4] On Integral Value Iteration for Continuous-Time Linear Systems
Lee, Jae Young
Park, Jin Bae
Choi, Yoon Ho
2013 AMERICAN CONTROL CONFERENCE (ACC), 2013, : 4215 - 4220
[5] Value Iteration for Continuous-Time Linear Time-Invariant Systems
Possieri, Corrado
Sassano, Mario
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (05) : 3070 - 3077
[6] On integral generalized policy iteration for continuous-time linear quadratic regulations
Lee, Jae Young
Park, Jin Bae
Choi, Yoon Ho
AUTOMATICA, 2014, 50 (02) : 475 - 489
[7] Policy-Iteration-Based Adaptive Optimal Control for Uncertain Continuous-Time Linear Systems with Excitation Signals
Lee, Jae Young
Park, Jin Bae
Choi, Yoon Ho
INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 646 - 651
[8] Continuous-Time Time-Varying Policy Iteration
Wei, Qinglai
Liao, Zehua
Yang, Zhanyu
Li, Benkai
Liu, Derong
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (12) : 4958 - 4971
[9] Bias-policy iteration based adaptive dynamic programming for unknown continuous-time linear systems
Jiang, Huaiyuan
Zhou, Bin
AUTOMATICA, 2022, 136
[10] Bias-policy iteration based optimal control for unknown continuous-time linear periodic systems
Li, Xiang
Jiang, Huaiyuan
Zhou, Bin
SYSTEMS & CONTROL LETTERS, 2024, 189

← 1 2 3 4 5 →