Explorized policy iteration for continuous-time linear systems

Cited by: 0
Authors
Chun, Tae Yoon
Choi, Yoon Ho
Park, Jin Bae
Keywords
Adaptive optimal control; Exploration; LQR; Persistency of excitation; Policy iteration
DOI
10.5370/KIEE.2012.61.3.451
Abstract
Policy iteration (PI) for continuous-time (CT) systems requires exploration of the state space, a condition known as persistency of excitation in the adaptive control community. To address this requirement, this paper proposes a PI scheme that is explorized by an additional probing signal. The proposed PI method efficiently finds, online, the CT linear quadratic (LQ) optimal control without knowledge of the system matrix A, and it guarantees stability and convergence to the LQ optimal control; both properties are proven in the presence of the probing signal. A design method for the probing signal is also presented to balance exploration of the state space against control performance. Finally, several simulation results are provided to verify the effectiveness of the proposed explorized PI method.
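For orientation, the iteration that PI schemes for the CT LQ problem build on can be sketched in a few lines. The block below is not the explorized, data-driven method of this paper: it is the classical model-based (Kleinman-type) policy iteration, which does use the system matrix A and adds no probing signal. The function names `solve_lyapunov` and `policy_iteration`, the double-integrator example, and the initial gain `K0` are all assumptions chosen for illustration.

```python
import numpy as np


def solve_lyapunov(Ac, M):
    """Solve Ac^T P + P Ac + M = 0 for P using Kronecker vectorization."""
    n = Ac.shape[0]
    I = np.eye(n)
    # With column-major vec: vec(Ac^T P) = kron(I, Ac^T) vec(P),
    # and vec(P Ac) = kron(Ac^T, I) vec(P).
    L = np.kron(I, Ac.T) + np.kron(Ac.T, I)
    p = np.linalg.solve(L, -M.flatten(order="F"))
    P = p.reshape((n, n), order="F")
    return 0.5 * (P + P.T)  # symmetrize against round-off


def policy_iteration(A, B, Q, R, K0, iters=20):
    """Model-based policy iteration for CT LQR (illustrative sketch).

    K0 must be stabilizing, i.e. A - B K0 must be Hurwitz.
    """
    K = K0
    for _ in range(iters):
        Ac = A - B @ K
        # Policy evaluation: cost matrix of the current gain K.
        P = solve_lyapunov(Ac, Q + K.T @ R @ K)
        # Policy improvement: K = R^{-1} B^T P.
        K = np.linalg.solve(R, B.T @ P)
    return P, K


# Hypothetical example: a double integrator with unit weights.
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])
K0 = np.array([[1.0, 1.0]])  # assumed stabilizing initial gain

P, K = policy_iteration(A, B, Q, R, K0)
print("P =", P)
print("K =", K)
```

For this example the gain should approach K = [1, sqrt(3)], the solution of the corresponding algebraic Riccati equation. The abstract's point is that the policy evaluation step can be carried out from measured trajectories instead of from A, which is where the exploration (persistency of excitation) requirement enters.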
Pages: 451 - 458
Number of pages: 7
Related papers
50 in total
  • [1] Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems
    Lee, Jae Young
    Park, Jin Bae
    Choi, Yoon Ho
    AUTOMATICA, 2012, 48 (11) : 2850 - 2859
  • [2] Robust Policy Iteration for Continuous-Time Linear Quadratic Regulation
    Pang, Bo
    Bian, Tao
    Jiang, Zhong-Ping
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (01) : 504 - 511
  • [3] Adaptive optimal control for continuous-time linear systems based on policy iteration
    Vrabie, D.
    Pastravanu, O.
    Abu-Khalaf, M.
    Lewis, F. L.
    AUTOMATICA, 2009, 45 (02) : 477 - 484
  • [4] On Integral Value Iteration for Continuous-Time Linear Systems
    Lee, Jae Young
    Park, Jin Bae
    Choi, Yoon Ho
    2013 AMERICAN CONTROL CONFERENCE (ACC), 2013 : 4215 - 4220
  • [5] Value Iteration for Continuous-Time Linear Time-Invariant Systems
    Possieri, Corrado
    Sassano, Mario
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (05) : 3070 - 3077
  • [6] On integral generalized policy iteration for continuous-time linear quadratic regulations
    Lee, Jae Young
    Park, Jin Bae
    Choi, Yoon Ho
    AUTOMATICA, 2014, 50 (02) : 475 - 489
  • [7] Policy-Iteration-Based Adaptive Optimal Control for Uncertain Continuous-Time Linear Systems with Excitation Signals
    Lee, Jae Young
    Park, Jin Bae
    Choi, Yoon Ho
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010 : 646 - 651
  • [8] Continuous-Time Time-Varying Policy Iteration
    Wei, Qinglai
    Liao, Zehua
    Yang, Zhanyu
    Li, Benkai
    Liu, Derong
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (12) : 4958 - 4971
  • [9] Bias-policy iteration based adaptive dynamic programming for unknown continuous-time linear systems
    Jiang, Huaiyuan
    Zhou, Bin
    AUTOMATICA, 2022, 136
  • [10] Bias-policy iteration based optimal control for unknown continuous-time linear periodic systems
    Li, Xiang
    Jiang, Huaiyuan
    Zhou, Bin
    SYSTEMS & CONTROL LETTERS, 2024, 189