Relative Almost Sure Regret Bounds for Certainty Equivalence Control of Markov Jump Systems

Times cited: 0
Authors
Sayedana, Borna [1]
Afshari, Mohammad [1]
Caines, Peter E. [1]
Mahajan, Aditya [1]
Affiliations
[1] McGill Univ, Dept Elect & Comp Engn, 3480 Rue Univ, Montreal, PQ H3A 0E9, Canada
Source
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC | 2023
Keywords
DOI
10.1109/CDC49753.2023.10383246
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In this paper, we consider the learning and control problem in an unknown Markov jump linear system (MJLS) with perfect state observations. We first establish a generic upper bound on the regret of any learning-based algorithm. We then propose a certainty-equivalence-based learning algorithm and show that it achieves a regret of $O(\sqrt{T}\log(T))$ relative to a certain subset of the sample space. As part of our analysis, we revisit the switched least squares system identification algorithm of [1], [2] for autonomous MJLS and generalize it to controlled MJLS, establishing strong consistency and almost sure rates of convergence of this method.
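To make the identification step concrete, the following is a minimal sketch of per-mode ("switched") least squares for a controlled MJLS. It is not the authors' algorithm. It assumes a scalar system x[t+1] = a[s]·x[t] + b[s]·u[t] + w[t], an observed mode s, and Gaussian exploratory inputs. The names `simulate`, `switched_least_squares`, `TRUE_PARAMS`, and `TRANS` are hypothetical, and the numerical values are illustrative only.

```python
import random

# Hypothetical two-mode example: (a_i, b_i) per mode and a mode transition matrix.
TRUE_PARAMS = [(0.5, 1.0), (-0.3, 0.5)]
TRANS = [[0.9, 0.1], [0.2, 0.8]]


def simulate(params, trans, horizon, seed=0, noise_std=0.1):
    """Simulate a scalar controlled MJLS and return (mode, x, u, x_next) tuples."""
    rng = random.Random(seed)
    x, mode = 0.0, 0
    data = []
    for _ in range(horizon):
        u = rng.gauss(0.0, 1.0)  # exploratory input (an assumption of this sketch)
        a, b = params[mode]
        x_next = a * x + b * u + rng.gauss(0.0, noise_std)
        data.append((mode, x, u, x_next))
        x = x_next
        # Draw the next mode from the current row of the transition matrix.
        mode = rng.choices(range(len(trans)), weights=trans[mode])[0]
    return data


def switched_least_squares(data, n_modes):
    """Estimate (a_i, b_i) for each mode using only the samples observed in that mode."""
    estimates = []
    for i in range(n_modes):
        sxx = sxu = suu = sxy = suy = 0.0
        for mode, x, u, y in data:
            if mode == i:
                sxx += x * x
                sxu += x * u
                suu += u * u
                sxy += x * y
                suy += u * y
        # Solve the 2x2 normal equations [[sxx, sxu], [sxu, suu]] [a, b]^T = [sxy, suy]^T.
        det = sxx * suu - sxu * sxu
        if det <= 1e-12:
            estimates.append(None)  # too little excitation in this mode
            continue
        a_hat = (sxy * suu - suy * sxu) / det
        b_hat = (suy * sxx - sxy * sxu) / det
        estimates.append((a_hat, b_hat))
    return estimates


if __name__ == "__main__":
    samples = simulate(TRUE_PARAMS, TRANS, horizon=5000)
    for i, est in enumerate(switched_least_squares(samples, n_modes=2)):
        print(f"mode {i}: true={TRUE_PARAMS[i]}, estimate={est}")
```

A certainty equivalence controller would then plug such estimates into the optimal control design as if they were the true parameters. That step is omitted here.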
Pages: 6629-6634
Page count: 6
Related papers
19 in total
  • [1] Abbasi-Yadkori Y., 2011, P 24TH ANN C LEARNIN, P1
  • [2] Abeille M., 2018, PR MACH LEARN RES, P1
  • [3] [Anonymous], 2020, Automatica, DOI 10.1016/J.AUTOMATICA.2020.108982
  • [4] Analysis of Stochastic Switched Systems With Application to Networked Control Under Jamming Attacks
    Cetinkaya, Ahmet
    Ishii, Hideaki
    Hayakawa, Tomohisa
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (05) : 2013 - 2028
  • [5] Venous thromboembolism (VTE) in Europe - The number of VTE events and associated morbidity and mortality
    Cohen, Alexander T.
    Agnelli, Giancarlo
    Anderson, Frederick A.
    Arcelus, Juan I.
    Bergqvist, David
    Brecht, Josef G.
    Greer, Ian A.
    Heit, John A.
    Hutchinson, Julia L.
    Kakkar, Ajay K.
    Mottier, Dominique
    Oger, Emmanuel
    Samama, Meyer-Michel
    Spannagl, Michael
    [J]. THROMBOSIS AND HAEMOSTASIS, 2007, 98 (04) : 756 - 764
  • [6] Costa O. L. V., 2006, DISCRETE TIME MARKOV
  • [7] Czornik A., 2003, CONTROL PROBLEMS JUM
  • [8] Input-to-State Stabilizing Control Under Denial-of-Service
    De Persis, Claudio
    Tesi, Pietro
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (11) : 2930 - 2944
  • [9] Discrete-Time Switched Linear Systems State Feedback Design With Application to Networked Control
    Deaecto, Grace S.
    Souza, Matheus
    Geromel, Jose C.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (03) : 877 - 881
  • [10] Du Z., 2021, ARXIV210512358