Finite-Time Identification of Linear Systems: Fundamental Limits and Optimal Algorithms

被引:10
作者
Jedra, Yassir [1 ]
Proutiere, Alexandre [1 ]
机构
[1] Royal Inst Technol KTH, Sch Elect Engn & Comp Sci, Div Decis & Control Syst, S-11428 Stockholm, Sweden
关键词
Complexity theory; Linear systems; Trajectory; System dynamics; Upper bound; Optimization; Least mean squares methods; Finite-time identification; fixed confidence identification; learning theory; least-squares estimation; linear systems; fixed budget identification; sample complexity lower bounds; system identification; TAIL PROBABILITIES; SAMPLE PROPERTIES; QUADRATIC-FORMS; CONSISTENCY; MODELS;
D O I
10.1109/TAC.2022.3221705
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We investigate the linear system identification problem in the so-called fixed budget and fixed confidence settings. In the fixed budget setting, the learner aims at estimating the state transition matrix A from a random system trajectory of fixed length, whereas in the fixed confidence setting, the learner also controls the length of the observed trajectory - she can stop when she believes that enough information has been gathered. For both settings, we analyze the sample complexity in the probably approximately correct (PAC) framework defined as the length of the observed trajectory required to identify the system parameters with prescribed accuracy and confidence levels (e, d). In the fixed budget setting, we first establish problem-specific sample complexity lower bounds. We then present a finite-time analysis of the estimation error of the least-squares estimator (LSE) for stable systems, and show that in the high-accuracy regime, the sample complexity of the LSE matches our lower bounds. Our analysis of the LSE is sharper and easier to interpret than existing analyzes, and relies on novel concentration results for the covariates matrix. In the fixed confidence setting, in addition to the estimation objective, the learner also has to decide when to stop the collection of observations. The sample complexity then corresponds to the expected stopping time. For this setting, we also provide problem specific sample complexity lower bounds. We also propose a stopping rule which combined to the LSE enjoys a sample complexity that matches our lower bounds in the high-accuracy and high-confidence regime.
引用
收藏
页码:2805 / 2820
页数:16
相关论文
共 40 条
[1]  
Abbasi-Yadkori Yasin., 2011, Advances in Neural Information Processing Systems, P2312
[2]  
Arora S., 2018, Towards provable control for unknown linear dynamical systems
[3]   On the Sample Complexity of the Linear Quadratic Regulator [J].
Dean, Sarah ;
Mania, Horia ;
Matni, Nikolai ;
Recht, Benjamin ;
Tu, Stephen .
FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2020, 20 (04) :633-679
[4]  
Djehiche B., 2021, ARXIV
[5]   Finite time identification in unstable linear systems [J].
Faradonbeh, Mohamad Kazem Shirani ;
Tewari, Ambuj ;
Michailidis, George .
AUTOMATICA, 2018, 96 :342-353
[6]   Explore First, Exploit Next: The True Shape of Regret in Bandit Problems [J].
Garivier, Aurelien ;
Menard, Pierre ;
Stoltz, Gilles .
MATHEMATICS OF OPERATIONS RESEARCH, 2019, 44 (02) :377-399
[7]  
González RA, 2020, PR MACH LEARN RES, V120, P191
[8]  
Goodwin G.C., 1977, Dynamic System Identification:Experiment Design and Data Analysis, V136
[9]   BOUND ON TAIL PROBABILITIES FOR QUADRATIC FORMS IN INDEPENDENT RANDOM VARIABLES [J].
HANSON, DL ;
WRIGHT, FT .
ANNALS OF MATHEMATICAL STATISTICS, 1971, 42 (03) :1079-&
[10]  
Jedra Y, 2020, IEEE DECIS CONTR P, P996, DOI 10.1109/CDC42340.2020.9304362