On the Sample Complexity of the Linear Quadratic Regulator

被引:197
作者
Dean, Sarah [1 ]
Mania, Horia [1 ]
Matni, Nikolai [2 ]
Recht, Benjamin [1 ]
Tu, Stephen [1 ]
机构
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[2] CALTECH, Dept Comp & Math Sci, Pasadena, CA 91125 USA
基金
美国国家科学基金会;
关键词
Optimal control; Robust control; System identification; Statistical learning theory; Reinforcement learning; System level synthesis; SYSTEM-IDENTIFICATION; BOUNDS; CONVERGENCE; RATES;
D O I
10.1007/s10208-019-09426-y
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper addresses the optimal control problem known as the linear quadratic regulator in the case when the dynamics are unknown. We propose a multistage procedure, calledCoarse-ID control, that estimates a model from a few experimental trials, estimates the error in that model with respect to the truth, and then designs a controller using both the model and uncertainty estimate. Our technique uses contemporary tools from random matrix theory to bound the error in the estimation procedure. We also employ a recently developed approach to control synthesis calledSystem Level Synthesisthat enables robust control design by solving a quasi-convex optimization problem. We provide end-to-end bounds on the relative error in control cost that are optimal in the number of parameters and that highlight salient properties of the system to be controlled such as closed-loop sensitivity and optimal control magnitude. We show experimentally that the Coarse-ID approach enables efficient computation of a stabilizing controller in regimes where simple control schemes that do not take the model uncertainty into account fail to stabilize the true system.
引用
收藏
页码:633 / 679
页数:47
相关论文
共 64 条
  • [1] Abbasi-Yadkori Yasin, 2011, COLT
  • [2] Abeille M, 2017, PR MACH LEARN RES, V54, P1246
  • [3] Abeille Marc., 2018, INT C MACHINE LEARNI, P1
  • [4] Anderson J, 2017, ANN ALLERTON CONF, P982, DOI 10.1109/ALLERTON.2017.8262844
  • [5] Borrelli F., 2017, Predictive Control for Linear and Hybrid Systems
  • [6] COMPUTATIONAL-COMPLEXITY OF MU-CALCULATION
    BRAATZ, RP
    YOUNG, PM
    DOYLE, JC
    MORARI, M
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1994, 39 (05) : 1000 - 1002
  • [7] BRADTKE SJ, 1994, PROCEEDINGS OF THE 1994 AMERICAN CONTROL CONFERENCE, VOLS 1-3, P3475
  • [8] Finite sample properties of system identification methods
    Campi, MC
    Weyer, E
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2002, 47 (08) : 1329 - 1334
  • [9] Chen, 2000, APPROACH
  • [10] CHEN J, 1993, PROCEEDINGS OF THE 32ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, P68, DOI 10.1109/CDC.1993.325188