Confidence Regions for Predictions of Online Learning-Based Control

被引:1
作者
Capone, Alexandre [1 ]
Lederer, Armin [1 ]
Hirche, Sandra [1 ]
机构
[1] Tech Univ Munich, Dept Elect & Comp Engn, Chair Informat Oriented Control ITR, Munich, Germany
关键词
Gaussian processes; system identification; nonlinear systems; stochastic systems; Monte Carlo simulation; error estimation; GAUSSIAN-PROCESSES;
D O I
10.1016/j.ifacol.2020.12.1278
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although machine learning techniques are increasingly employed in control tasks, few methods exist to predict the behavior of closed-loop learning-based systems. In this paper, we introduce a method for computing confidence regions of closed-loop system trajectories under an online learning-based control law. We employ a sampling-based approximation and exploit system properties to prove that the computed confidence regions are correct with high probability. In a numerical simulation, we show that the proposed approach accurately predicts correct confidence regions. Copyright (C) 2020 The Authors.
引用
收藏
页码:1007 / 1012
页数:6
相关论文
共 17 条
[1]   Quasi-online reinforcement learning for robots [J].
Bakker, Bram ;
Zhumatiy, Viktor ;
Gruener, Gabriel ;
Schmidhuber, Juergen .
2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-10, 2006, :2997-+
[2]   DUAL EFFECT, CERTAINTY EQUIVALENCE, AND SEPARATION IN STOCHASTIC CONTROL [J].
BARSHALOM, Y ;
TSE, E .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (05) :494-500
[3]   Stable Gaussian process based tracking control of Euler-Lagrange systems [J].
Beckers, Thomas ;
Kulic, Dana ;
Hirche, Sandra .
AUTOMATICA, 2019, 103 :390-397
[4]  
Berkenkamp F, 2017, ADV NEUR IN, V30
[5]   Backstepping for Partially Unknown Nonlinear Systems Using Gaussian Processes [J].
Capone, Alexandre ;
Hirche, Sandra .
IEEE CONTROL SYSTEMS LETTERS, 2019, 3 (02) :416-421
[6]   Bayesian Nonparametric Adaptive Control Using Gaussian Processes [J].
Chowdhary, Girish ;
Kingravi, Hassan A. ;
How, Jonathan P. ;
Vela, Patricio A. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (03) :537-550
[7]   Gaussian Processes for Data-Efficient Learning in Robotics and Control [J].
Deisenroth, Marc Peter ;
Fox, Dieter ;
Rasmussen, Carl Edward .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (02) :408-423
[8]  
Kamthe S, 2018, PR MACH LEARN RES, V84
[9]  
Koller T, 2018, IEEE DECIS CONTR P, P6059, DOI 10.1109/CDC.2018.8619572
[10]   Stochastic model predictive control with active uncertainty learning: A Survey on dual control [J].
Mesbah, Ali .
ANNUAL REVIEWS IN CONTROL, 2018, 45 :107-117