Confidence Regions for Predictions of Online Learning-Based Control

Cited by: 1

Authors:

Capone, Alexandre [1]

Lederer, Armin [1]

Hirche, Sandra [1]

Affiliations:

[1] Tech Univ Munich, Dept Elect & Comp Engn, Chair Informat Oriented Control ITR, Munich, Germany

Source:

IFAC PAPERSONLINE | 2020 / Vol. 53 / Issue 02

Keywords:

Gaussian processes; system identification; nonlinear systems; stochastic systems; Monte Carlo simulation; error estimation

DOI:

10.1016/j.ifacol.2020.12.1278

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Subject classification code:

0812

Abstract:

Although machine learning techniques are increasingly employed in control tasks, few methods exist to predict the behavior of closed-loop learning-based systems. In this paper, we introduce a method for computing confidence regions of closed-loop system trajectories under an online learning-based control law. We employ a sampling-based approximation and exploit system properties to prove that the computed confidence regions are correct with high probability. In a numerical simulation, we show that the proposed approach accurately predicts correct confidence regions. Copyright (C) 2020 The Authors.

Pages: 1007-1012

Page count: 6
