Cautious Nonlinear Covariance Steering using Variational Gaussian Process Predictive Models

被引:1
作者
Tsolovikos, Alexandros [1 ]
Bakolas, Efstathios [1 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
来源
IFAC PAPERSONLINE | 2021年 / 54卷 / 20期
关键词
nonparametric methods; nonlinear system identification; stochastic system identification; covariance steering; stochastic optimal control problems;
D O I
10.1016/j.ifacol.2021.11.153
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we consider the problem of steering the first two moments of the uncertain state of an unknown discrete-time stochastic nonlinear system to a given terminal distribution in finite time. Toward that goal, first, a non-parametric predictive model is learned from a set of available training data points using stochastic variational Gaussian process regression: a powerful and highly scalable machine learning tool for learning distributions over arbitrary nonlinear functions. Second, we formulate a tractable nonlinear covariance steering algorithm that utilizes the learned Gaussian process predictive model to compute a feedback policy that will drive the distribution of the state of the system close to the goal distribution. In a greedy approach, we linearize the Gaussian process model at each time step around the latest predicted mean and covariance, solve the linear covariance steering problem, and propagate the state statistics to the next time step using the unscented transform. This process is then repeated in a shrinking-horizon model predictive control fashion. The cautiousness of the Gaussian process predictive model, which captures both the process noise and modeling errors, is demonstrated in numerical simulations. Copyright (C) 2021 The Authors.
引用
收藏
页码:59 / 64
页数:6
相关论文
共 28 条
  • [1] Bakolas E., 2020, ACC
  • [2] Finite-horizon covariance control for discrete-time stochastic linear systems subject to input constraints
    Bakolas, Efstathios
    [J]. AUTOMATICA, 2018, 91 : 61 - 68
  • [3] Bakolas E, 2016, IEEE DECIS CONTR P, P1153
  • [4] Caluya KF, 2019, ARXIV190402272
  • [5] Chen Y., 2016, IEEE T AUTOM CONTROL
  • [6] Chen YX, 2016, IEEE T AUTOMAT CONTR, V61, P1170, DOI 10.1109/TAC.2015.2457791
  • [7] Gardner Jacob R, 2018, ARXIV180911165
  • [8] Goldshtein M, 2017, IEEE DECIS CONTR P, DOI 10.1109/CDC.2017.8264189
  • [9] Grimes D.B., 2006, P ROBOTICS SCI SYSTE, P199
  • [10] Hensman J., 2013, PROC 29 C UNCERTAINT, P282, DOI DOI 10.48550/ARXIV.1309.6835