Safe control of nonlinear systems in LPV framework using model-based reinforcement learning

被引:7
作者
Bao, Yajie [1 ]
Velni, Javad Mohammadpour [1 ]
机构
[1] Univ Georgia, Sch Elect & Comp Engn, Athens, GA 30602 USA
基金
美国国家科学基金会;
关键词
Safe nonlinear control; model-based reinforcement learning; LPV framework; PREDICTIVE CONTROL; IDENTIFICATION;
D O I
10.1080/00207179.2022.2029945
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a safe model-based reinforcement learning (MBRL) approach to control nonlinear systems described by linear parameter-varying (LPV) models. A variational Bayesian inference Neural Network (BNN) approach is first employed to learn a state-space model with uncertainty quantification from input-output data collected from the system; the model is then utilised for training MBRL to learn control actions for the system with safety guarantees. Specifically, MBRL employs the BNN model to generate simulation environments for training, which avoids safety violations in the exploration stage. To adapt to dynamically varying environments, knowledge on the evolution of LPV model scheduling variables is incorporated in simulation to reduce the discrepancy between the transition distributions of simulation and real environments. Experiments on a parameter-varying double integrator system and a control moment gyroscope (CMG) simulation model demonstrate that the proposed approach can safely achieve desired control performance.
引用
收藏
页码:1078 / 1089
页数:12
相关论文
共 38 条
  • [1] LPV state-feedback control of a control moment gyroscope
    Abbas, Hossam Seddik
    Ali, Ahsan
    Hashemi, Seyed Mahdi
    Werner, Herbert
    [J]. CONTROL ENGINEERING PRACTICE, 2014, 24 : 129 - 137
  • [2] Akametalu AK, 2014, IEEE DECIS CONTR P, P1424, DOI 10.1109/CDC.2014.7039601
  • [3] Bao Y., 2020, DYN SYST CONTR C, V84270
  • [4] Bao YJ, 2021, 2021 EUROPEAN CONTROL CONFERENCE (ECC), P150, DOI 10.23919/ECC54610.0000/2021.9655004
  • [5] Identification of State-space Linear Parameter-varying Models Using Artificial Neural Networks
    Bao, Yajie
    Velni, Javad Mohammadpour
    Basina, Aditya
    Shahbakhti, Mahdi
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 5286 - 5291
  • [6] Epistemic Uncertainty Quantification in State-Space LPV Model Identification Using Bayesian Neural Networks
    Bao, Yajie
    Velni, Javad Mohammadpour
    Shahbakhti, Mahdi
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (02): : 719 - 724
  • [7] Variational Inference for Linear Systems with Latent Parameter Space
    Becker, Cassiano O.
    Preciado, Victor M.
    [J]. 2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 5662 - 5667
  • [8] Berkenkamp F, 2017, ADV NEUR IN, V30
  • [9] Variational Inference: A Review for Statisticians
    Blei, David M.
    Kucukelbir, Alp
    McAuliffe, Jon D.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) : 859 - 877
  • [10] Blundell C, 2015, PR MACH LEARN RES, V37, P1613