Safe control of nonlinear systems in LPV framework using model-based reinforcement learning

被引:7
作者
Bao, Yajie [1 ]
Velni, Javad Mohammadpour [1 ]
机构
[1] Univ Georgia, Sch Elect & Comp Engn, Athens, GA 30602 USA
基金
美国国家科学基金会;
关键词
Safe nonlinear control; model-based reinforcement learning; LPV framework; PREDICTIVE CONTROL; IDENTIFICATION;
D O I
10.1080/00207179.2022.2029945
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a safe model-based reinforcement learning (MBRL) approach to control nonlinear systems described by linear parameter-varying (LPV) models. A variational Bayesian inference Neural Network (BNN) approach is first employed to learn a state-space model with uncertainty quantification from input-output data collected from the system; the model is then utilised for training MBRL to learn control actions for the system with safety guarantees. Specifically, MBRL employs the BNN model to generate simulation environments for training, which avoids safety violations in the exploration stage. To adapt to dynamically varying environments, knowledge on the evolution of LPV model scheduling variables is incorporated in simulation to reduce the discrepancy between the transition distributions of simulation and real environments. Experiments on a parameter-varying double integrator system and a control moment gyroscope (CMG) simulation model demonstrate that the proposed approach can safely achieve desired control performance.
引用
收藏
页码:1078 / 1089
页数:12
相关论文
共 38 条
[1]   LPV state-feedback control of a control moment gyroscope [J].
Abbas, Hossam Seddik ;
Ali, Ahsan ;
Hashemi, Seyed Mahdi ;
Werner, Herbert .
CONTROL ENGINEERING PRACTICE, 2014, 24 :129-137
[2]  
Akametalu AK, 2014, IEEE DECIS CONTR P, P1424, DOI 10.1109/CDC.2014.7039601
[3]  
[Anonymous], 2019, Advances in Neural Information Processing Systems (NeurIPS)
[4]  
[Anonymous], 2013, PLAYING ATARI DEEP R
[5]  
[Anonymous], 2019, ADV NEUR IN
[6]  
[Anonymous], 2016, INT C LEARN REPR
[7]  
Bao Y., 2020, DYN SYST CONTR C, V84270
[8]  
Bao YJ, 2021, 2021 EUROPEAN CONTROL CONFERENCE (ECC), P150, DOI 10.23919/ECC54610.0000/2021.9655004
[9]   Identification of State-space Linear Parameter-varying Models Using Artificial Neural Networks [J].
Bao, Yajie ;
Velni, Javad Mohammadpour ;
Basina, Aditya ;
Shahbakhti, Mahdi .
IFAC PAPERSONLINE, 2020, 53 (02) :5286-5291
[10]   Epistemic Uncertainty Quantification in State-Space LPV Model Identification Using Bayesian Neural Networks [J].
Bao, Yajie ;
Velni, Javad Mohammadpour ;
Shahbakhti, Mahdi .
IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (02) :719-724