Control invariant set enhanced safe reinforcement learning: Improved sampling efficiency, guaranteed stability and robustness

被引:2
|
作者
Bo, Song [1 ]
Agyeman, Bernard T. [1 ]
Yin, Xunyuan [2 ]
Liu, Jinfeng [1 ]
机构
[1] Univ Alberta, Dept Chem & Mat Engn, Edmonton, AB T6G 1H9, Canada
[2] Nanyang Technol Univ, Sch Chem Chem Engn & Biotechnol, Singapore 637459, Singapore
基金
加拿大自然科学与工程研究理事会;
关键词
Advanced process control; Robustness control invariant set; Reinforcement learning; Closed-loop stability; Sampling efficiency; MODEL-PREDICTIVE CONTROL; APPROXIMATIONS;
D O I
10.1016/j.compchemeng.2023.108413
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Reinforcement learning (RL) is an area of significant research interest, and safe RL in particular is attracting attention due to its ability to handle safety-driven constraints that are crucial for real-world applications. This work proposes a novel approach to RL training, called control invariant set (CIS) enhanced RL, which leverages the advantages of utilizing the explicit form of CIS to improve stability guarantees and sampling efficiency. Furthermore, the robustness of the proposed approach is investigated in the presence of uncertainty. The approach consists of two learning stages: offline and online. In the offline stage, CIS is incorporated into the reward design, initial state sampling, and state reset procedures. This incorporation of CIS facilitates improved sampling efficiency during the offline training process. In the online stage, RL is retrained whenever the predicted next step state is outside of the CIS, which serves as a stability criterion, by introducing a Safety Supervisor to examine the safety of the action and make necessary corrections. The stability analysis is conducted for both cases, with and without uncertainty. To evaluate the proposed approach, we apply it to a simulated chemical reactor. The results show a significant improvement in sampling efficiency during offline training and closed-loop stability guarantee in the online implementation, with and without uncertainty.
引用
收藏
页数:13
相关论文
共 11 条
  • [1] NUMERICAL ROBUSTNESS AND EFFICIENCY OF GENERALIZED PREDICTIVE CONTROL ALGORITHMS WITH GUARANTEED STABILITY
    ROSSITER, JA
    KOUVARITAKIS, B
    IEE PROCEEDINGS-CONTROL THEORY AND APPLICATIONS, 1994, 141 (03): : 154 - 162
  • [2] Improved Robustness Analysis of Reinforcement Learning Embedded Control Systems
    Kim, Jongrae
    ROBOT INTELLIGENCE TECHNOLOGY AND APPLICATIONS 6, 2022, 429 : 104 - 115
  • [3] Improved Robustness and Safety for Autonomous Vehicle Control with Adversarial Reinforcement Learning
    Ma, Xiaobai
    Driggs-Campbell, Katherine
    Kochenderfer, Mykel J.
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 1665 - 1671
  • [4] Improved Reinforcement Learning Using Stability Augmentation With Application to Quadrotor Attitude Control
    Wu, Hangxing
    Ye, Hui
    Xue, Wentao
    Yang, Xiaofei
    IEEE ACCESS, 2022, 10 : 67590 - 67604
  • [5] Intelligent generation method of power system stability control strategy based on safe reinforcement learning
    Qiu J.
    Zhu Y.
    Zhang J.
    Zhu Y.
    Xu G.
    Tu L.
    Dianli Xitong Baohu yu Kongzhi/Power System Protection and Control, 2024, 52 (10): : 147 - 155
  • [6] On-policy learning-based deep reinforcement learning assessment for building control efficiency and stability
    Lee, Joon-Yong
    Rahman, Aowabin
    Huang, Sen
    Smith, Amanda D.
    Katipamula, Srinivas
    SCIENCE AND TECHNOLOGY FOR THE BUILT ENVIRONMENT, 2022, 28 (09) : 1150 - 1165
  • [7] Adaptive optimal control of unknown discrete-time linear systems with guaranteed prescribed degree of stability using reinforcement learning
    Razavi, Seyed Ehsan
    Moradi, Mohammad Amin
    Shamaghdari, Saeed
    Menhaj, Mohammad Bagher
    INTERNATIONAL JOURNAL OF DYNAMICS AND CONTROL, 2022, 10 (03) : 870 - 878
  • [8] Adaptive optimal control of unknown discrete-time linear systems with guaranteed prescribed degree of stability using reinforcement learning
    Seyed Ehsan Razavi
    Mohammad Amin Moradi
    Saeed Shamaghdari
    Mohammad Bagher Menhaj
    International Journal of Dynamics and Control, 2022, 10 : 870 - 878
  • [9] Data Driven Decentralized Control of Inverter Based Renewable Energy Sources Using Safe Guaranteed Multi-Agent Deep Reinforcement Learning
    Zhang, Mengfan
    Guo, Guodong
    Magnusson, Sindri
    Pilawa-Podgurski, Robert C. N.
    Xu, Qianwen
    IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2024, 15 (02) : 1288 - 1299
  • [10] Online Preventive Control for Transmission Overload Relief Using Safe Reinforcement Learning With Enhanced Spatial-Temporal Awareness
    Cui, Han
    Ye, Yujian
    Hu, Jianxiong
    Tang, Yi
    Lin, Zizhao
    Strbac, Goran
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2024, 39 (01) : 517 - 532