Safety reinforcement learning control via transfer learning

被引:1
|
作者
Zhang, Quanqi [1 ]
Wu, Chengwei [1 ]
Tian, Haoyu [1 ]
Gao, Yabin [1 ]
Yao, Weiran [1 ]
Wu, Ligang [1 ]
机构
[1] Harbin Inst Technol, Dept Control Sci & Engn, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning control; Safety; Stability; Transfer learning; LYAPUNOV FUNCTIONS;
D O I
10.1016/j.automatica.2024.111714
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) has emerged as a promising approach for modern control systems. However, its success in real-world applications has been limited due to the lack of safety guarantees. To address this issue, the authors present a novel transfer learning framework that facilitates policy training in a non-dangerous environment, followed by transfer of the trained policy to the original dangerous environment. The transferred policy is theoretically proven to stabilize the original system while maintaining safety. Additionally, we propose an uncertainty learning algorithm incorporated in RL that overcomes natural data cascading and data evolution problems in RL to enhance learning accuracy. The transfer learning framework avoids trial-and-error in unsafe environments, ensuring not only after-learning safety but, more importantly, addressing the challenging problem of safe exploration during learning. Simulation results demonstrate the promise of the transfer learning framework for RL safety control on the task of vehicle lateral stability control with safety constraints. (c) 2024 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Automated Transfer for Reinforcement Learning Tasks
    Ammar, Haitham Bou
    Chen, Siqi
    Tuyls, Karl
    Weiss, Gerhard
    KUNSTLICHE INTELLIGENZ, 2014, 28 (01): : 7 - 14
  • [22] Learning relational options for inductive transfer in relational reinforcement learning
    Croonenborghs, Tom
    Driessens, Kurt
    Bruynooghe, Maurice
    INDUCTIVE LOGIC PROGRAMMING, 2008, 4894 : 88 - 97
  • [23] A substructure transfer reinforcement learning method based on metric learning
    Chai, Peihua
    Chen, Bilian
    Zeng, Yifeng
    Yu, Shenbao
    NEUROCOMPUTING, 2024, 598
  • [24] Multi-source Transfer Learning for Deep Reinforcement Learning
    Garcia-Ramirez, Jesus
    Morales, Eduardo
    Escalante, Hugo Jair
    PATTERN RECOGNITION (MCPR 2021), 2021, 12725 : 131 - 140
  • [25] Local instance-based transfer learning for reinforcement learning
    Li, Xiaoguang
    Ji, Wanting
    Huang, Jidong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [26] Learning to Predict Consequences as a Method of Knowledge Transfer in Reinforcement Learning
    Chalmers, Eric
    Contreras, Edgar Bermudez
    Robertson, Brandon
    Luczak, Artur
    Gruber, Aaron
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2259 - 2270
  • [27] A Hybrid Cloud and Edge Control Strategy for Demand Responses Using Deep Reinforcement Learning and Transfer Learning
    Tao, Yuechuan
    Qiu, Jing
    Lai, Shuying
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (01) : 56 - 71
  • [28] Intelligent H∞ Control for UAVs via Fuzzy Deep Reinforcement Learning
    Cheng, Haoyu
    Wang, Meng
    Ma, Yifeng
    Jiao, Jiayue
    Song, Ruijia
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 7182 - 7187
  • [29] Transfer learning with Partially Constrained Models: Application to reinforcement learning of linked multicomponent robot system control
    Fernandez-Gauna, Borja
    Manuel Lopez-Guede, Jose
    Grana, Manuel
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2013, 61 (07) : 694 - 703
  • [30] Aggregation Transfer Learning for Multi-Agent Reinforcement learning
    Xu, Dongsheng
    Qiao, Peng
    Dou, Yong
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 547 - 551