Safety reinforcement learning control via transfer learning

被引：1

作者：

Zhang, Quanqi ^{[1
]}

Wu, Chengwei ^{[1
]}

Tian, Haoyu ^{[1
]}

Gao, Yabin ^{[1
]}

Yao, Weiran ^{[1
]}

Wu, Ligang ^{[1
]}

机构：

[1] Harbin Inst Technol, Dept Control Sci & Engn, Harbin 150001, Peoples R China

来源：

AUTOMATICA | 2024年 / 166卷

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning control; Safety; Stability; Transfer learning; LYAPUNOV FUNCTIONS;

D O I：

10.1016/j.automatica.2024.111714

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement learning (RL) has emerged as a promising approach for modern control systems. However, its success in real-world applications has been limited due to the lack of safety guarantees. To address this issue, the authors present a novel transfer learning framework that facilitates policy training in a non-dangerous environment, followed by transfer of the trained policy to the original dangerous environment. The transferred policy is theoretically proven to stabilize the original system while maintaining safety. Additionally, we propose an uncertainty learning algorithm incorporated in RL that overcomes natural data cascading and data evolution problems in RL to enhance learning accuracy. The transfer learning framework avoids trial-and-error in unsafe environments, ensuring not only after-learning safety but, more importantly, addressing the challenging problem of safe exploration during learning. Simulation results demonstrate the promise of the transfer learning framework for RL safety control on the task of vehicle lateral stability control with safety constraints. (c) 2024 Elsevier Ltd. All rights reserved.

引用

页数：9

共 50 条

[21] Automated Transfer for Reinforcement Learning Tasks
Ammar, Haitham Bou
Chen, Siqi
Tuyls, Karl
Weiss, Gerhard
KUNSTLICHE INTELLIGENZ, 2014, 28 (01): : 7 - 14
[22] Learning relational options for inductive transfer in relational reinforcement learning
Croonenborghs, Tom
Driessens, Kurt
Bruynooghe, Maurice
INDUCTIVE LOGIC PROGRAMMING, 2008, 4894 : 88 - 97
[23] A substructure transfer reinforcement learning method based on metric learning
Chai, Peihua
Chen, Bilian
Zeng, Yifeng
Yu, Shenbao
NEUROCOMPUTING, 2024, 598
[24] Multi-source Transfer Learning for Deep Reinforcement Learning
Garcia-Ramirez, Jesus
Morales, Eduardo
Escalante, Hugo Jair
PATTERN RECOGNITION (MCPR 2021), 2021, 12725 : 131 - 140
[25] Local instance-based transfer learning for reinforcement learning
Li, Xiaoguang
Ji, Wanting
Huang, Jidong
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
[26] Learning to Predict Consequences as a Method of Knowledge Transfer in Reinforcement Learning
Chalmers, Eric
Contreras, Edgar Bermudez
Robertson, Brandon
Luczak, Artur
Gruber, Aaron
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2259 - 2270
[27] A Hybrid Cloud and Edge Control Strategy for Demand Responses Using Deep Reinforcement Learning and Transfer Learning
Tao, Yuechuan
Qiu, Jing
Lai, Shuying
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (01) : 56 - 71
[28] Intelligent H∞ Control for UAVs via Fuzzy Deep Reinforcement Learning
Cheng, Haoyu
Wang, Meng
Ma, Yifeng
Jiao, Jiayue
Song, Ruijia
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 7182 - 7187
[29] Transfer learning with Partially Constrained Models: Application to reinforcement learning of linked multicomponent robot system control
Fernandez-Gauna, Borja
Manuel Lopez-Guede, Jose
Grana, Manuel
ROBOTICS AND AUTONOMOUS SYSTEMS, 2013, 61 (07) : 694 - 703
[30] Aggregation Transfer Learning for Multi-Agent Reinforcement learning
Xu, Dongsheng
Qiao, Peng
Dou, Yong
2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 547 - 551

← 1 2 3 4 5 →