A Review of Safe Reinforcement Learning: Methods, Theories, and Applications

被引:4
|
作者
Gu, Shangding [1 ]
Yang, Long [3 ]
Du, Yali [4 ]
Chen, Guang [5 ]
Walter, Florian [2 ]
Wang, Jun [6 ]
Knoll, Alois [2 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Tech Univ Munich, Dept Informat, D-85748 Munich, Germany
[3] Peking Univ, Inst AI, Beijing 100871, Peoples R China
[4] Kings Coll London, Dept Informat, London WC1E 6EB, England
[5] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
[6] UCL, Dept Comp Sci, London WC1E 6BT, England
基金
中国国家自然科学基金;
关键词
Safe reinforcement learning (RL); safety optimisation; constrained Markov decision processes; safety problems; MARKOV DECISION-PROCESSES; ACTOR-CRITIC ALGORITHM; APPROXIMATION; MODEL; NETWORKS; POLICIES; CHAINS;
D O I
10.1109/TPAMI.2024.3457538
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-making tasks. However, safety concerns are raised during deploying RL in real-world applications, leading to a growing demand for safe RL algorithms, such as in autonomous driving and robotics scenarios. While safe control has a long history, the study of safe RL algorithms is still in the early stages. To establish a good foundation for future safe RL research, in this paper, we provide a review of safe RL from the perspectives of methods, theories, and applications. First, we review the progress of safe RL from five dimensions and come up with five crucial problems for safe RL being deployed in real-world applications, coined as "2H3W". Second, we analyze the algorithm and theory progress from the perspectives of answering the "2H3W" problems. Particularly, the sample complexity of safe RL algorithms is reviewed and discussed, followed by an introduction to the applications and benchmarks of safe RL algorithms. Finally, we open the discussion of the challenging problems in safe RL, hoping to inspire future research on this thread. To advance the study of safe RL algorithms, we release an open-sourced repository containing major safe RL algorithms at the link.
引用
收藏
页码:11216 / 11235
页数:20
相关论文
共 50 条
  • [31] Safe Reinforcement Learning With Dual Robustness
    Li, Zeyang
    Hu, Chuxiong
    Wang, Yunan
    Yang, Yujie
    Li, Shengbo Eben
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10876 - 10890
  • [32] Safe reinforcement learning for dynamical games
    Yang, Yongliang
    Vamvoudakis, Kyriakos G.
    Modares, Hamidreza
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2020, 30 (09) : 3706 - 3726
  • [33] Safe Reinforcement Learning for Legged Locomotion
    Yang, Tsung-Yen
    Zhang, Tingnan
    Luu, Linda
    Ha, Sehoon
    Tan, Jie
    Yu, Wenhao
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 2454 - 2461
  • [34] Safe Reinforcement Learning via Shielding
    Alshiekh, Mohammed
    Bloem, Roderick
    Ehlers, Ruediger
    Koenighofer, Bettina
    Niekum, Scott
    Topcu, Ufuk
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2669 - 2678
  • [35] A Comprehensive Survey on Safe Reinforcement Learning
    Garcia, Javier
    Fernandez, Fernando
    JOURNAL OF MACHINE LEARNING RESEARCH, 2015, 16 : 1437 - 1480
  • [36] Safe Reinforcement Learning in Uncertain Contexts
    Baumann, Dominik
    Schon, Thomas B.
    IEEE TRANSACTIONS ON ROBOTICS, 2024, 40 : 1828 - 1841
  • [37] Safe Reinforcement Learning on Autonomous Vehicles
    Isele, David
    Nakhaei, Alireza
    Fujimura, Kikuo
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 6162 - 6167
  • [38] Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning
    Brunke, Lukas
    Greeff, Melissa
    Hall, Adam W.
    Yuan, Zhaocong
    Zhou, Siqi
    Panerati, Jacopo
    Schoellig, Angela P.
    ANNUAL REVIEW OF CONTROL ROBOTICS AND AUTONOMOUS SYSTEMS, 2022, 5 : 411 - 444
  • [39] A Review of Deep Reinforcement Learning Methods and Military Application Research
    Wang, Ning
    Li, Zhe
    Liang, Xiaolong
    Hou, Yueqi
    Yang, Aiwu
    Mathematical Problems in Engineering, 2023, 2023
  • [40] Multi-Agent Reinforcement Learning: A Review of Challenges and Applications
    Canese, Lorenzo
    Cardarilli, Gian Carlo
    Di Nunzio, Luca
    Fazzolari, Rocco
    Giardino, Daniele
    Re, Marco
    Spano, Sergio
    APPLIED SCIENCES-BASEL, 2021, 11 (11):