A Review of Safe Reinforcement Learning: Methods, Theories, and Applications

被引:4
|
作者
Gu, Shangding [1 ]
Yang, Long [3 ]
Du, Yali [4 ]
Chen, Guang [5 ]
Walter, Florian [2 ]
Wang, Jun [6 ]
Knoll, Alois [2 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Tech Univ Munich, Dept Informat, D-85748 Munich, Germany
[3] Peking Univ, Inst AI, Beijing 100871, Peoples R China
[4] Kings Coll London, Dept Informat, London WC1E 6EB, England
[5] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
[6] UCL, Dept Comp Sci, London WC1E 6BT, England
基金
中国国家自然科学基金;
关键词
Safe reinforcement learning (RL); safety optimisation; constrained Markov decision processes; safety problems; MARKOV DECISION-PROCESSES; ACTOR-CRITIC ALGORITHM; APPROXIMATION; MODEL; NETWORKS; POLICIES; CHAINS;
D O I
10.1109/TPAMI.2024.3457538
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-making tasks. However, safety concerns are raised during deploying RL in real-world applications, leading to a growing demand for safe RL algorithms, such as in autonomous driving and robotics scenarios. While safe control has a long history, the study of safe RL algorithms is still in the early stages. To establish a good foundation for future safe RL research, in this paper, we provide a review of safe RL from the perspectives of methods, theories, and applications. First, we review the progress of safe RL from five dimensions and come up with five crucial problems for safe RL being deployed in real-world applications, coined as "2H3W". Second, we analyze the algorithm and theory progress from the perspectives of answering the "2H3W" problems. Particularly, the sample complexity of safe RL algorithms is reviewed and discussed, followed by an introduction to the applications and benchmarks of safe RL algorithms. Finally, we open the discussion of the challenging problems in safe RL, hoping to inspire future research on this thread. To advance the study of safe RL algorithms, we release an open-sourced repository containing major safe RL algorithms at the link.
引用
收藏
页码:11216 / 11235
页数:20
相关论文
共 50 条
  • [41] Review of deep reinforcement learning and its applications in military field
    Zhang M.
    Dou Y.
    Chen Z.
    Jiang J.
    Yang K.
    Ge B.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2024, 46 (04): : 1297 - 1308
  • [42] Review of reinforcement learning applications in segmentation, chemotherapy, and radiotherapy of cancer
    Khajuria, Rishi
    Sarwar, Abid
    MICRON, 2024, 178
  • [43] A review on reinforcement learning algorithms and applications in supply chain management
    Rolf, Benjamin
    Jackson, Ilya
    Mueller, Marcel
    Lang, Sebastian
    Reggelin, Tobias
    Ivanov, Dmitry
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2023, 61 (20) : 7151 - 7179
  • [44] A review of reinforcement learning for natural language processing and applications in healthcare
    Liu, Ying
    Wang, Haozhu
    Zhou, Huixue
    Li, Mingchen
    Hou, Yu
    Zhou, Sicheng
    Wang, Fang
    Hoetzlein, Rama
    Zhang, Rui
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (10) : 2379 - 2393
  • [45] A review of reinforcement learning applications in adaptive traffic signal control
    Miletic, Mladen
    Ivanjko, Edouard
    Greguric, Martin
    Kusic, Kresimir
    IET INTELLIGENT TRANSPORT SYSTEMS, 2022, 16 (10) : 1269 - 1285
  • [46] Applications of reinforcement learning for building energy efficiency control: A review
    Fu, Qiming
    Han, Zhicong
    Chen, Jianping
    Lu, You
    Wu, Hongjie
    Wang, Yunzhe
    JOURNAL OF BUILDING ENGINEERING, 2022, 50
  • [47] A Review on the Applications of Reinforcement Learning Control for Power Electronic Converters
    Chen, Peng
    Zhao, Jianfeng
    Liu, Kangli
    Zhou, Jingyang
    Dong, Kun
    Li, Yufan
    Guo, Xirui
    Pan, Xin
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2024, 60 (06) : 8430 - 8450
  • [48] Review on Applications of Deep Reinforcement Learning in Regulation of Microgrid Systems
    Zhang Y.
    Lin Y.
    Huang G.
    Yang X.
    Weng G.
    Zhou Z.
    Dianwang Jishu/Power System Technology, 2023, 47 (07): : 2775 - 2787
  • [49] A Review of Multi-agent Reinforcement Learning Theory and Applications
    Chen, Zhuoran
    Liu, Zeyang
    Wan, Lipeng
    Chen, Xingyu
    Zhu, Yameng
    Wang, Chengze
    Cheng, Xiang
    Zhang, Ya
    Zhang, Senlin
    Wang, Xiaohui
    Lan, Xuguang
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2024, 37 (10): : 851 - 872
  • [50] Review and Evaluation of Reinforcement Learning Frameworks on Smart Grid Applications
    Vamvakas, Dimitrios
    Michailidis, Panagiotis
    Korkas, Christos
    Kosmatopoulos, Elias
    ENERGIES, 2023, 16 (14)