A comprehensive survey on safe reinforcement learning

被引:0
|
作者
García, Javier [1 ]
Fernández, Fernando [1 ]
机构
[1] Universidad Carlos III de Madrid, Avenida de la Universidad 30, Leganes, Madrid,28911, Spain
关键词
Reinforcement learning;
D O I
暂无
中图分类号
学科分类号
摘要
Safe Reinforcement Learning can be defined as the process of learning policies that maximize the expectation of the return in problems in which it is important to ensure reasonable system performance and/or respect safety constraints during the learning and/or deployment processes. We categorize and analyze two approaches of Safe Reinforcement Learning. The first is based on the modification of the optimality criterion, the classic discounted finite/infinite horizon, with a safety factor. The second is based on the modification of the exploration process through the incorporation of external knowledge or the guidance of a risk metric. We use the proposed classification to survey the existing literature, as well as suggesting future directions for Safe Reinforcement Learning. © 2015 Javier Garćia and Fernando Fernandez.
引用
收藏
页码:1437 / 1480
相关论文
共 50 条
  • [1] A Comprehensive Survey on Safe Reinforcement Learning
    Garcia, Javier
    Fernandez, Fernando
    JOURNAL OF MACHINE LEARNING RESEARCH, 2015, 16 : 1437 - 1480
  • [2] Safe Reinforcement Learning: A Survey
    Wang X.-S.
    Wang R.-R.
    Cheng Y.-H.
    Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (09): : 1813 - 1835
  • [3] Hierarchical Reinforcement Learning: A Comprehensive Survey
    Pateria, Shubham
    Subagdja, Budhitama
    Tan, Ah-hwee
    Quek, Chai
    ACM COMPUTING SURVEYS, 2021, 54 (05)
  • [4] A comprehensive survey of multiagent reinforcement learning
    Busoniu, Lucian
    Babuska, Robert
    De Schutter, Bart
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02): : 156 - 172
  • [5] A Survey of Constraint Formulations in Safe Reinforcement Learning
    Wachi, Akifumi
    Shen, Xun
    Sui, Yanan
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 8262 - 8271
  • [6] Reinforcement learning in robotic applications: a comprehensive survey
    Singh, Bharat
    Kumar, Rajesh
    Singh, Vinay Pratap
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (02) : 945 - 990
  • [7] Reinforcement Learning for IoT Security: A Comprehensive Survey
    Uprety, Aashma
    Rawat, Danda B.
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (11): : 8693 - 8706
  • [8] Reinforcement learning in robotic applications: a comprehensive survey
    Bharat Singh
    Rajesh Kumar
    Vinay Pratap Singh
    Artificial Intelligence Review, 2022, 55 : 945 - 990
  • [9] Safe reinforcement learning and its applications in robotics: A survey
    Zhang C.-X.
    Zhang X.-L.
    Xu X.
    Lu Y.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (12): : 2090 - 2103
  • [10] State-wise Safe Reinforcement Learning: A Survey
    Zhao, Weiye
    He, Tairan
    Chen, Rui
    Wei, Tianhao
    Liu, Changliu
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 6814 - 6822