A comprehensive survey on safe reinforcement learning

被引：0

作者：

García, Javier ^{[1
]}

Fernández, Fernando ^{[1
]}

机构：

[1] Universidad Carlos III de Madrid, Avenida de la Universidad 30, Leganes, Madrid,28911, Spain

来源：

Journal of Machine Learning Research | 2015年 / 16卷

关键词：

Reinforcement learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Safe Reinforcement Learning can be defined as the process of learning policies that maximize the expectation of the return in problems in which it is important to ensure reasonable system performance and/or respect safety constraints during the learning and/or deployment processes. We categorize and analyze two approaches of Safe Reinforcement Learning. The first is based on the modification of the optimality criterion, the classic discounted finite/infinite horizon, with a safety factor. The second is based on the modification of the exploration process through the incorporation of external knowledge or the guidance of a risk metric. We use the proposed classification to survey the existing literature, as well as suggesting future directions for Safe Reinforcement Learning. © 2015 Javier Garćia and Fernando Fernandez.

引用

页码：1437 / 1480

共 50 条

[1] A Comprehensive Survey on Safe Reinforcement Learning
Garcia, Javier
Fernandez, Fernando
JOURNAL OF MACHINE LEARNING RESEARCH, 2015, 16 : 1437 - 1480
[2] Safe Reinforcement Learning: A Survey
Wang X.-S.
Wang R.-R.
Cheng Y.-H.
Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (09): : 1813 - 1835
[3] Hierarchical Reinforcement Learning: A Comprehensive Survey
Pateria, Shubham
Subagdja, Budhitama
Tan, Ah-hwee
Quek, Chai
ACM COMPUTING SURVEYS, 2021, 54 (05)
[4] A comprehensive survey of multiagent reinforcement learning
Busoniu, Lucian
Babuska, Robert
De Schutter, Bart
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02): : 156 - 172
[5] A Survey of Constraint Formulations in Safe Reinforcement Learning
Wachi, Akifumi
Shen, Xun
Sui, Yanan
PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 8262 - 8271
[6] Reinforcement learning in robotic applications: a comprehensive survey
Singh, Bharat
Kumar, Rajesh
Singh, Vinay Pratap
ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (02) : 945 - 990
[7] Reinforcement Learning for IoT Security: A Comprehensive Survey
Uprety, Aashma
Rawat, Danda B.
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (11): : 8693 - 8706
[8] Reinforcement learning in robotic applications: a comprehensive survey
Bharat Singh
Rajesh Kumar
Vinay Pratap Singh
Artificial Intelligence Review, 2022, 55 : 945 - 990
[9] Safe reinforcement learning and its applications in robotics: A survey
Zhang C.-X.
Zhang X.-L.
Xu X.
Lu Y.
Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (12): : 2090 - 2103
[10] State-wise Safe Reinforcement Learning: A Survey
Zhao, Weiye
He, Tairan
Chen, Rui
Wei, Tianhao
Liu, Changliu
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 6814 - 6822

← 1 2 3 4 5 →