共 50 条
- [1] TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02): : 2621 - 2628
- [2] Safe Off-policy Reinforcement Learning Using Barrier Functions 2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 2176 - 2181
- [6] Sequential Search with Off-Policy Reinforcement Learning PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4006 - 4015
- [9] Off-policy and on-policy reinforcement learning with the Tsetlin machine Applied Intelligence, 2023, 53 : 8596 - 8613