共 50 条
- [1] Safe and efficient off-policy reinforcement learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
- [2] Bounds for Off-policy Prediction in Reinforcement Learning 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3991 - 3997
- [4] Representations for Stable Off-Policy Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [6] On the Reuse Bias in Off-Policy Reinforcement Learning PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4513 - 4521
- [7] A perspective on off-policy evaluation in reinforcement learning Frontiers of Computer Science, 2019, 13 : 911 - 912
- [9] Sequential Search with Off-Policy Reinforcement Learning PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4006 - 4015
- [10] Representations for Stable Off-Policy Reinforcement Learning 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,