共 50 条
- [1] Conformal Off-Policy Prediction in Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [2] Conformal Off-Policy Evaluation in Markov Decision Processes 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3087 - 3094
- [3] Importance Resampling for Off-policy Prediction ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [4] Off-Policy Evaluation via Off-Policy Classification ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [5] Bounds for Off-policy Prediction in Reinforcement Learning 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3991 - 3997
- [6] Prediction in Intelligence: An Empirical Comparison of Off-policy Algorithms on Robots AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 332 - 340
- [8] Off-Policy Proximal Policy Optimization THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9162 - 9170
- [9] A Nonparametric Off-Policy Policy Gradient INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
- [10] Boosted Off-Policy Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206