共 50 条
- [1] Off-Policy Evaluation via the Regularized Lagrangian ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [2] Universal Off-Policy Evaluation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
- [3] Off-Policy Evaluation for Human Feedback ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [4] Off-policy evaluation for slate recommendation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
- [5] High Confidence Off-Policy Evaluation PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3000 - 3006
- [6] State Relevance for Off-Policy Evaluation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [7] Evaluating the Robustness of Off-Policy Evaluation 15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021), 2021, : 114 - 123
- [8] Off-Policy Proximal Policy Optimization THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9162 - 9170
- [9] A Nonparametric Off-Policy Policy Gradient INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
- [10] Representation Balancing MDPs for Off-Policy Policy Evaluation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31