共 50 条
- [32] Enhanced Strategies for Off-Policy Reinforcement Learning Algorithms in HVAC Control 2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1691 - 1696
- [34] On the Convergence of Temporal-Difference Learning with Linear Function Approximation Machine Learning, 2001, 42 : 241 - 267
- [35] Off-Policy Learning-to-Bid with AuctionGym PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 4219 - 4228
- [40] Sequential Search with Off-Policy Reinforcement Learning PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4006 - 4015