共 50 条
- [32] Off-Policy Conservative Distributional Reinforcement Learning With Safety Constraints IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (03): : 2033 - 2045
- [33] Policy Return: A New Method for Reducing the Number of Experimental Trials in Deep Reinforcement Learning IEEE ACCESS, 2020, 8 : 228099 - 228107
- [37] Enhanced Strategies for Off-Policy Reinforcement Learning Algorithms in HVAC Control 2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1691 - 1696
- [38] Re-attentive experience replay in off-policy reinforcement learning Machine Learning, 2024, 113 : 2327 - 2349
- [39] Model-free off-policy reinforcement learning in continuous environment 2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 1091 - 1096
- [40] Cautious policy programming: exploiting KL regularization for monotonic policy improvement in reinforcement learning Machine Learning, 2023, 112 : 4527 - 4562