共 50 条
- [23] Off-Policy Conservative Distributional Reinforcement Learning With Safety Constraints IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (03): : 2033 - 2045
- [24] HiER: Highlight Experience Replay for Boosting Off-Policy Reinforcement Learning Agents IEEE ACCESS, 2024, 12 : 100102 - 100119
- [26] A General Technique to Combine Off-Policy Reinforcement Learning Algorithms with Satellite Attitude Control PROCEEDINGS OF 2019 CHINESE INTELLIGENT AUTOMATION CONFERENCE, 2020, 586 : 709 - 719
- [27] Optimal Control of Iron-Removal Systems Based on Off-Policy Reinforcement Learning IEEE ACCESS, 2020, 8 (08): : 149730 - 149740
- [28] Enhanced Strategies for Off-Policy Reinforcement Learning Algorithms in HVAC Control 2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1691 - 1696
- [29] Model-free off-policy reinforcement learning in continuous environment 2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 1091 - 1096
- [30] Research on Experience Replay of Off-policy Deep Reinforcement Learning: A Review Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (11): : 2237 - 2256