50 entries in total
- [42] Quasi-Stochastic Approximation and Off-Policy Reinforcement Learning. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019: 5244-5251
- [43] Distributed off-Policy Actor-Critic Reinforcement Learning with Policy Consensus. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019: 4674-4679
- [44] Conformal Off-Policy Prediction. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023
- [45] Research on Experience Replay of Off-policy Deep Reinforcement Learning: A Review. Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49(11): 2237-2256
- [46] Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023
- [47] Model-free off-policy reinforcement learning in continuous environment. 2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004: 1091-1096
- [48] Boosted Off-Policy Learning. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023
- [49] Re-attentive experience replay in off-policy reinforcement learning. Machine Learning, 2024, 113: 2327-2349
- [50] Value-Aware Importance Weighting for Off-Policy Reinforcement Learning. CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023: 745-763