50 entries in total
- [31] Off-Policy Reinforcement Learning with Delayed Rewards INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
- [32] Adaptive Auxiliary Task Weighting for Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [33] Learning Circuit Placement Techniques through Reinforcement Learning with Adaptive Rewards 2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024
- [35] Split Q Learning: Reinforcement Learning with Two-Stream Rewards PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019: 6448 - 6449
- [37] Exploring selfish reinforcement learning in repeated games with stochastic rewards Autonomous Agents and Multi-Agent Systems, 2007, 14 : 239 - 269
- [38] REVERSAL LEARNING IN A SUCCESSIVE DISCRIMINATION USING INTERMITTENT REINFORCEMENT JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1970, 84 (01): 181 - &
- [39] No-Regret Reinforcement Learning with Heavy-Tailed Rewards 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
- [40] Potential-Based Difference Rewards for Multiagent Reinforcement Learning AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2014: 165 - 172