50 entries in total
- [42] On the Power of Global Reward Signals in Reinforcement Learning. MULTIAGENT SYSTEM TECHNOLOGIES, 2011, 6973: 53 - +
- [44] DISCRIMINATION OF REWARD IN LEARNING WITH PARTIAL AND CONTINUOUS REINFORCEMENT. JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1962, 64 (03): 227 - &
- [45] Evolution of an Internal Reward Function for Reinforcement Learning. PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION, 2023: 351 - 354
- [46] Reinforcement learning with nonstationary reward depending on the episode. 2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011: 2145 - 2150
- [47] Inverse Reinforcement Learning with the Average Reward Criterion. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
- [49] Balancing multiple sources of reward in reinforcement learning. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13: 1082 - 1088