共 50 条
[21]
Model-free Policy Learning with Reward Gradients
[J].
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151,
2022, 151
[22]
Dynamic Adjustment of Reward Function for Proximal Policy Optimization with Imitation Learning: Application to Automated Parking Systems
[J].
2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV),
2022,
:1400-1408
[24]
CHILDRENS DISCRIMINATION LEARNING AS A FUNCTION OF REWARD AND PUNISHMENT
[J].
JOURNAL OF COMPARATIVE AND PHYSIOLOGICAL PSYCHOLOGY,
1961, 54 (04)
:449-&
[25]
Evolution of an Internal Reward Function for Reinforcement Learning
[J].
PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION,
2023,
:351-354
[26]
Active reward learning with a novel acquisition function
[J].
Autonomous Robots,
2015, 39
:389-405
[27]
LEARNING IN HONEYBEES AS A FUNCTION OF AMOUNT AND FREQUENCY OF REWARD
[J].
ANIMAL LEARNING & BEHAVIOR,
1988, 16 (03)
:247-255
[28]
A Humanoid Robot Standing Up Through Learning from Demonstration Using a Multimodal Reward Function
[J].
2013 13TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS),
2013,
:74-79
[29]
Average-Reward Off-Policy Policy Evaluation with Function Approximation
[J].
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139,
2021, 139