共 50 条
[41]
Reward-Free Policy Space Compression for Reinforcement Learning
[J].
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151,
2022, 151
[42]
Pessimistic Reward Models for Off-Policy Learning in Recommendation
[J].
15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021),
2021,
:63-74
[43]
Transfer Learning for Direct Policy Search: A Reward Shaping Approach
[J].
2013 IEEE THIRD JOINT INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING AND EPIGENETIC ROBOTICS (ICDL),
2013,
[46]
Sequence Prediction with Unlabeled Data by Reward Function Learning
[J].
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE,
2017,
:3098-3104
[48]
Survey of apprenticeship learning based on reward function approximating
[J].
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition),
2008, 36 (SUPPL. 1)
:288-290
[49]
Design of Reward Function on Reinforcement Learning for Automated Driving
[J].
IFAC PAPERSONLINE,
2023, 56 (02)
:7948-7953
[50]
Unsupervised Reinforcement Learning For Video Summarization Reward Function
[J].
PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019),
2019,
:40-44