共 50 条
- [43] A Modified Average Reward Reinforcement Learning Based on Fuzzy Reward Function [J]. IMECS 2009: INTERNATIONAL MULTI-CONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2009, : 113 - 117
- [44] Skill Reward for Safe Deep Reinforcement Learning [J]. UBIQUITOUS SECURITY, 2022, 1557 : 203 - 213
- [45] On the Power of Global Reward Signals in Reinforcement Learning [J]. MULTIAGENT SYSTEM TECHNOLOGIES, 2011, 6973 : 53 - +
- [47] Reinforcement learning with nonstationary reward depending on the episode [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 2145 - 2150
- [48] Shaping reward learning approach from passive samples [J]. Ruan Jian Xue Bao/Journal of Software, 2013, 24 (11): : 2667 - 2675
- [50] Learning Robot Manipulation based on Modular Reward Shaping [J]. 11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 883 - 886