共 17 条
- [1] Andrychowicz M., 2017, Advances in neural information processing systems, P5048
- [3] De Bruin T., 2015, NIPS DEEP REINF LEAR
- [4] Horgan Dan, 2018, DISTRIBUTED PRIORITI
- [5] Hou Y, 2017, IEEE INT C SYST
- [6] Ke Fengzhen, COMPUTER ENG APPL
- [7] Lanka S., 2018, ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience Replay
- [8] Li Y, 2017, P ADV NEUR INF PROC, V30, P3812
- [9] Lillicrap TP, 2015, ARXIV150902971