共 21 条
[1]
[Anonymous], 2018, SOFT ACTOR CRITIC OF
[2]
Baird L., 1995, Machine Learning. Proceedings of the Twelfth International Conference on Machine Learning, P30
[3]
Bhatnagar Shalabh, ADV NEURAL INFORM PR, V22, P1204
[4]
Espeholt L., 2018, ARXIV180201561
[5]
Heess N., EMERGENCE LOCOMOTION
[6]
Horgan Dan., 2018, Distributed Prioritized Experience Replay
[7]
Jaderberg M., Reinforcement learning with unsupervised auxiliary tasks
[8]
Juliani A., UNITY GEN PLATFORM I
[9]
Konda Vijay R, ADV NEURAL INFORM PR, P1008
[10]
LILLICRAP T P, Continuous control with deep reinforcement learning