共 21 条
- [1] [Anonymous], 2018, SOFT ACTOR CRITIC OF
- [2] Baird L., 1995, Machine Learning. Proceedings of the Twelfth International Conference on Machine Learning, P30
- [3] Bhatnagar Shalabh, ADV NEURAL INFORM PR, V22, P1204
- [4] Espeholt L., 2018, ARXIV180201561
- [5] Heess N., EMERGENCE LOCOMOTION
- [6] Horgan Dan., 2018, Distributed Prioritized Experience Replay
- [7] Jaderberg M., Reinforcement learning with unsupervised auxiliary tasks
- [8] Juliani A., UNITY GEN PLATFORM I
- [9] Konda Vijay R, ADV NEURAL INFORM PR, P1008
- [10] LILLICRAP T P, Continuous control with deep reinforcement learning