共 48 条
[1]
Achiam J., 2020, Twin delayed DDPG
[3]
Alex J., 2008, BENCHMARK SIMULATION
[5]
Bilgin E., 2020, Mastering Reinforcement Learning with Python
[6]
Bishop C.M., 1995, Neural networks for pattern recognition
[7]
Brockman Greg, 2016, arXiv
[9]
Chan S.C., 2019, Measuring the reliability of reinforcement learning algorithms. arXiv