共 15 条
[1]
[Anonymous], 2018, REINFORCEMENT LEARNI
[2]
[Anonymous], 1993, P 1993 CONN MOD SUMM
[3]
Cheung V, 2016, OPENAI GYM
[4]
Cornish Christopher John, 1989, (Ph.D. thesis
[5]
Fujimoto S, 2018, PR MACH LEARN RES, V80
[6]
Kingma DP, 2014, ADV NEUR IN, V27
[7]
Lillicrap T. P., 2016, CoRR, abs/1509.02971, P1
[9]
Mnih V., 2013, Asynchronous methods for deep reinforcement learning, V1312, P5602