共 40 条
[1]
[Anonymous], 2017, ADV NEURAL INFORM PR
[2]
[Anonymous], 2017, P INT C LEARN REPR
[3]
[Anonymous], 1995, J. Int. Comput. Games Assoc.
[4]
[Anonymous], 2012, P MACHINE LEARNING R
[5]
[Anonymous], 2014, INT C MACH LEARN
[6]
[Anonymous], 1995, PID CONTROLLERS THEO
[7]
[Anonymous], 2015, P INT C LEARN REPR I
[9]
A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS,
2012, 42 (06)
:1291-1307
[10]
Houthooft Rein, 2016, ADV NEURAL INFORM PR, V29