27 entries in total
[1]
[Anonymous], 2018, ARXIV180801977
[2]
[Anonymous], 2014, CISC VIS NETW IND GL
[3]
Bakker B, 2002, ADV NEUR IN, V14, P1475
[6]
Chen X., 2016, IEEE ACM T NETWORKIN, V24
[7]
Chen X., 2018, ARXIV180400514
[9]
A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42(06): 1291-1307
[10]
Hausknecht M., 2015, P AAAI FALL S SER