49 entries in total
[5]
Choi S. P. M., 2001, Sequence Learning: Paradigms, Algorithms, and Applications (Lecture Notes in Artificial Intelligence, Vol. 1828), P264
[6]
Choi S. P.-M., 2001, PMLR, P49
[7]
Choi SPM, 2000, ADV NEUR IN, V12, P987
[10]
A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients [J]. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 2012, 42(06): 1291-1307