共 16 条
[1]
Aberdeen D., 2002, PROC INT C MACHINE L, P3
[2]
Aberdeen D.A, 2003, THESIS AUSTR NATL U
[4]
[Anonymous], 2010, SCHOLARPEDIA
[5]
[Anonymous], 2020, Reinforcement Learning, An Introduction
[7]
Busoniu L, 2010, AUTOM CONTROL ENG SE, P1, DOI 10.1201/9781439821091-f
[8]
Degris Thomas, 2012, P 29 INT COFERENCE I
[10]
On actor-critic algorithms
[J].
SIAM JOURNAL ON CONTROL AND OPTIMIZATION,
2003, 42 (04)
:1143-1166