共 35 条
[2]
Barth-Maron G, 2018, Arxiv, DOI [arXiv:1804.08617, DOI 10.48550/ARXIV.1804.08617]
[3]
Chishti SOA, 2018, 2018 IEEE 21ST INTERNATIONAL MULTI-TOPIC CONFERENCE (INMIC)
[4]
Cobbe K., 2020, arXiv
[5]
Espeholt L, 2018, PR MACH LEARN RES, V80
[6]
Fujimoto S, 2018, PR MACH LEARN RES, V80
[7]
Haarnoja T, 2018, PR MACH LEARN RES, V80
[8]
Hasselt H.V., 2010, P ADV NEUR INF PROC
[9]
Hausknecht M., 2015, arXiv
[10]
On actor-critic algorithms
[J].
SIAM JOURNAL ON CONTROL AND OPTIMIZATION,
2003, 42 (04)
:1143-1166