127 entries in total
[34]
Gottipati S. K., P 37 INT C MACH LEAR
[36]
A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42(06): 1291-1307
[38]
Guimaraes GL, 2018, Arxiv, DOI arXiv:1705.10843
[40]
Henault E. S., 2020, PEERJ PHYS CHEM, V2