共 39 条
[1]
Experience Replay for Real-Time Reinforcement Learning Control
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS,
2012, 42 (02)
:201-212
[2]
Bertsekas DP, 1995, PROCEEDINGS OF THE 34TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, P560, DOI 10.1109/CDC.1995.478953
[3]
Brockman G, 2016, Arxiv, DOI arXiv:1606.01540
[4]
Folkers A, 2019, IEEE INT VEH SYM, P2025, DOI [10.1109/ivs.2019.8814124, 10.1109/IVS.2019.8814124]
[6]
Fujimoto S., 2020, ADV NEURAL INFORM PR, P14219, DOI DOI 10.48550/ARXIV.2007.06049
[7]
Fujimoto S, 2018, PR MACH LEARN RES, V80
[8]
Gehring C., 2013, P 2013 INT C AUTONOM, P1037
[9]
A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS,
2012, 42 (06)
:1291-1307
[10]
Haarnoja T, 2018, PR MACH LEARN RES, V80