共 64 条
[1]
Experience Replay for Real-Time Reinforcement Learning Control
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS,
2012, 42 (02)
:201-212
[2]
Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS,
2008, 38 (04)
:943-949
[3]
Anderson C.W., 2015, 2015 International Joint Conference on Neural Networks IJCNN, P1, DOI DOI 10.1109/IJCNN.2015.7280824
[4]
[Anonymous], 2015, DEEP REINFORCEMENT L
[5]
[Anonymous], 2015, ARXIV150906461
[6]
[Anonymous], 2016, P 4 INT C LEARN REPR
[7]
[Anonymous], 1998, INTRO REINFORCEMENT
[8]
[Anonymous], GUEST POST 1
[9]
[Anonymous], 2016, IEEE RSJ INT C INT R
[10]
[Anonymous], 2016, ARXIV160300748