共 18 条
[1]
Experience Replay for Real-Time Reinforcement Learning Control
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS,
2012, 42 (02)
:201-212
[2]
[Anonymous], INFORM TECHNOLOGY
[3]
GUEZ A, 2016, C AAAI
[4]
Hinton Geoffrey E., 2006, NEURAL COMPUTATION
[5]
Howard Ronald., 1966, Dynamic programming and Markov processes
[6]
Kingma DP, 2014, ARXIV
[7]
Lange S, 2012, IEEE IJCNN
[8]
li shihao, RES AUTOADAPTIVE CRU, DOI [10.16638/j.cnki.1671-7988.2018.23.064, DOI 10.16638/J.CNKI.1671-7988.2018.23.064]
[9]
Li Y, 2017, P ADV NEUR INF PROC, V30, P3812
[10]
Lillicrap TP, 2015, ARXIV150902971