共 64 条
[51]
Si J., 2004, HDB LEARNING APPROXI
[53]
Springenberg JT., 2015, STRIVING SIMPLICITY, DOI DOI 10.48550/ARXIV.1412.6806
[54]
Tieleman T., 2012, COURSERA NEURAL NETW, V4, P26, DOI DOI 10.1007/S12654-012-0173-1
[55]
Wang B, 2016, IEEE IJCNN, P3550, DOI 10.1109/IJCNN.2016.7727655
[59]
Autonomous reinforcement learning with experience replay
[J].
NEURAL NETWORKS,
2013, 41
:156-167