共 24 条
- [3] Auer P., 2006, P 20 ANN C NEUR INF, P49
- [4] Brafman R. I., 2002, RES, V3, P213
- [6] Castronovo Michael., 2013, PMLR, P1
- [8] A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06): : 1291 - 1307
- [10] Jaksch T, 2010, J MACH LEARN RES, V11, P1563