共 19 条
- [1] Ali H, 2019, ARXIV190801275
- [3] Chen XF, 2018, IEEE VTS VEH TECHNOL
- [4] Degris T, 2012, P AMER CONTR CONF, P2177
- [5] A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06): : 1291 - 1307
- [6] Hill A., 2018, Stable baselines
- [7] Ives DJ, 2013, 2013 OPTICAL FIBER COMMUNICATION CONFERENCE AND EXPOSITION AND THE NATIONAL FIBER OPTIC ENGINEERS CONFERENCE (OFC/NFOEC)
- [8] Mitchell M., 1998, An Introduction to Genetic Algorithms. Complex Adaptive Systems
- [9] Mnih V, 2016, PR MACH LEARN RES, V48
- [10] Poggiolini P., 2015, ARXIV150304132