25 entries in total
- [1] [Anonymous], 2015, COMPUTER SCI
- [2] [Anonymous], 2015, Nature, DOI 10.1038/nature14539
- [3] [Anonymous], Prox. Policy Optim. Algorithms
- [4] Toward Self-Driving Bicycles Using State-of-the-Art Deep Reinforcement Learning Algorithms [J]. SYMMETRY-BASEL, 2019, 11 (02):
- [5] Hamalainen P., 2018, PPO CMA PROXIMAL POL
- [6] Imitation Reinforcement Learning-Based Remote Rotary Inverted Pendulum Control in OpenFlow Network [J]. IEEE ACCESS, 2019, 7 : 36682 - 36690
- [7] Kim S. K., IEEE T CIRCUITS SYST
- [8] King DB, 2015, ACS SYM SER, V1214, P1