共 36 条
[1]
[Anonymous], 1959, Information Theory and Statistics
[3]
Chen G., 2018, An adaptive clipping approach for proximal policy optimization
[4]
Chu X., 2018, POLICY OPTIMIZATION
[5]
Engstrom L., 2020, ARXIV
[6]
Fayjie AR, 2018, INT CONF UBIQ ROBOT, P896
[7]
Gupta Jayesh K., 2017, Autonomous Agents and Multiagent Systems, AAMAS 2017: Workshops, Best Papers. Revised Selected Papers: LNAI 10642, P66, DOI 10.1007/978-3-319-71682-4_5
[8]
Haarnoja T, 2018, PR MACH LEARN RES, V80
[9]
Hamalainen P., 2018, ARXIV181002541
[10]
Hessel M, 2018, AAAI CONF ARTIF INTE, P3215