22 entries in total
[1]
Berner C., 2019, Dota 2 with large scale deep reinforcement learning
[2]
Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning [J]. WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018: 1939-1948
[3]
Foerster JN, 2018, AAAI CONF ARTIF INTE, P2974
[4]
Garnett R, 2018, ADV NEURAL INFORM PR, V31, P8102
[5]
Guestrin C, 2002, ADV NEUR IN, V14, P1523
[6]
Non-convex optimization for machine learning [J]. Foundations and Trends in Machine Learning, 2017, 10 (3-4): 142-336
[8]
Ma Jinming, 2020, P 19 INT C AUT AG MU, P816