49 entries in total
[31]
Roughan M., 2003, P 3 ACM SIGCOMM C IN, P248
[34]
Schulman John, 2017, Proximal policy optimization algorithms
[36]
Stampa G, 2017, DEEP REINFORCEMENT L
[37]
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[38]
Thaler D., 2000, MULTIPATH ISSUES UNI
[39]
Learning To Route [J]. HOTNETS-XVI: PROCEEDINGS OF THE 16TH ACM WORKSHOP ON HOT TOPICS IN NETWORKS, 2017: 185-191