共 34 条
[21]
Mitchell T M., 1997, Machine learning, International Edition
[22]
Mnih V, 2013, Arxiv, DOI arXiv:1312.5602
[24]
Delay-Optimal Traffic Engineering through Multi-agent Reinforcement Learning
[J].
IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM 2019 WKSHPS),
2019,
:435-442
[25]
Engineering Egress with Edge Fabric Steering Oceans of Content to the World
[J].
SIGCOMM '17: PROCEEDINGS OF THE 2017 CONFERENCE OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION,
2017,
:418-431
[26]
Schulman J, 2017, Arxiv, DOI [arXiv:1707.06347, DOI 10.48550/ARXIV.1707.06347]
[28]
Valadarsky A., 2017, PROC 31 C NEURAL INF, P1
[29]
van Hasselt H., 2010, DOUBLE Q LEARNING PA, P2613
[30]
Wang ZY, 2016, PR MACH LEARN RES, V48