共 67 条
[31]
In-Datacenter Performance Analysis of a Tensor Processing Unit
[J].
44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017),
2017,
:1-12
[32]
Reinforcement learning: A survey
[J].
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH,
1996, 4
:237-285
[33]
Kingma DP, 2014, ADV NEUR IN, V27
[34]
On actor-critic algorithms
[J].
SIAM JOURNAL ON CONTROL AND OPTIMIZATION,
2003, 42 (04)
:1143-1166
[35]
Latva-aho M., 2020, KEY DRIVERS RES CHAL
[36]
Applications of Deep Reinforcement Learning in Communications and Networking: A Survey
[J].
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS,
2019, 21 (04)
:3133-3174
[37]
Maas AL, 2013, ICML, P30
[38]
An Intelligent and Adaptive Threshold-Based Schema for Energy and Performance Efficient Dynamic VM Consolidation
[J].
ENERGY EFFICIENCY IN LARGE SCALE DISTRIBUTED SYSTEMS, EE-LSDS 2013,
2013, 8046
:85-97
[39]
Miyazawa Takaya, 2017, 2017 IFIP/IEEE Symposium on Integrated Network and Service Management (IM), P428, DOI 10.23919/INM.2017.7987308
[40]
Mnih V., 2016, PROC INT C MACH LEAR, P1928