共 53 条
[1]
BATCH: Machine Learning Inference Serving on Serverless Platforms with Adaptive Batching
[J].
PROCEEDINGS OF SC20: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC20),
2020,
[2]
Amodei D, 2016, PR MACH LEARN RES, V48
[3]
[Anonymous], 2022, 19 USENIX S NETW SYS
[4]
Azevedo D., 2010, GREEN GRID, V32
[5]
Balancing Efficiency and Fairness in Heterogeneous GPU Clusters for Deep Learning
[J].
PROCEEDINGS OF THE FIFTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS (EUROSYS'20),
2020,
[6]
Chilimbi Trishul, 2014, Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI '14). OSDI '14, P571
[7]
Chuangang Ren, 2012, 2012 IEEE 20th International Symposium on Modelling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), P391, DOI 10.1109/MASCOTS.2012.51
[8]
Crankshaw D, 2017, PROCEEDINGS OF NSDI '17: 14TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION, P613
[9]
GeePS: Scalable deep learning on distributed GPUs with a GPU-specialized parameter server
[J].
PROCEEDINGS OF THE ELEVENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS, (EUROSYS 2016),
2016,
[10]
Enable Simultaneous DNN Services Based on Deterministic Operator Overlap and Precise Latency Prediction
[J].
SC21: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS,
2021,