共 47 条
- [1] Belay Adam, 2014, Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI '14). OSDI '14, P49
- [2] Brandeburg J, 2017, TOOL MEASURING NIC B
- [3] Burns E, 2017, FACEBOOK USES DEEP L
- [4] Chen TQ, 2018, PROCEEDINGS OF THE 13TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P579
- [5] LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference [J]. 2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021), 2021, : 493 - 506
- [6] PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing Units [J]. 2020 IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2020), 2020, : 220 - 233
- [7] Corportion N, 2019, NVID TENS COR UNPR A
- [8] Crankshaw D, 2017, PROCEEDINGS OF NSDI '17: 14TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION, P613
- [9] Cui WH, 2022, PROCEEDINGS OF THE 2022 USENIX ANNUAL TECHNICAL CONFERENCE, P183
- [10] Enable Simultaneous DNN Services Based on Deterministic Operator Overlap and Precise Latency Prediction [J]. SC21: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2021,