共 41 条
- [1] BATCH: Machine Learning Inference Serving on Serverless Platforms with Adaptive Batching [J]. PROCEEDINGS OF SC20: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC20), 2020,
- [2] [Anonymous], Triton inference serving
- [3] [Anonymous], TensorRT
- [4] Trading Private Range Counting over Big IoT Data [J]. 2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 144 - 153
- [5] A Private and Efficient Mechanism for Data Uploading in Smart Cyber-Physical Systems [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (02): : 766 - 775
- [8] LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference [J]. 2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021), 2021, : 493 - 506
- [9] Crankshaw Daniel, 2020, SoCC '20: Proceedings of the 11th ACM Symposium on Cloud Computing, P477, DOI 10.1145/3419111.3421285
- [10] Crankshaw D., 2018, Queue, V16, P83