77 entries in total
- [41] Nigade V., 2021, BETTER NEVER LATE TI, P426
- [42] Jellyfish: Timely Inference Serving for Dynamic Edge Networks [J]. 2022 IEEE 43rd Real-Time Systems Symposium (RTSS 2022), 2022: 277-290
- [43] NVIDIA, NVIDIA Triton Inference Server Documentation
- [44] Olston Christopher, 2017, arXiv
- [45] PyTorch, 2022, Reproducibility
- [46] PyTorch, 2021, TorchServe
- [47] Qu Zhe, 2022, arXiv
- [48] Ran XK, 2018, IEEE INFOCOM SER, P1421, DOI 10.1109/INFOCOM.2018.8485905
- [49] FA2: Fast, Accurate Autoscaling for Serving Deep Learning Inference with SLA Guarantees [J]. 2022 IEEE 28th Real-Time and Embedded Technology and Applications Symposium (RTAS), 2022: 146-159