Enabling Efficient Spatio-Temporal GPU Sharing for Network Function Virtualization

被引:2
|
作者
Zeng, Deze [1 ]
Zhu, Andong [2 ]
Gu, Lin [3 ]
Li, Peng [4 ]
Chen, Quan [5 ]
Guo, Minyi [5 ]
机构
[1] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
[4] Univ Aizu, Sch Comp Sci & Engn, Aizu Wakamatsu, Japan
[5] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
关键词
Graphics processing units; Task analysis; Kernel; Scheduling; Resource management; Concurrent computing; Computer architecture; GPU acceleration; network function virtualization; spatio-temporal sharing; task scheduling;
D O I
10.1109/TC.2023.3278541
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
By leveraging standard IT virtualization technology and Commercial-Off-The-Shelf (COTS) servers, Network Function Virtualization (NFV) decouples network functions from proprietary hardware devices for flexible service provisioning. But the potential of NFV is significantly limited by its performance inefficiency. With the unparalleled advantages of multi-core parallelism and high memory bandwidth, Graphics Processing Units (GPUs) are regarded as a promising way to accelerate Virtualized Network Functions (VNF). However, the special architecture of GPU brings new challenges to task scheduling and resource allocation. To this end, we propose a GPU oriented spatio-temporal sharing framework for NFV called Gost, aiming for GPU based VNF performance promotion. The execution order and GPU resource allocation (i.e., the number of threads) are considered in task scheduling to minimize the end-to-end latency for VNF flows. First, we formulate the task scheduling problem into a nonlinear programming form, and then transform it into an equivalent Integer Linear Programming (ILP) form. The problem is proved as NP-hard. We customize the classical list scheduling algorithm and propose a List Scheduling based Spatio-Temporal GPU sharing strategy (LSSTG), whose achievable worst-case performance is also formally analyzed. We practically implement Gost prototype, based on which extensive experiments verify the high performance efficiency of LSSTG compared to state-of-the-art in terms of latency and throughput.
引用
收藏
页码:2963 / 2977
页数:15
相关论文
共 50 条
  • [1] Gost: Enabling Efficient Spatio-Temporal GPU Sharing for Network Function Virtualization
    Zhu, Andong
    Zeng, Deze
    Gu, Lin
    Li, Peng
    Chen, Quan
    2021 IEEE/ACM 29TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2021,
  • [2] FaST-GShare: Enabling Efficient Spatio-Temporal GPU Sharing in Serverless Computing for Deep Learning Inference
    Jianfeng Gu T
    Zhu, Yichao
    Wang, Puxuan
    Chadha, Mohak
    Gerndt, Michael
    PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 635 - 644
  • [3] Enabling Blockchain for Efficient Spatio-Temporal Query Processing
    Nurgaliev, Ildar
    Muzammal, Muhammad
    Qu, Qiang
    WEB INFORMATION SYSTEMS ENGINEERING, WISE 2018, PT I, 2018, 11233 : 36 - 51
  • [4] Efficient spatio-temporal network for action recognition
    Su, Yanxiong
    Zhao, Qian
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (05)
  • [5] Serving DNN Inference With Fine-Grained Spatio-Temporal Sharing of GPU Servers
    Peng, Yaqiong
    Gao, Weiguo
    Peng, Haocheng
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (06) : 4310 - 4323
  • [6] InSS: An Intelligent Scheduling Orchestrator for Multi-GPU Inference With Spatio-Temporal Sharing
    Han, Ziyi
    Zhou, Ruiting
    Xu, Chengzhong
    Zeng, Yifan
    Zhang, Renli
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (10) : 1735 - 1748
  • [7] Enabling Spatio-Temporal Search in Open Data
    Neumaier, Sebastian
    Polleres, Axel
    JOURNAL OF WEB SEMANTICS, 2019, 55 : 21 - 36
  • [8] Serving Heterogeneous Machine Learning Models on Multi-GPU Servers with Spatio-Temporal Sharing
    Choi, Seungbeom
    Lee, Sunho
    Kim, Yeonjae
    Park, Jongse
    Kwon, Youngjin
    Huh, Jaehyuk
    PROCEEDINGS OF THE 2022 USENIX ANNUAL TECHNICAL CONFERENCE, 2022, : 199 - 215
  • [9] An efficient spatio-temporal index for spatio-temporal query in wireless sensor networks
    Lee, Donhee
    Yoon, Kyoungro
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (10): : 4888 - 4908
  • [10] Modeling Temporal Dynamics with Function Approximation in Deep Spatio-Temporal Inference Network
    Karnowski, Thomas P.
    Arel, Itamar
    Young, Steven
    BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES 2011, 2011, 233 : 174 - +