NEPTUNE: A Comprehensive Framework for Managing Serverless Functions at the Edge

被引:8
作者
Baresi, Luciano [1 ]
Hu, Davide Yi Xian [1 ]
Quattrocchi, Giovanni [1 ]
Terracciano, Luca [1 ]
机构
[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, Via Golgi 42, I-20133 Milan, Italy
关键词
Serverless; edge computing; GPU; placement; vertical scaling; kubernetes; k3s; dynamic resource allocation; control theory; CONSTRAINTS; PLACEMENT; NETWORK; 5G;
D O I
10.1145/3634750
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Applications that are constrained by low-latency requirements can hardly be executed on cloud infrastructures, given the high network delay required to reach remote servers. Multi-access Edge Computing (MEC) is the reference architecture for executing applications on nodes that are located close to users (i.e., at the edge of the network). This way, the network overhead is reduced but new challenges emerge. The resources available on edge nodes are limited, workloads fluctuate since users can rapidly change location, and complex tasks are becoming widespread (e.g., machine learning inference). To address these issues, this article presents NEPTUNE, a serverless-based framework that automates the management of large-scale MEC infrastructures. In particular, NEPTUNE provides (i) the placement of serverless functions on MEC nodes according to users' location, (ii) the resolution of resource contention scenarios by avoiding that single nodes be saturated, and (iii) the dynamic allocation of CPUs and GPUs tomeet foreseen execution times. To assess NEPTUNE, we built a prototype based on K3S, an edge-dedicated version of Kubernetes, and executed a comprehensive set of experiments. Results show that NEPTUNE obtains a significant reduction in terms of response time, network overhead, and resource consumption compared with five state-of-the-art solutions.
引用
收藏
页数:32
相关论文
共 64 条
[1]  
[Anonymous], 1981, RFC 792 (Internet Standard), DOI DOI 10.17487/RFC0792
[2]   Resource Provisioning and Allocation in Function-as-a-Service Edge-Clouds [J].
Ascigil, Onur ;
Tasiopoulos, Argyrios G. ;
Truong Khoa Phan ;
Sourlas, Vasilis ;
Psaras, Ioannis ;
Pavlou, George .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (04) :2410-2424
[3]   Adaptive scaling of Kubernetes pods [J].
Balla, David ;
Simon, Csaba ;
Maliosz, Markosz .
NOMS 2020 - PROCEEDINGS OF THE 2020 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2020: MANAGEMENT IN THE AGE OF SOFTWARIZATION AND ARTIFICIAL INTELLIGENCE, 2020,
[4]   NEPTUNE: Network- and GPU-aware Management of Serverless Functions at the Edge [J].
Baresi, Luciano ;
Hu, Davide Yi Xian ;
Quattrocchi, Giovanni ;
Terracciano, Luca .
2022 17TH INTERNATIONAL SYMPOSIUM ON SOFTWARE ENGINEERING FOR ADAPTIVE AND SELF-MANAGING SYSTEMS (SEAMS), 2022, :144-155
[5]   KOSMOS: Vertical and Horizontal Resource Autoscaling for Kubernetes [J].
Baresi, Luciano ;
Hu, Davide Yi Xian ;
Quattrocchi, Giovanni ;
Terracciano, Luca .
SERVICE-ORIENTED COMPUTING (ICSOC 2021), 2021, 13121 :821-829
[6]   COCOS: a Scalable Architecture for Containerized Heterogeneous Systems [J].
Baresi, Luciano ;
Quattrocchi, Giovanni .
IEEE 17TH INTERNATIONAL CONFERENCE ON SOFTWARE ARCHITECTURE (ICSA 2020), 2020, :103-113
[7]   PAPS: A Framework for Decentralized Self-management at the Edge [J].
Baresi, Luciano ;
Mendonca, Danilo Filgueira ;
Quattrocchi, Giovanni .
SERVICE-ORIENTED COMPUTING (ICSOC 2019), 2019, 11895 :508-522
[8]   Classification of optimization problems in fog computing [J].
Bellendorf, Julian ;
Mann, Zoltan Adam .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 107 :158-176
[9]   AuctionWhisk: Using an auction-inspired approach for function placement in serverless fog platforms [J].
Bermbach, David ;
Bader, Jonathan ;
Hasenburg, Jonathan ;
Pfandzelter, Tobias ;
Thamsen, Lauritz .
SOFTWARE-PRACTICE & EXPERIENCE, 2022, 52 (05) :1143-1169
[10]   GPU-based pedestrian detection for autonomous driving [J].
Campmany, V. ;
Silva, S. ;
Espinosa, A. ;
Moure, J. C. ;
Vazquez, D. ;
Lopez, A. M. .
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 :2377-2381