An Automated Pipeline for Advanced Fault Tolerance in Edge Computing Infrastructures

被引:10
作者
Theodoropoulos, Theodoros
Makris, Antonios [1 ]
Violos, John [1 ]
Tserpes, Konstantinos [1 ]
机构
[1] Harokopio Univ Athens, Dept Informat & Telemat, Athens, Greece
来源
2ND WORKSHOP ON FLEXIBLE RESOURCE AND APPLICATION MANAGEMENT ON THE EDGE, FRAME 2022 | 2022年
关键词
edge computing; fault tolerance; multi-step prediction; cloud computing; internet of things; horizontal auto-scaling; deep learning; PREDICTION;
D O I
10.1145/3526059.3533623
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The very fabric of Edge Computing is intertwined with the necessity to be able to orchestrate and manage a huge number of heterogeneous computational resources. On top of that, the rather demanding Quality of Service (QoS) requirements of Internet of Things (IoT) applications that run on these resources, dictate that it is essential to establish robust Fault Tolerance mechanisms. These mechanisms should be able to guarantee that the requirements will be upheld regardless of any potential changes in task production rate. To that end, we suggest an Automated Pipeline for Advanced Fault Tolerance (APAFT) that consists of various components that are designed to operate as functional blocks of an automated closed-control loop. Furthermore, the suggested pipeline is able to carry out the various Horizontal Scaling operations in a proactive manner. These Proactive Scaling capabilities are achieved via the use of a dedicated Deep Learning (DL)-based component that is able to perform multi-step prediction. Our work aims to introduce a number of mechanisms that are able to leverage the benefits that are provided by the multi-step format in a more refined manner. Having access to information regarding multiple future instances allows us to design automated resource orchestration strategies that cater to the specific characteristics of each type of computational node that is part of the Edge Infrastructure.
引用
收藏
页码:19 / 24
页数:6
相关论文
共 20 条
[1]   IoT traffic prediction using multi-step ahead prediction with neural network [J].
Abdellah, Ali R. ;
Mahmood, Omar Abdul Kareem ;
Paramonov, Alexander ;
Koucheryavy, Andrey .
2019 11TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT), 2019,
[2]  
Ataallah SMA, 2015, 2015 11TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), P241, DOI 10.1109/ICENCO.2015.7416355
[3]   Interactive Temporal Recurrent Convolution Network for Traffic Predictionin Data Centers [J].
Cao, Xiaofeng ;
Zhong, Yuhua ;
Zhou, Yun ;
Wang, Jiang ;
Zhu, Cheng ;
Zhang, Weiming .
IEEE ACCESS, 2018, 6 :5276-5289
[4]   Replication-Based Fault-Tolerance for Large-Scale Graph Processing [J].
Chen, Rong ;
Yao, Youyang ;
Wang, Peng ;
Zhang, Kaiyuan ;
Wang, Zhaoguo ;
Guan, Haibing ;
Zang, Binyu ;
Chen, Haibo .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (07) :1621-1635
[5]   Study and Investigation of SARIMA-based Traffic Prediction Models for the Resource Allocation in NFV networks with Elastic Optical Interconnection [J].
Eramo, V ;
Catena, T. ;
Lavacca, F. G. ;
di Giorgio, F. .
2020 22ND INTERNATIONAL CONFERENCE ON TRANSPARENT OPTICAL NETWORKS (ICTON 2020), 2020,
[6]   Checkpointing and Migration of IoT Edge Functions [J].
Karhula, Pekka ;
Janak, Jan ;
Schulzrinne, Henning .
PROCEEDINGS OF THE 2ND ACM INTERNATIONAL WORKSHOP ON EDGE SYSTEMS, ANALYTICS AND NETWORKING (EDGESYS '19), 2019, :60-65
[7]   A survey of fault tolerance in cloud computing [J].
Kumari, Priti ;
Kaur, Parmeet .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2021, 33 (10) :1159-1176
[8]   Cloud for Holography and Augmented Reality [J].
Makris, Antonios ;
Boudi, Abderrahmane ;
Coppola, Massimo ;
Cordeiro, Luis ;
Corsini, Massimiliano ;
Dazzi, Patrizio ;
Andilla, Ferran Diego ;
Gonzalez Rozas, Yago ;
Kamarianakis, Manos ;
Pateraki, Maria ;
Le Pham, Thu ;
Protopsaltis, Antonis ;
Raman, Aravindh ;
Romussi, Alessandro ;
Rosa, Luis ;
Spatafora, Elena ;
Taleb, Tarik ;
Theodoropoulos, Theodoros ;
Tserpes, Konstantinos ;
Zschau, Enrico ;
Herzog, Uwe .
2021 IEEE 10TH INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (IEEE CLOUDNET), 2021, :118-126
[9]  
Park SH, 2018, IEEE INT VEH SYM, P1672, DOI 10.1109/IVS.2018.8500658
[10]   WIDE AREA TRAFFIC - THE FAILURE OF POISSON MODELING [J].
PAXSON, V ;
FLOYD, S .
IEEE-ACM TRANSACTIONS ON NETWORKING, 1995, 3 (03) :226-244