Adaptive Fault-Tolerant Strategy for Latency-Aware IoT Application Executing in Edge Computing Environment

被引:17
作者
Mudassar, Muhammad [1 ,2 ]
Zhai, Yanlong [1 ]
Lejian, Liao [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci, Beijing 100081, Peoples R China
[2] COMSATS Univ Islamabad, Dept Comp Sci, Vehari Campus, Vehari 61100, Pakistan
关键词
Internet of Things; Checkpointing; Fault tolerant systems; Fault tolerance; Reliability; Edge computing; Cloud computing; Distributed computing; edge computing; fault tolerance; Internet of Things (IoT); INTERNET;
D O I
10.1109/JIOT.2022.3144026
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Edge computing has recently evolved that offers to execute jobs efficiently by pushing cloud capabilities to edge of the network, this improves the quality of services to latency-oriented Internet of Things (IoT) applications when compared with cloud computing. By using current smart devices as edge nodes, edge computing can provide elastic resources that allow distributed data processing in a decentralized way. Still these smart devices are resource constrained in nature and tends to face a high failure rate than traditional distributed systems, the implementation of a fault-tolerant system that ensures the reliability and application availability becomes a key requirement. In this article, we propose a fault-tolerance methodology based on checkpointing and replication for the edge computing. Our proposed system uses a smart checkpointing for the IoT application tasks executing in a distributed edge network, and by replicating the checkpoint files on alternative edge nodes in the vicinity allowed to increase the system reliability. The experimental results show that our approach is effective in terms of reliability and availability of tasks executing in the edge network along with meeting deadlines of an IoT application.
引用
收藏
页码:13250 / 13262
页数:13
相关论文
共 33 条
  • [1] Intelligent Checkpointing Strategies for IoT System Management
    Aissaoui, Francois
    Cooperman, Gene
    Monteil, Thierry
    Tazi, Said
    [J]. 2017 IEEE 5TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD 2017), 2017, : 305 - 312
  • [2] Fault-tolerance in the borealis distributed stream processing system
    Balazinska, Magdalena
    Balakrishnan, Hari
    Madden, Samuel R.
    Stonebraker, Michael
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2008, 33 (01):
  • [3] DRAW: Data Replication for Enhanced Data Availability in IoT-based Sensor Systems
    Bin Qaim, Waleed
    Ozkasap, Oznur
    [J]. 2018 16TH IEEE INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP, 16TH IEEE INT CONF ON PERVAS INTELLIGENCE AND COMP, 4TH IEEE INT CONF ON BIG DATA INTELLIGENCE AND COMP, 3RD IEEE CYBER SCI AND TECHNOL CONGRESS (DASC/PICOM/DATACOM/CYBERSCITECH), 2018, : 770 - 775
  • [4] Chen WH, 2014, J INF SCI ENG, V30, P1167
  • [5] Cherrier S, 2014, 2014 IEEE WORLD FORUM ON INTERNET OF THINGS (WF-IOT), P532, DOI 10.1109/WF-IoT.2014.6803224
  • [6] Dawei Sun, 2012, Proceedings of the 2012 32nd International Conference on Distributed Computing Systems Workshops (ICDCS Workshops), P578, DOI 10.1109/ICDCSW.2012.6
  • [7] Workflows and e-Science: An overview of workflow system features and capabilities
    Deelman, Ewa
    Gannon, Dennis
    Shields, Matthew
    Taylor, Ian
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2009, 25 (05): : 528 - 540
  • [8] Edge of Things: The Big Picture on the integration of Edge, IoT and the Cloud in a Distributed Computing Environment
    El-Sayed, Hesham
    Sankar, Sharmi
    Prasad, Mukesh
    Puthal, Deepak
    Gupta, Akshansh
    Mohanty, Manoranjan
    Lin, Chin-Teng
    [J]. IEEE ACCESS, 2018, 6 : 1706 - 1717
  • [9] Ghodsi Z, 2017, ICCAD-IEEE ACM INT, P376, DOI 10.1109/ICCAD.2017.8203802
  • [10] Grover J, 2018, IEEE SENSOR, P609