A fault-tolerant aware scheduling method for fog-cloud environments

被引：14

作者：

Alarifi, Abdulaziz ^{[1
]}

Abdelsamie, Fathi ^{[2
]}

Amoon, Mohammed ^{[1
,3
]}

机构：

[1] King Saud Univ, Dept Comp Sci, Community Coll, Riyadh, Saudi Arabia

[2] Menoufia Univ, Dept Elect & Elect Commun, Fac Elect Engn, Menoufia, Egypt

[3] Menoufia Univ, Dept Comp Sci & Engn, Fac Elect Engn, Menoufia, Egypt

来源：

PLOS ONE | 2019年 / 14卷 / 10期

关键词：

COMPUTING ENVIRONMENT; SERVICE;

D O I：

10.1371/journal.pone.0223902

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Fog computing is a promising technology that leverages the resources to provide services for requests of IoT (Internet of Things) devices at the cloud edge. The high dynamic and heterogeneous nature of devices at the cloud edge causes failures to be a popular event and therefore fault tolerance became indispensable. Most early scheduling and fault-tolerant methods did not highly consider time-sensitive requests. This increases the possibility of latencies for serving these requests which causes unfavorable impacts. This paper proposes a fault-tolerant scheduling method (FTSM) for allocating services' requests to the most sufficient devices in fog-cloud IoT-based environments. The main purpose of the proposed method is to reduce the latency and overheads of services and to increase the reliability and capacity of the cloud. The method depends on categorizing devices that can issue requests into three classes according to the type of service required. These classes are time-sensitive, time-tolerant and core. Each time-sensitive request is directly mapped to one or more edge devices using a pre-prepared executive list of devices. Each time-tolerant request may be assigned to one or more devices at the cloud edge or the cloud core. Core requests are assigned to resources at the cloud core. In order to achieve fault tolerance, the proposed method selects the most suitable fault-tolerant technique from replication, check-pointing and resubmission techniques for each request while most existing methods consider only one technique. The effectiveness of the proposed method is assessed using average service time, throughput, operation costs, success rate and capacity percentage as performance indicators.

引用

页数：24

共 28 条

[1] Fault tolerance aware scheduling technique for cloud computing environment using dynamic clustering algorithm [J].

Abdulhamid, Shafi'i Muhammad ;

Abd Latiff, Muhammad Shafie ;

Madni, Syed Hamid Hussain ;

Abdullahi, Mohammed .

NEURAL COMPUTING & APPLICATIONS, 2018, 29 (01) :279-293

[2] A checkpointed league championship algorithm-based cloud scheduling scheme with secure fault tolerance responsiveness [J].

Abdulhamid, Shafi'i Muhammad ;

Abd Latiff, Muhammad Shafie .

APPLIED SOFT COMPUTING, 2017, 61 :670-680

[3]

Alhosban A., 2013, P 2013 INT C COMP SY

[4] Adaptive Framework for Reliable Cloud Computing Environment [J].

Amoon, Mohammed .

IEEE ACCESS, 2016, 4 :9469-9478

[5] A job checkpointing system for computational grids [J].

Amoon, Mohammed .

OPEN COMPUTER SCIENCE, 2013, 3 (01) :17-26

[6]

Byrne J, 2017, WINT SIMUL C PROC, P4568, DOI 10.1109/WSC.2017.8248208

[7] Checkpointing as a Service in Heterogeneous Cloud Environments [J].

Cao, Jiajun ;

Simonin, Matthieu ;

Cooperman, Gene ;

Morin, Christine .

2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, :61-70

[8] RAINA: Reliability and Adaptability in Android for Fog Computing [J].

Dantu, Karthik ;

Ko, Steven Y. ;

Ziarek, Lukasz .

IEEE COMMUNICATIONS MAGAZINE, 2017, 55 (04) :41-45

[9]

Das P, 2013, 2013 IEEE CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICT 2013), P473

[10] Checkpoint-based Fault-tolerant Infrastructure for Virtualized Service Providers [J].

Goiri, Inigo ;

Julia, Ferran ;

Guitart, Jordi ;

Torres, Jordi .

PROCEEDINGS OF THE 2010 IEEE-IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, 2010, :455-462

← 1 2 3 →