A Fault-Tolerant Model for Performance Optimization of a Fog Computing System

被引:27
作者
Zhang, Peiyun [1 ,2 ]
Chen, Yutong [1 ]
Zhou, Mengchu [3 ]
Xu, Ge [2 ]
Huang, Wenjun [1 ]
Al-Turki, Yusuf [4 ,5 ]
Abusorrah, Abdullah [4 ,5 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Peoples R China
[2] Anhui Normal Univ, Sch Comp & Informat, Wuhu 241003, Peoples R China
[3] New Jersey Inst Technol, Helen & John C Hartmann Dept Elect & Comp Engn, Newark, NJ 07102 USA
[4] King Abdulaziz Univ, Dept Elect & Comp Engn, Fac Engn, Jeddah 21481, Saudi Arabia
[5] King Abdulaziz Univ, Ctr Res Excellence Renewable Energy & Power Syst, Jeddah 21481, Saudi Arabia
基金
中国国家自然科学基金;
关键词
Fault tolerant systems; Fault tolerance; Hidden Markov models; Cloud computing; Reliability; Computational modeling; Markov processes; Fault tolerant; fog computing; improved simulated annealing (ISA); Markov chain; ALGORITHM; CONSUMPTION; DEPLOYMENT; INTERNET; IFOGSIM; THINGS; EDGE;
D O I
10.1109/JIOT.2021.3088417
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In a distributed heterogeneous fog environment, fog nodes may change their state at any time. Their reliability changes accordingly. A dynamic analysis of state changes can help one detect fault-tolerant fog nodes, which is conducive to promoting the reliability of fog services. This article proposes a fault-tolerant model based on a Markov chain for a fog system's performance optimization. The real-time reliability of fog nodes is analyzed by using dynamic distributed parameters. Thus, the state transition process of fog nodes is modeled with a continuous-time Markov chain. The steady-state probability of a fog system is analyzed. Then, a fault-tolerant strategy and its algorithms are designed to select nodes with the minimum cost based on their steady-state probabilities. The proposed method can predict the number of faulty ones of a fog system via the steady-state probability. An intelligent optimization method called simulated annealing (ISA) is designed and used to select the most appropriate fog nodes to substitute faulty ones. The experimental results show that the method is feasible and effective for selecting the right fault-tolerant nodes according to different performance requirements. ISA can well outperform such methods as random selection, discrete differential evolution, and simulated annealing in terms of cost and time.
引用
收藏
页码:1725 / 1736
页数:12
相关论文
共 47 条
[1]  
Aarts E., 2005, SIMULATED ANNEALING
[2]   Deploying Fog Computing in Industrial Internet of Things and Industry 4.0 [J].
Aazam, Mohammad ;
Zeadally, Sherali ;
Harras, Khaled A. .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (10) :4674-4682
[3]   Simulated annealing with restarts for the optimization of electromagnetic devices [J].
Alfonzetti, S ;
Dilettoso, E ;
Salerno, N .
IEEE TRANSACTIONS ON MAGNETICS, 2006, 42 (04) :1115-1118
[4]   Enabling IoT interoperability through opportunistic smartphone-based mobile gateways [J].
Aloi, G. ;
Caliciuri, G. ;
Fortino, G. ;
Gravina, R. ;
Pace, P. ;
Russo, W. ;
Savaglio, C. .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2017, 81 :74-84
[5]   An Algorithm for Performance Evaluation of Closed-Loop Spare Supply Systems With Generally Distributed Failure and Repair Times [J].
Assadi, Morteza ;
Mobin, Mohammadsadegh ;
Cheraghi, Hossein ;
Li, Zhaojun .
IEEE TRANSACTIONS ON RELIABILITY, 2019, 68 (02) :678-688
[6]   Distributed Fault-Tolerant Control of Large-Scale Systems: An Active Fault Diagnosis Approach [J].
Boem, Francesca ;
Gallo, Alexander J. ;
Raimondo, Davide M. ;
Parisini, Thomas .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2020, 7 (01) :288-301
[7]   A Dynamic Service Migration Mechanism in Edge Cognitive Computing [J].
Chen, Min ;
Li, Wei ;
Fortino, Giancarlo ;
Hao, Yixue ;
Hu, Long ;
Humar, Iztok .
ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2019, 19 (02)
[8]   Analysis of a Fault-Tolerant Framework for Reliability Prediction of Service-Oriented Architecture Systems [J].
Chiang, Meng-Chu ;
Huang, Chin-Yu ;
Wu, Cheng-Yang ;
Tsai, Chun-Ying .
IEEE TRANSACTIONS ON RELIABILITY, 2021, 70 (01) :13-48
[9]   Robust Deadlock Avoidance and Control of Automated Manufacturing Systems With Assembly Operations Using Petri Nets [J].
Du, Nan ;
Hu, Hesuan ;
Zhou, MengChu .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2020, 17 (04) :1961-1975
[10]   SPINE2: developing BSN applications on heterogeneous sensor nodes [J].
Fortino, Giancarlo ;
Guerrieri, Antonio ;
Bellifemine, Fabio L. ;
Giannantonio, Roberta .
2009 IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL EMBEDDED SYSTEMS, 2009, :128-+