Fault-Tolerant Scheduling Algorithm With Re-Allocation for Divisible Task

Cited by: 9
Authors
Xuan, Hejun [1]
Wei, Shiwei [2]
Tong, Wuning [3]
Liu, Daohua [1]
Qi, Chuanda [1]
Affiliations
[1] Xinyang Normal Univ, Sch Comp & Informat Technol, Xinyang 464000, Peoples R China
[2] Guilin Univ Aerosp Technol, Sch Comp & Technol, Guilin 541000, Peoples R China
[3] Shaanxi Univ Chinese Med, Sch Sci, Xianyang 712000, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Divisible task; optimal sequence; task re-allocated; fault-tolerant; START-UP COSTS; TIME TASKS; LOAD DISTRIBUTION; DESIGN; OPTIMIZATION; STRATEGIES;
DOI
10.1109/ACCESS.2018.2881268
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
This paper addresses fault-tolerant scheduling of divisible tasks on a general, realistic heterogeneous platform in which communication uses a non-blocking message-receiving mode and the processors and communication links may differ in speed and startup overhead. For this class of problems, the optimal processor sequence and the fraction of the task assigned to each processor are derived first, taking the fault checkout overhead and checkout time into account. Then, to reduce the time consumption and checkout overhead, a checkout strategy better suited to divisible tasks is employed. Moreover, an efficient algorithm that re-allocates the faulty fraction units is proposed. Finally, experiments on simulation examples indicate that the proposed algorithm is effective, can minimize the expected execution time, and can reduce the time spent on fault tolerance.
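To illustrate what "the fraction of task for each processor" means in divisible-load scheduling, the following is a minimal Python sketch of the classic equal-finish-time principle. It is not the paper's algorithm: it assumes a simplified model with computation cost and a fixed startup overhead per processor only, and it ignores communication time, processor ordering, fault checkout, and re-allocation. The function name `equal_finish_fractions` and its parameters are hypothetical.

```python
def equal_finish_fractions(total_load, unit_times, startups):
    """Split a divisible load so that all participating processors finish together.

    Assumed (simplified) model: processor i finishes at
        startups[i] + fraction[i] * total_load * unit_times[i]
    Communication costs are ignored; this is an illustrative assumption,
    not the model used in the paper.
    """
    if not unit_times or len(unit_times) != len(startups):
        raise ValueError("need one unit time and one startup per processor")
    if total_load <= 0 or any(w <= 0 for w in unit_times):
        raise ValueError("load and unit times must be positive")

    active = list(range(len(unit_times)))
    while True:
        # Solve sum_i (T - s_i) / w_i = total_load for the common finish time T.
        inv_sum = sum(1.0 / unit_times[i] for i in active)
        offset = sum(startups[i] / unit_times[i] for i in active)
        makespan = (total_load + offset) / inv_sum
        # A processor whose startup alone reaches T would get no load: drop it.
        keep = [i for i in active if startups[i] < makespan]
        if len(keep) == len(active):
            break
        active = keep

    fractions = [0.0] * len(unit_times)
    for i in active:
        fractions[i] = (makespan - startups[i]) / (unit_times[i] * total_load)
    return fractions, makespan


if __name__ == "__main__":
    # Two processors; the second is half as fast and has a small startup cost.
    fractions, makespan = equal_finish_fractions(100.0, [1.0, 2.0], [0.0, 5.0])
    print(fractions, makespan)
```

The loop drops any processor whose startup overhead is so large that it could not usefully take part, then recomputes the common finish time for the remaining ones. Under this model the processor with the smallest startup overhead is always retained, so the loop terminates with at least one participant.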
Pages: 73147-73157
Number of pages: 11