Fault tolerant job scheduling in computational grid

Cited by: 6
Authors
Nazir, Babar [1]
Khan, Taimoor [1]
Affiliations
[1] COMSATS Inst Informat Technol, Dept Comp Sci, Abbottabad, Pakistan
Source
SECOND INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES 2006, PROCEEDINGS | 2006
Keywords
grid computing; grid scheduling; computational grid; job scheduling; fault tolerance; resource management;
DOI
10.1109/ICET.2006.335930
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In large-scale grids, the probability of a failure is much greater than in traditional parallel systems [1]. Fault tolerance has therefore become a crucial area in grid computing. In this paper, we address the problem of fault tolerance in terms of resource failure. We devise a strategy for fault-tolerant job scheduling in computational grids. The proposed strategy maintains a history of each resource's fault occurrences in the Grid Information Service (GIS). Whenever a resource broker has a job to schedule, it retrieves the resource fault occurrence history from the GIS and, depending on this information, applies different intensities of checkpointing and replication when scheduling the job on resources that have different tendencies toward faults. Using checkpointing, the proposed scheme can make grid scheduling more reliable and efficient. Further, it increases the percentage of jobs executed within the specified deadline and allotted budget, thereby helping to make the grid trustworthy. We have evaluated the performance of the proposed strategy through simulation. The experimental results demonstrate that the proposed strategy effectively schedules grid jobs in a fault-tolerant way despite the highly dynamic nature of the grid.
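The idea of scaling checkpointing and replication to a resource's fault history can be illustrated with a minimal Python sketch. The abstract does not give the paper's actual formulas, so everything here is an assumption made for illustration: the `ResourceHistory` record, the `plan_fault_tolerance` function, the linear mapping from fault rate to checkpoint interval, and all default parameter values are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class ResourceHistory:
    """Hypothetical per-resource fault record, as a GIS might store it."""
    jobs_assigned: int = 0
    jobs_failed: int = 0

    def fault_rate(self) -> float:
        # Fraction of past jobs that failed; 0.0 when there is no history yet.
        if self.jobs_assigned == 0:
            return 0.0
        return self.jobs_failed / self.jobs_assigned


def plan_fault_tolerance(history: ResourceHistory,
                         base_interval_s: float = 600.0,
                         min_interval_s: float = 60.0,
                         max_replicas: int = 3) -> tuple[float, int]:
    """Return (checkpoint interval in seconds, number of job copies).

    Assumed policy (not taken from the paper): the more fault-prone the
    resource, the shorter the checkpoint interval and the more replicas.
    """
    rate = history.fault_rate()
    # Shrink the interval linearly with the fault rate, with a lower bound.
    interval = max(base_interval_s * (1.0 - rate), min_interval_s)
    # One copy for a fault-free resource, up to max_replicas for rate 1.0.
    replicas = 1 + math.ceil(rate * (max_replicas - 1))
    return interval, replicas


if __name__ == "__main__":
    for assigned, failed in [(10, 0), (10, 5), (10, 10)]:
        plan = plan_fault_tolerance(ResourceHistory(assigned, failed))
        print(assigned, failed, plan)
```

Under these assumptions, a resource broker would call `plan_fault_tolerance` with the history fetched for each candidate resource and apply heavier checkpointing and replication only where the history suggests a higher tendency toward faults.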
Pages: 708 / +
Number of pages: 3
Related papers
10 items in total
[1]  
[Anonymous], 1999, GRID BLUEPRINT FUTUR
[2]  
[Anonymous], 2004, GRID 2 BLUEPRINT NEW
[3]   THE N-VERSION APPROACH TO FAULT-TOLERANT SOFTWARE [J].
AVIZIENIS, A .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1985, 11 (12) :1491-1501
[4]  
BURCHARD LO, 2005, P 17 INT S COMP ARCH
[5]  
BUYYAL R, 2005, INTERSCIENCE 0131
[6]  
Fernandes Lopes R, 2006, P 6 IEEE INT S CLUST
[7]  
HUDA MT, 2005, 1 INT C E SCI GRID C
[8]  
KHOO BTB, 2001, DYNAMIC ESTIMATION S
[9]  
MEIDEIROS R, 2003, P 4 INT WORKSH GRID
[10]  
[No title captured]