Achieving Guaranteed Service with Fault-Tolerant Resources in Grid

被引:1
|
作者
Goswami, Sukalyan [1 ]
Das, Ajanta [2 ]
机构
[1] Inst Engn & Management, Kolkata, India
[2] Birla Inst Technol Mesra, Dept Comp Sci & Engn, Kolkata Campus, Kolkata 700107, India
来源
INFORMATION AND COMMUNICATION TECHNOLOGY (ICICT 2016) | 2018年 / 625卷
关键词
Grid computing; Job allocation; Runtime backup; Quality-of-service; Resource failure;
D O I
10.1007/978-981-10-5508-9_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Composed of loosely coupled virtual resources, grid, being highly distinguished from traditional high-performance computing, is extensively used in computation-intensive problem solving in the arenas of science and technology. Maintaining performance or balancing load of each resource in grid is always more challenging with high chances of resource failure. The objective of this paper is to improve the efficiency of the Nearest Deadline First Scheduled (NDFS) algorithm considering resource failure a sudden occurrence in grid. The algorithm introduces periodical runtime backup to another available resource for retaining Quality-of-Service as approved in service quality agreement. This paper presents multiple job execution cases through implementation of benchmark codes executed in local grid test bed using Globus Toolkit middleware, with an emphasis on resource failure phenomenon of grid. These experimental results establish the requirements of the proposed algorithm to ensure the job deadline misses get reduced even if unexpected resource failures happen.
引用
收藏
页码:189 / 196
页数:8
相关论文
共 50 条
  • [1] NodeWiz: Fault-tolerant grid information service
    Basu, Sujoy
    Costa, Lauro Beltrao
    Brasileiro, Francisco
    Banerjee, Sujata
    Sharma, Puneet
    Lee, Sung-Ju
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2009, 2 (04) : 348 - 366
  • [2] NodeWiz: Fault-tolerant grid information service
    Sujoy Basu
    Lauro Beltrão Costa
    Francisco Brasileiro
    Sujata Banerjee
    Puneet Sharma
    Sung-Ju Lee
    Peer-to-Peer Networking and Applications, 2009, 2 : 348 - 366
  • [3] Migol: A fault-tolerant service framework for MPI applications in the grid
    Luckow, A
    Schnor, B
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, PROCEEDINGS, 2005, 3666 : 258 - 267
  • [4] Migol: A fault-tolerant service framework for MPI applications in the grid
    Luckow, Andre
    Schnor, Bettina
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2008, 24 (02): : 142 - 152
  • [5] Fault-tolerant certainty grid
    Elmenreich, W
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS 2003, VOL 1-3, 2003, : 1576 - 1581
  • [6] Fault-tolerant grid architecture and practice
    Hai Jin
    DeQing Zou
    HanHua Chen
    JianHua Sun
    Song Wu
    Journal of Computer Science and Technology, 2003, 18 : 423 - 433
  • [7] Fault-tolerant grid architecture and practice
    Jin, H
    Zou, DQ
    Chen, HH
    Sun, JH
    Wu, S
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2003, 18 (04) : 423 - 433
  • [8] Fault-tolerant grid monitoring system
    Li, Yiqi
    Dong, Shouling
    Zhang, Ling
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2006, 34 (SUPPL.): : 164 - 166
  • [9] A fault-tolerant architecture for Grid system
    Liu, LX
    Wu, QY
    Zhou, B
    GRID AND COOPERATIVE COMPUTING GCC 2004, PROCEEDINGS, 2004, 3251 : 58 - 64
  • [10] Achieving fault-tolerant software with rejuvenation and reconfiguration
    Yurcik, W
    Doss, D
    IEEE SOFTWARE, 2001, 18 (04) : 48 - +