A hierarchical modeling and analysis for grid service reliability

被引:53
作者
Dai, Yuan-Shun
Pan, Yi
Zou, Xukai
机构
[1] Indiana Univ Purdue Univ, Dept Comp & Informat Sci, Indianapolis, IN 46202 USA
[2] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30302 USA
关键词
grid reliability; resource management system; Markov model; queuing theory; graph theory;
D O I
10.1109/TC.2007.1034
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Grid computing is a recently developed technology. Although the developmental tools and techniques for the grid have been extensively studied, grid reliability analysis is not easy because of its complexity. This paper is the first one that presents a hierarchical model for the grid service reliability analysis and evaluation. The hierarchical modeling is mapped to the physical and logical architecture of the grid service system and makes the evaluation and calculation tractable by identifying the independence among layers. Various types of failures are interleaved in the grid computing environment, such as blocking failures, time-out failures, matchmaking failures, network failures, program failures, and resource failures. This paper investigates all of them to achieve a complete picture about grid service reliability. Markov models, Queuing theory, and Graph theory are mainly used here to model, evaluate, and analyze the grid service reliability. Numerical examples are illustrated.
引用
收藏
页码:681 / 691
页数:11
相关论文
共 28 条
  • [1] A computational economy for grid computing and its implementation in the Nimrod-G resource broker
    Abramson, D
    Buyya, R
    Giddy, J
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2002, 18 (08): : 1061 - 1074
  • [2] [Anonymous], GRID RESOURCE MANAGE
  • [3] Utilizing widely distributed computational resources efficiently with execution domains
    Basney, J
    Livny, M
    Mazzanti, P
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2001, 140 (1-2) : 246 - 252
  • [4] Adaptive computing on the grid using AppLeS
    Berman, F
    Wolski, R
    Casanova, H
    Cirne, W
    Dail, H
    Faerman, M
    Figueira, S
    Hayes, J
    Obertelli, G
    Schopf, J
    Shao, G
    Smallen, S
    Spring, N
    Su, A
    Zagorodnov, D
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2003, 14 (04) : 369 - 382
  • [5] ALMOST CERTAIN FAULT-DIAGNOSIS THROUGH ALGORITHM-BASED FAULT-TOLERANCE
    BLOUGH, DM
    PELC, A
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1994, 5 (05) : 532 - 539
  • [6] CAO J, 2002, SCI PROGRAMMING-NETH, V10, P135
  • [7] A heuristic approach to generating file spanning trees for reliability analysis of distributed computing systems
    Chen, DJ
    Chen, RS
    Huang, TH
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1997, 34 (10) : 115 - 131
  • [8] Chen Q, 1992, Indoor Air, V2, P154, DOI DOI 10.1111/j.1600-0668.1992.04-23.x
  • [9] Reliability analysis of grid computing systems
    Dai, YS
    Me, M
    Poh, KL
    [J]. 2002 PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING, PROCEEDINGS, 2002, : 97 - 104
  • [10] A study of service reliability and availability for distributed systems
    Dai, YS
    Xie, M
    Poh, KL
    Liu, GQ
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2003, 79 (01) : 103 - 112