Discovering Statistical Models of Availability in Large Distributed Systems: An Empirical Study of SETI@home

被引：66

作者：

Javadi, Bahman ^{[1
]}

Kondo, Derrick ^{[2
]}

Vincent, Jean-Marc ^{[3
]}

Anderson, David P. ^{[4
]}

机构：

[1] Univ Melbourne, Comp Sci & Software Engn Dept, Melbourne, Vic 3053, Australia

[2] ZIRST, ENSIMAG, Lab LIG, INRIA, F-38330 Montbonnot St Martin, France

[3] Univ Grenoble 1, Dept Comp Sci, LIG, F-38041 Grenoble, France

[4] Univ Calif Berkeley, Space Sci Lab, Berkeley, CA 94720 USA

来源：

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS | 2011年 / 22卷 / 11期

关键词：

Statistical availability models; reliability; resource failures; stochastic scheduling; CAPACITY;

D O I：

10.1109/TPDS.2011.50

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In the age of cloud, Grid, P2P, and volunteer distributed computing, large-scale systems with tens of thousands of unreliable hosts are increasingly common. Invariably, these systems are composed of heterogeneous hosts whose individual availability often exhibit different statistical properties (for example stationary versus nonstationary behavior) and fit different models (for example exponential, Weibull, or Pareto probability distributions). In this paper, we describe an effective method for discovering subsets of hosts whose availability have similar statistical properties and can be modeled with similar probability distributions. We apply this method with about 230,000 host availability traces obtained from a real Internet-distributed system, namely SETI@home. We find that about 21 percent of hosts exhibit availability, that is, a truly random process, and that these hosts can often be modeled accurately with a few distinct distributions from different families. We show that our models are useful and accurate in the context of a scheduling problem that deals with resource brokering. We believe that these methods and models are critical for the design of stochastic scheduling algorithms across large systems where host availability is uncertain.

引用

页码：1896 / 1903

页数：8

共 18 条

[1]

[Anonymous], P 9 IEEE INT S CLUST

[2]

[Anonymous], 2006, Introduction to Time Series and Forecasting

[3]

[Anonymous], 2010, CCGrid, DOI DOI 10.1109/CCGRID.2010.71

[4]

Anselmi J., 2010, 00457603 INRIA

[5] INDIVIDUAL VERSUS SOCIAL OPTIMIZATION IN THE ALLOCATION OF CUSTOMERS TO ALTERNATIVE SERVERS [J].

BELL, CE ;

STIDHAM, S .

MANAGEMENT SCIENCE, 1983, 29 (07) :831-839

[6]

Bolosky W., 2000, P ACM SIGMETRICS INT

[7] A Goodness-of-Fit statistical toolkit [J].

Cirrone, GAP ;

Donadio, S ;

Guatelli, S ;

Mantero, A ;

Mascialino, B ;

Parlati, S ;

Pia, MG ;

Pfeiffer, A ;

Ribon, A ;

Viarengo, P .

IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2004, 51 (05) :2056-2063

[8]

Elkan C., 2003, MACHINE LEARNING-INTERNATIONAL WORKSHOP THEN CONFERENCE, P147, DOI DOI 10.1016/0026-2714(92)90278-S

[9]

Hordijk A., 2001, Integer Programming and Combinatorial Optimization. 8th International IPCO Conference. Proceedings (Lecture Notes in Computer Science Vol.2081), P236

[10]

Iosup A., 2008, P IEEE S HIGH PERF D

← 1 2 →