Host load prediction using linear models

被引:130
作者
Peter A. Dinda
David R. O'Hallaron
机构
[1] Northwestern University,Department of Computer Science
[2] Carnegie Mellon University,Computer Science Department
[3] Carnegie Mellon University,Electrical and Computer Engineering Department
关键词
Operating System; Linear Model; Communication Network; Predictive Power; Load Average;
D O I
10.1023/A:1019048724544
中图分类号
学科分类号
摘要
This paper evaluates linear models for predicting the Digital Unix five‐second host load average from 1 to 30 seconds into the future. A detailed statistical study of a large number of long, fine grain load traces from a variety of real machines leads to consideration of the Box–Jenkins models (AR, MA, ARMA, ARIMA), and the ARFIMA models (due to self‐similarity.) We also consider a simple windowed‐mean model. The computational requirements of these models span a wide range, making some more practical than others for incorporation into an online prediction system. We rigorously evaluate the predictive power of the models by running a large number of randomized testcases on the load traces and then data‐mining their results. The main conclusions are that load is consistently predictable to a very useful degree, and that the simple, practical models such as AR are sufficient for host load prediction. We recommend AR(16) models or better for host load prediction. We implement an online host load prediction system around the AR(16) model and evaluate its overhead, finding that it uses miniscule amounts of CPU time and network bandwidth.
引用
收藏
页码:265 / 280
页数:15
相关论文
共 18 条
[1]  
Beran J.(1992)Statistical methods for data with long-range dependence Statist. Sci. 7 404-427
[2]  
Eager D.L.(1986)Adaptive load sharing in homogeneous distributed systems IEEE Trans. Software Engrg. 12 662-675
[3]  
Lazowska E.D.(1980)An introduction to long-memory time series models and fractional differencing J. Time Series Anal. 1 15-29
[4]  
Zahorjan J.(1994)A time series model of longterm NSFNET backbone traffic Proceedings of the IEEE International Conference on Communications (ICC'94) 3 1400-1404
[5]  
Granger C.W.J.(1989)Space-time modelling with long-memory dependence: Assessing Ireland's wind power resource Appl. Statist. 38 1-50
[6]  
Joyeux R.(1981)Fractional differencing Biometrika 68 165-176
[7]  
Groschwitz N.C.(1986)Load-balancing heuristics and process behavior Proceedings of Performance and ACM SIGMETRICS 14 54-69
[8]  
Polyzos G.C.(1991)The available capacity of a privately owned workstation environment Performance Evaluation 12 269-284
[9]  
Haslett J.(1997)Architectural support for quality of service for CORBA objects Theory and Practice of Object Systems 3 55-73
[10]  
Raftery A.E.(undefined)undefined undefined undefined undefined-undefined