Nonparametric estimation of long-tailed density functions and its application to the analysis of World Wide Web traffic

被引:18
作者
Markovitch, NM
Krieger, UR
机构
[1] Russian Acad Sci, Inst Control Sci, Moscow 117806, Russia
[2] Tnova Deutsch Telekom, Technol Zentrum, D-64295 Darmstadt, Germany
[3] Goethe Univ Frankfurt, Dept Comp Sci, D-60054 Frankfurt, Germany
关键词
World Wide Web; nonparametric density estimation; Parzen-Rosenblatt estimate; polygram; long-tailed distribution; consistency;
D O I
10.1016/S0166-5316(00)00031-6
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The study of WWW-traffic measurements has shown that different traffic characteristics can be modeled by long-tail distributed random variables (r.v.s). In this paper we discuss the nonparametric estimation of the probability density function of long-tailed distributions. Two nonparametric estimates, a Parzen-Rosenblatt kernel estimate and a histogram with variable bin width called polygram, are considered. The consistency of these estimates for heavy-tailed densities is discussed. To provide the consistency of the estimates in the metric space L-1, the transformation of the initial r.v. to a new r.v. distributed on the interval [0, 1] is proposed. Then the proposed estimates are applied to analyze real data of WWW-sessions. The latter are characterized by the sizes of the responses and inter-response intervals as well as the sizes and durations of sub-sessions. By these means the effectiveness of the nonparametric procedures in comparison to parametric models of the WWW-traffic characteristics is demonstrated. (C) 2000 Published by Elsevier Science B.V.
引用
收藏
页码:205 / 222
页数:18
相关论文
共 20 条
[1]  
ABOUJAOUDE S, 1976, ANN I H POINCARE B, V12, P299
[2]  
[Anonymous], 1982, ESTIMATION DEPENDENC
[3]  
BARFORD P, 1999, PERFORM EVAL REV AUG
[4]   DISTRIBUTION ESTIMATION CONSISTENT IN TOTAL VARIATION AND IN 2 TYPES OF INFORMATION DIVERGENCE [J].
BARRON, AR ;
GYORFI, L ;
VANDERMEULEN, EC .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (05) :1437-1454
[5]  
Bol'shev L. N., 1968, TABLES MATH STAT
[6]   CONSISTENT CROSS-VALIDATED DENSITY-ESTIMATION [J].
CHOW, YS ;
GEMAN, S ;
WU, LD .
ANNALS OF STATISTICS, 1983, 11 (01) :25-38
[7]  
Crovella ME, 1998, PRACTICAL GUIDE TO HEAVY TAILS, P3
[8]   Estimating the tail index [J].
Csörgö, S ;
Viharos, L .
ASYMPTOTIC METHODS IN PROBABILITY AND STATISTICS: A VOLUME IN HONOUR OF MIKLOS CSORGO, 1998, :833-881
[9]  
DEVROYE L, 1985, NONPARAMETRIC DENIST
[10]   BEST OBTAINABLE ASYMPTOTIC RATES OF CONVERGENCE IN ESTIMATION OF A DENSITY-FUNCTION AT A POINT [J].
FARRELL, RH .
ANNALS OF MATHEMATICAL STATISTICS, 1972, 43 (01) :170-&