Impact of traffic distribution on web cache performance

被引:0
作者
Zotano, Manuel Gómez [1 ]
Gómez-Sanz, Jorge [2 ]
Pavón, Juan [2 ]
机构
[1] Corporacion de Radiotelevision Espanola, Alcalde Sainz de Baranda 92, Madrid
[2] Facultad de Informática UCM, Universidad Complutense de Madrid, Ciudad Universitaria s/n, Madrid
关键词
Hit rate; Web cache; Web logs; Web performance; Zipf distribution;
D O I
10.1504/IJWET.2015.072349
中图分类号
学科分类号
摘要
Caches are a critical element of web-based information systems. Understanding the expected behaviour of cache policies is especially important for achieving good quality of service. Existing works have suggested that the behaviour of the web demand can be modelled as a Zipf distribution with α≤1. New evidence, which is presented in this paper, shows that today websites are following Zipf distributions with α > 1. This article analyses real logs obtained from the client layer of high traffic websites. The main result of this article is that under these conditions, the cache hit ratio can be extremely high with a very small cache size. This means that a very expensive and high resource demanding cache is not needed for effective implementation: A cache size equal to 0.6% of the working set is enough to reach more than 80% of hit ratio, once the right replacement policy has been chosen. Copyright © 2015 Inderscience Enterprises Ltd.
引用
收藏
页码:202 / 213
页数:11
相关论文
共 19 条
  • [1] Arlitt M., Jin T., A workload characterization study of the 1998 World Cup web site, Network, IEEE, 14, 3, pp. 30-37, (2000)
  • [2] Arlitt M., Cherkasova L., Dilley J., Friedrich R., Jin T., Evaluating content management techniques for web proxy caches, Proceedings of the 2nd Workshop on Internet Server Performance, (1999)
  • [3] Breslau L., Cao P., Fan L., Phillips G., Shenker S., Web caching and Zipf-like distributions: Evidence and implications, INFOCOM '99, Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies, Proceedings, 1, pp. 126-134, (1999)
  • [4] Cherkasova L., Ciardo G., Role of aging, frequency, and size in web cache replacement policies, High-Performance Computing and Networking, pp. 114-212, (2001)
  • [5] Chesire M., Wolman A., Voelker G.M., Levy H.M., Measurement and analysis of a streaming media workload, USITS, 1, pp. 1-1, (2001)
  • [6] ElAarag H., Romano S., Cobb J., Web Proxy Cache Replacement Strategies: Simulation, Implementation, and Performance Evaluation, (2012)
  • [7] Fielding R., Gettys J., Mogul J., Frystyk H., Masinter L., Leach P., Berners-Lee T., Hypertext Transfer Protocol HTTP/1.1, (1999)
  • [8] Heen T.F., Lyngstl K., Renard J., The Varnish Book, (2013)
  • [9] Huang Q., Birman K., Van Renesse R., Lloyd W., Kumar S., Li H.C., An analysis of Facebook photo caching, Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, pp. 167-181, (2013)
  • [10] Mahanti A., Carlsson N., Arlitt M., Williamson C., A tale of the tails: Power-laws in internet measurements, Network, 27, 1, pp. 59-64, (2013)