Wikipedia workload analysis for decentralized hosting

被引:204
作者
Urdaneta, Guido [1 ]
Pierre, Guillaume [1 ]
van Steen, Maarten [1 ]
机构
[1] Vrije Univ Amsterdam, Dept Comp Sci, NL-1081 HV Amsterdam, Netherlands
关键词
Workload analysis; Wikipedia; Decentralized hosting; P2P;
D O I
10.1016/j.comnet.2009.02.019
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We study an access trace containing a sample of Wikipedia's traffic over a 107-day period aiming to identify appropriate replication and distribution strategies in a fully decentralized hosting environment. We perform a global analysis of the whole trace, and a detailed analysis of the requests directed to the English edition of Wikipedia. In our study, we classify client requests and examine aspects such as the number of read and save operations, significant load variations and requests for nonexisting pages. We also review proposed decentralized wiki architectures and discuss how they would handle Wikipedia's workload. We conclude that decentralized architectures must focus on applying techniques to efficiently handle read operations while maintaining consistency and dealing with typical issues on decentralized systems such as churn, unbalanced loads and malicious participating nodes. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:1830 / 1845
页数:16
相关论文
共 35 条
  • [1] Adler Stephen., SLASHDOT EFFECT ANAL
  • [2] *AL INT, 2007, AL WEB SEARCH TOP 50
  • [3] ALMEIDA V, 1996, P IEEE C PAR DISTR I
  • [4] ALSHATTNAWI S, 2008, P INT S COLL TECHN S
  • [5] [Anonymous], 2007, Proceedings of the International Symposium on Wikis, DOI DOI 10.1145/1296951.1296968
  • [6] Arlitt M., 2001, ACM T INTERNET TECHN, V1, P44
  • [7] ARLITT MF, 1996, P ACM SIGMETRICS 96, P126, DOI DOI 10.1145/233013.233034
  • [8] Bent L., 2004, Proceedings of the 13th international conference onWorldWideWeb,WWW'04, P522, DOI DOI 10.1145/988672.988743
  • [9] Bergsma M., 2007, Wikimedia architecture
  • [10] Blake C., 2003, Proceedings of the 9th Workshop on Hot Topics in Operating Systems. HotOS'03, P1