Popularity-based PPM: An effective web prefetching technique for high accuracy and low storage

被引:15
作者
Chen, X [1 ]
Chen, XD [1 ]
机构
[1] Coll William & Mary, Dept Comp Sci, Williamsburg, VA 23187 USA
来源
2002 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, PROCEEDING | 2002年
关键词
D O I
10.1109/ICPP.2002.1040885
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Prediction by Partial Match (PPM) is a commonly used technique in Web prefetching, where prefetching decisions are made based on historical URLs in a dynamically maintained Markov prediction tree. Existing approaches either widely store the URL nodes by building the tree with a fixed height in each branch, or only store the branches with frequently accessed URLs. Building the popularity information into the Markov prediction tree, we propose a new prefetching, model, called popularity-based PPM. In this model, the tree is dynamically updated with a variable height in each set of branches where a popular URL can lead a set of long branches, and a less popular document leads a set of short ones, Since majority root nodes are popular URLs in our approach, the space allocation for storing nodes are effectively utilized. We have also included two additional optimizations in this model: (1) directly linking a root node to duplicated popular nodes in a surfing path to give popular URLs more considerations for prefetching; and (2) making a space optimization after the tree is built to further remove less popular nodes. Our trace-driven simulation results comparatively show a significant space reduction and an improved prediction accuracy of the proposed prefetching technique.
引用
收藏
页码:296 / 304
页数:9
相关论文
共 24 条
[1]  
AMRKATOS EP, 1996, 173 ICSFORTH
[2]  
[Anonymous], P 2 USENIX S INT TEC
[3]   Changes in Web client access patterns: Characteristics and caching implications [J].
Barford P. ;
Bestavros A. ;
Bradley A. ;
Crovella M. .
World Wide Web, 1999, 2 (1-2) :15-28
[4]  
BESTRAVOS A, 1995, P 4 ACM INT C INF KN
[5]  
Brin S., 1998, 7 INT WORLD WIDE WEB
[6]  
CACERES R, 1998, P WORKSH INT SERV PE
[7]  
CHEN X, 2001, POPULARITY BASED WEB
[8]  
CHEN X, 2001, P 2 PERF ARCH WEB SE
[9]   DATA-COMPRESSION USING ADAPTIVE CODING AND PARTIAL STRING MATCHING [J].
CLEARY, JG ;
WITTEN, IH .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1984, 32 (04) :396-402
[10]  
COHEN T, 1998, P ACM SIGCOMM 1998, P241