Data Preprocessing Techniques for Pre-Fetching and Caching of Web Data through Proxy Server

Cited by: 0
Authors
Sathiyamoorthi, V. [1]
Bhaskaran, Murali [2]
Affiliations
[1] Sona Coll Technol, Dept CSE, Salemi 5, Tamil Nadu, India
[2] Paavai Coll Engn, Paachal 637018, Tamil Nadu, India
Source
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY | 2011 / Vol. 11 / No. 11
Keywords
Web mining; proxy server; data mining; preprocessing; prefetching; caching;
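DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract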
The rapid growth of the Web over the last decade, in both the number of Web sites and the number of users, has put considerable pressure on Web site owners to reduce the latency of Web pages. Web caching and Web pre-fetching are two important techniques used to tackle this problem and reduce the response time perceived by users. The two techniques complement each other: Web caching exploits temporal locality, whereas Web pre-fetching exploits the spatial locality of Web objects. Integrating Web caching and Web pre-fetching reduces both latency and search space. As a result of this growth, a huge amount of data about users' interactions with Web sites is recorded in Web access logs. The Web access log plays an important role in predicting user access patterns and in pre-fetching and caching Web data for better performance. Different data mining techniques can be applied to Web usage data to mine user access patterns, and this knowledge can be used in a variety of applications such as system improvement, Web site modification, and business intelligence. This paper discusses various data preprocessing techniques that are applied to proxy server access logs to generate Web access patterns, and whose output can also be used for further applications.
Pages: 92-98
Page count: 7