Web usage mining with intentional browsing data

被引:16
作者
Tao, Yu-Hu [1 ]
Hong, Tzung-Pe [2 ]
Su, Yu-Ming [3 ]
机构
[1] Natl Univ Kaohsiung, Dept Informat Management, Kaohsiung 811, Taiwan
[2] Natl Univ Kaohsiung, Dept Elect Engn, Kaohsiung 811, Taiwan
[3] InfoChamp Syst Corp, Kaohsiung 811, Taiwan
关键词
web usage mining; intentional browsing data; web log files; browsing behaviour; knowledge discovery;
D O I
10.1016/j.eswa.2007.02.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many researches have developed Web usage mining (WUM) algorithms utilizing Web log records in order to discover useful knowledge to be used in supporting business applications and decision making. The quality of WUM in knowledge discovery, however, depends on the algorithm as well as on the data. This research explores a new data source called intentional browsing data (IBD) for potentially improving the effectiveness of WUM applications. IBD is a category of online browsing actions, such as "copy", "scroll", or "save as," and is not recorded in Web log files. Consequently, the research aims to build a basic understanding of IBD which will lead to its easy adoption in WUM research and practice. Specifically, this paper formally defines IBD and clarifies its relationships with other browsing data via a proposed taxonomy. In order to make IBD available like Web log files, an online data collection mechanism for capturing IBD is also proposed and discussed. The potential benefits of IBD can be justified in terms of its enhancing and complementary effectiveness, which are illustrated by the rule implications of Web transaction mining algorithm for an EC application. Introducing IBD opens up the scope of WUM research and applications in knowledge discovery. (c) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1893 / 1904
页数:12
相关论文
共 32 条
[1]  
AGRAWAL R, 1995, PROC INT CONF DATA, P3, DOI 10.1109/ICDE.1995.380415
[2]  
Agrawal R., 1994, Proceedings of the 20th International Conference on Very Large Data Bases. VLDB'94, P487
[3]   Web log data warehousing and mining for intelligent web caching [J].
Bonchi, F ;
Giannotti, F ;
Gozzi, C ;
Manco, G ;
Nanni, M ;
Pedreschi, D ;
Renso, C ;
Ruggieri, S .
DATA & KNOWLEDGE ENGINEERING, 2001, 39 (02) :165-189
[4]   CHARACTERIZING BROWSING STRATEGIES IN THE WORLD-WIDE-WEB [J].
CATLEDGE, LD ;
PITKOW, JE .
COMPUTER NETWORKS AND ISDN SYSTEMS, 1995, 27 (06) :1065-1073
[5]  
CHAN C, 1997, THESIS NATL CENTRAL
[6]   Efficient data mining for path traversal patterns [J].
Chen, MS ;
Park, JS ;
Yu, PS .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1998, 10 (02) :209-221
[7]  
CHEN Z, 2000, 6 INT C INF MAN RES
[8]  
Cooley R., 1999, Knowledge and Information Systems, V1, P5
[9]   Determining WWW user's next access and its application to pre-fetching [J].
Cunha, CR ;
Jaccoud, CFB .
SECOND IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, PROCEEDINGS, 1997, :6-11
[10]  
FANN C, 1999, THESIS NATL PINGT U