An Improved Session Identification Approach in Web Log Mining for Web Personalization

被引:2
作者
Sengottuvelan, P. [1 ]
Lokeshkumar, R. [1 ]
Gopalakrishnan, T. [1 ]
机构
[1] Bannari Amman Inst Technol, Dept Informat Technol, Coimbatore, Tamil Nadu, India
来源
JOURNAL OF INTERNET TECHNOLOGY | 2017年 / 18卷 / 04期
关键词
Data preprocessing; Web log mining; Session Identification; Web personalization;
D O I
10.6138/JIT.2017.18.4.20150113
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This Web based applications are increasing at an enormous speed and as a result its users are also increasing at an exponential speed. The innovative and evolutionary changes in technology have made it possible to capture the user's fundamental nature and interactions with web applications through web server log file as web usage. In order to design attractive web sites, designers must understand their user's needs. Therefore analyzing navigational behavior of users is an important part of web page design. Web Usage Mining (WUM) is the application of data mining techniques to web usage data in order to discover the patterns that can be used to analyze the user's navigational behavior. Since web contains large amount of "irrelevant information" in the web log, the original log file cannot be directly used in the WUM process. Therefore, the preprocessing of web log file becomes very important in order to improve the accuracy in Web log mining. The basic procedure of data preprocessing is introduced firstly in this paper with the traditional session identification algorithm is fully analyzed, on the basis of which a session identification algorithm on page threshold and dynamic timeout is presented. Finally, the initial timeout is computed for each page according to sessions formed, combining with the importance degree of improved dynamic threshold algorithm which discards the uninterested attributes from log file. Comparing experiment shows that the algorithm Proposed can obtain a better performance on session identification and user interests which is the key for web personalization.
引用
收藏
页码:723 / 730
页数:8
相关论文
共 23 条
[11]  
Hu J, 2005, 2005 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, P257
[12]  
Javed M, 2011, LECT NOTES ARTIF INT, V7091, P136, DOI 10.1007/978-3-642-25975-3_13
[13]   Web mining: Research and practice [J].
Kolari, P ;
Joshi, A .
COMPUTING IN SCIENCE & ENGINEERING, 2004, 6 (04) :49-53
[14]   The Construction of Transactions for Web Usage Mining [J].
Li, Yan ;
Feng, Bo-qin .
PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND NATURAL COMPUTING, VOL I, 2009, :121-124
[15]   A Hybrid Information Filtering Algorithm Based on Distributed Web log Mining [J].
Ling Yun ;
Wang Xun ;
Gu Huamao .
THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, :1086-1091
[16]  
Mobasher B., 2007, The Adaptive Web. Methods and Strategies of Web Personalization, P90
[17]  
Cruz JRP, 2012, J INTERNET TECHNOL, V13, P67
[18]   Web usage mining as a tool for personalization: A survey [J].
Pierrakos, D ;
Paliouras, G ;
Papatheodorou, C ;
Spyropoulos, CD .
USER MODELING AND USER-ADAPTED INTERACTION, 2003, 13 (04) :311-372
[19]  
Spiliopoulou M., 1999, EDBT WORKSH WEBDB98, P27
[20]   Web usage mining with intentional browsing data [J].
Tao, Yu-Hu ;
Hong, Tzung-Pe ;
Su, Yu-Ming .
EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (03) :1893-1904