Web Usage Mining: Dynamic Methodology to Preprocessing Web Logs

被引:2
作者
Manchanda, Mahesh [1 ]
Gupta, Neena [2 ]
机构
[1] Graph Era Hill Univ, Dehra Dun, Uttar Pradesh, India
[2] Gurukul Kangri Vishwavidyalaya, Haridwar, India
来源
HELIX | 2018年 / 8卷 / 05期
关键词
Web Usage Mining; Data Cleaning; URL Rank; GSPAN; Web Pre-Fetching;
D O I
10.29042/2018-3810-3815
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Internet is a huge source of massive information for retrieving information and searching knowledge from WWW, leading to increase network traffic, access delay & server overload, which results in poor web services. With the use of Web-caching & web prefetching techniques to enhance the performance of web services where web mining techniques play an important role to decide which web object should be pm-fetched from server and stored in proxy cache memory so that the web object with high probability of request, in the next couple of days, serves as the base of the proxy cache. But for efficient web mining and to extract meaningful usage access pattern, the raw log file must be transformed into a meaningful & formatted file. This paper proposed a new dynamic preprocess technique to create a dynamic training dataset for prediction model using web mining, and Graph based substructure Pattern Mining (GSPAN) for improved preprocessing using proxy log. The proposed model would help in minimizing the cache size by 40% thus improving the overall performance.
引用
收藏
页码:3810 / 3815
页数:6
相关论文
共 50 条
[41]   Pattern Discovery of Web Usage Mining [J].
Nina, Shahnaz Parvin ;
Rahaman, Md Mahamudur ;
Bhuiyan, Md Khairul Islam ;
Ahmed, Khandakar Entenam Unayes .
PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT, VOL 1, 2009, :499-503
[42]   A Comprehensive Study of Web Usage Mining [J].
Dhandi, Monika ;
Chakrawarti, Rajesh Kumar .
2016 SYMPOSIUM ON COLOSSAL DATA ANALYSIS AND NETWORKING (CDAN), 2016,
[43]   A Probabilistic Model for Web Usage Mining [J].
David, Nicoleta ;
Patrascu, Lucian ;
Sasu, Adela ;
Damian, Daniela .
PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND INFORMATICS, 2009, :129-+
[44]   IMPROVING THE INTERESTINGNESS OF WEB USAGE MINING [J].
杨怡玲 ;
管旭东 ;
尤晋元 .
Journal of Shanghai Jiaotong University, 2002, (01) :15-22
[45]   Performance Evaluation of the MapReduce-based Parallel Data Preprocessing Algorithm in Web Usage Mining with Robot Detection Approaches [J].
Srivastava, Mitali ;
Srivastava, Atul Kumar ;
Garg, Rakhi ;
Mishra, P. K. .
IETE TECHNICAL REVIEW, 2022, 39 (04) :865-879
[46]   Personalizing Web Recommendations Using Web Usage Mining and Web Semantics with Time Attribute [J].
Moh, Teng-Sheng ;
Saxena, Neha Sushi .
INFORMATION SYSTEMS, TECHNOLOGY AND MANAGEMENT, PROCEEDINGS, 2010, 54 :244-254
[47]   Personalizing web recommendations using web usage mining and web semantics with time attribute [J].
Moh T.-S. ;
Saxena N.S. .
Communications in Computer and Information Science, 2010, 54 :244-254
[48]   Study on Web Mining Algorithm Based on Usage Mining [J].
Han, Qingtian ;
Gao, Xiaoyan ;
Wu, Wenguo .
9TH INTERNATIONAL CONFERENCE ON COMPUTER-AIDED INDUSTRIAL DESIGN & CONCEPTUAL DESIGN, VOLS 1 AND 2: MULTICULTURAL CREATION AND DESIGN - CAID& CD 2008, 2008, :1121-+
[49]   An Efficient Periodic Web Content Recommendation Based on Web Usage Mining [J].
Khatri, Ravi ;
Gupta, Daya .
2015 IEEE 2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION SYSTEMS (RETIS), 2015, :132-137
[50]   A Web Usage Lattice Based Mining Approach for Intelligent Web Personalization [J].
Zhou, Baoyao ;
Hui, Siu Cheung ;
Fong, Alvis C. M. .
INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2005, 1 (03) :137-+