Web Usage Mining: Dynamic Methodology to Preprocessing Web Logs

被引:2
作者
Manchanda, Mahesh [1 ]
Gupta, Neena [2 ]
机构
[1] Graph Era Hill Univ, Dehra Dun, Uttar Pradesh, India
[2] Gurukul Kangri Vishwavidyalaya, Haridwar, India
来源
HELIX | 2018年 / 8卷 / 05期
关键词
Web Usage Mining; Data Cleaning; URL Rank; GSPAN; Web Pre-Fetching;
D O I
10.29042/2018-3810-3815
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Internet is a huge source of massive information for retrieving information and searching knowledge from WWW, leading to increase network traffic, access delay & server overload, which results in poor web services. With the use of Web-caching & web prefetching techniques to enhance the performance of web services where web mining techniques play an important role to decide which web object should be pm-fetched from server and stored in proxy cache memory so that the web object with high probability of request, in the next couple of days, serves as the base of the proxy cache. But for efficient web mining and to extract meaningful usage access pattern, the raw log file must be transformed into a meaningful & formatted file. This paper proposed a new dynamic preprocess technique to create a dynamic training dataset for prediction model using web mining, and Graph based substructure Pattern Mining (GSPAN) for improved preprocessing using proxy log. The proposed model would help in minimizing the cache size by 40% thus improving the overall performance.
引用
收藏
页码:3810 / 3815
页数:6
相关论文
共 50 条
[21]   Using Entropy in Web Usage Data Preprocessing [J].
Munk, Michal ;
Benko, Lubomir .
ENTROPY, 2018, 20 (01)
[22]   Analysis of web usage mining [J].
Hui Yu ;
Zhongmin Lu .
PROCEEDINGS OF THE 2006 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING, 2006, :1291-1296
[23]   An Overview on Web Usage Mining [J].
Neelima, G. ;
Rodda, Sireesha .
EMERGING ICT FOR BRIDGING THE FUTURE, VOL 2, 2015, 338 :647-655
[24]   Advances in web usage mining [J].
Pabarskaite, Z ;
Raudys, A .
6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XI, PROCEEDINGS: COMPUTER SCIENCE II, 2002, :508-512
[25]   A methodology for web usage mining and its application to target group identification [J].
Araya, S ;
Silva, M ;
Weber, R .
FUZZY SETS AND SYSTEMS, 2004, 148 (01) :139-152
[26]   Integrating Web conceptual modeling and Web usage mining [J].
Meo, Rosa ;
Lanzi, Pier Luca ;
Matera, Maristella ;
Esposito, Roberto .
ADVANCES IN WEB MINING AND WEB USAGE ANALYSIS, 2006, 3932 :135-148
[27]   WEB EXTRACTOR - AN EFFICINET APPROACH OF WEB USAGE MINING [J].
Khan, Mahbubul Arefin ;
Haq, Kazi Ariful .
4TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING ( ICACTE 2011), 2011, :117-120
[28]   A New Clustering and Preprocessing for Web Log Mining [J].
Maheswari, B. Uma ;
Sumathi, P. .
2014 WORLD CONGRESS ON COMPUTING AND COMMUNICATION TECHNOLOGIES (WCCCT 2014), 2014, :25-+
[29]   Mining web logs for recommender a personalized system [J].
Puntheeranurak, S ;
Tsuji, H .
ITRE 2005: 3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: RESEARCH AND EDUCATION, PROCEEDINGS, 2005, :445-448
[30]   Web Usage Mining: users' navigational patterns extraction from web logs using Ant-based Clustering Method [J].
Etminani, Kobra ;
Akbarzadeh-T, Mohammad-R. ;
Yanehsari, Noorali Raeeji .
PROCEEDINGS OF THE JOINT 2009 INTERNATIONAL FUZZY SYSTEMS ASSOCIATION WORLD CONGRESS AND 2009 EUROPEAN SOCIETY OF FUZZY LOGIC AND TECHNOLOGY CONFERENCE, 2009, :396-401