An Enhanced Pre-Processing Technique for Web Log Mining by Removing Web Robots

Cited by: 0
Authors
Nithya, P. [1]
Sumathi, P. [2]
Affiliations
[1] Manonmaniam Sundaranar Univ, Tirunelveli, Tamil Nadu, India
[2] Chikkanna Govt Arts Coll, Tirupur, Tamil Nadu, India
Source
2012 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2012
Keywords
Preprocessing; Data Cleaning; Path Completion; Travel Path Set; Content Path Set
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Nowadays, the internet has become a useful source of information in day-to-day life. This has driven enormous growth of the World Wide Web, both in the volume of traffic and in the size and complexity of websites. Web Usage Mining (WUM) is one of the main applications of data mining, artificial intelligence, and related techniques to web data; it forecasts users' visiting behavior and infers their interests by examining samples of that data. WUM is directly involved in a wide range of applications, such as e-commerce, e-learning, web analytics, and information retrieval. Web log data is one of the major sources, containing information about the links users visited, their browsing patterns, and the time spent on a particular page or link. This information can be used in applications such as adaptive websites, customized services, customer summaries, pre-fetching, and the design of attractive websites. Existing web usage mining approaches have several problems; in particular, existing algorithms are difficult to apply in practice. New research is therefore needed to predict the future behavior of web users accurately and with short execution time. WUM consists of preprocessing, pattern discovery, and pattern analysis. Log data is characteristically noisy and ambiguous, so preprocessing is essential for effective mining. In this paper, a novel preprocessing technique is proposed that removes local and global noise and web robots. The Anonymous Microsoft Web Dataset and the MSNBC.com Anonymous Web Dataset are used to evaluate the proposed preprocessing technique.
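The abstract does not spell out how noise and web robots are identified, so the following is only a minimal sketch of a common cleaning heuristic, not the authors' technique. It assumes server logs in Combined Log Format (the two datasets named above are distributed in other formats), treats any host that requests /robots.txt or sends a crawler-like user agent as a robot, and treats embedded resources and failed or non-GET requests as noise. The names LOG_LINE, ROBOT_AGENT, NOISE_SUFFIXES, parse, clean, and the SAMPLE lines are all hypothetical.

```python
import re

# Assumed input: Combined Log Format lines (an assumption, not the paper's data).
LOG_LINE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+) [^"]*" (?P<status>\d{3}) \S+ '
    r'"(?P<referrer>[^"]*)" "(?P<agent>[^"]*)"'
)

# Heuristic markers; real robot detection usually needs more signals.
ROBOT_AGENT = re.compile(r'bot|crawler|spider|slurp', re.IGNORECASE)
NOISE_SUFFIXES = ('.gif', '.jpg', '.jpeg', '.png', '.css', '.js', '.ico')


def parse(lines):
    """Parse log lines into dicts, silently skipping malformed lines."""
    records = []
    for line in lines:
        match = LOG_LINE.match(line)
        if match:
            records.append(match.groupdict())
    return records


def clean(records):
    """Drop robot traffic, embedded resources, and failed or non-GET requests."""
    # A host is flagged as a robot if it ever fetched /robots.txt
    # or ever sent a crawler-like user agent string.
    robot_hosts = {
        r['ip'] for r in records
        if r['url'].split('?')[0] == '/robots.txt'
        or ROBOT_AGENT.search(r['agent'])
    }
    kept = []
    for r in records:
        if r['ip'] in robot_hosts:
            continue  # every request from a flagged host is removed
        path = r['url'].split('?')[0].lower()
        if path.endswith(NOISE_SUFFIXES):
            continue  # images, stylesheets, scripts are not page views
        if r['method'] != 'GET' or not r['status'].startswith('2'):
            continue  # keep only successful page requests
        kept.append(r)
    return kept


# Hypothetical sample lines, made up for illustration.
SAMPLE = [
    '10.0.0.1 - - [10/Oct/2012:13:55:36 +0000] "GET /index.html HTTP/1.1" 200 2326 "-" "Mozilla/5.0"',
    '10.0.0.1 - - [10/Oct/2012:13:55:37 +0000] "GET /logo.png HTTP/1.1" 200 512 "/index.html" "Mozilla/5.0"',
    '10.0.0.2 - - [10/Oct/2012:13:56:01 +0000] "GET /robots.txt HTTP/1.1" 200 68 "-" "ExampleCrawler/1.0"',
    '10.0.0.2 - - [10/Oct/2012:13:56:02 +0000] "GET /index.html HTTP/1.1" 200 2326 "-" "ExampleCrawler/1.0"',
    '10.0.0.3 - - [10/Oct/2012:13:57:10 +0000] "GET /missing.html HTTP/1.1" 404 0 "-" "Mozilla/5.0"',
]

cleaned = clean(parse(SAMPLE))
```

On the sample above, only the first request survives: the image request, both requests from the flagged host, and the 404 are dropped. Flagging whole hosts is a deliberately coarse choice; it can discard human traffic behind a shared proxy, which is one reason published techniques combine several signals.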
Pages: 662-665
Page count: 4
Related papers (first 10 of 50 shown)
  • [1] Novel Pre-Processing Technique for Web Log Mining by Removing Global Noise and Web Robots
    Nithya, P.
    Sumathi, P.
    2012 NATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION SYSTEMS (NCCCS), 2012: 41-45
  • [2] Pre-Processing of Query Logs in Web Usage Mining
    Abdullah, Norhaiza Ya
    Husin, Husna Sarirah
    Ramadhani, Herny
    Nadarajan, Shanmuga Vivekanada
    INDUSTRIAL ENGINEERING AND MANAGEMENT SYSTEMS, 2012, 11 (01): 82-86
  • [3] Efficient Management of Web Data by Applying Web Mining Pre-processing Methodologies
    Kaur, Jaswinder
    Garg, Kanwal
    SOFTWARE ENGINEERING (CSI 2015), 2019, 731: 115-122
  • [4] An Efficient Method in Pre-processing Phase of Mining Suspicious Web Crawlers
    Catalin, Mironeanu
    Cristian, Aflori
    2017 21ST INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2017: 272-277
  • [5] A pre-processing tool for web usage mining in the distance education domain
    Marquardt, CG
    Becker, K
    Ruiz, DD
    INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2004: 78-87
  • [6] A data pre-processing method for web content mining based on XML
    Zhang, Zhonglin
    Chen, Zhi
    2007 International Symposium on Computer Science & Technology, Proceedings, 2007: 525-528
  • [7] INFLUENCE OF RATIO OF AUXILIARY PAGES ON THE PRE-PROCESSING PHASE OF WEB USAGE MINING
    Munk, Michel
    Benko, L'ubomir
    Gangur, Mikulas
    Turcani, Milan
    E & M EKONOMIE A MANAGEMENT, 2015, 18 (03): 144-159
  • [8] On the existence and significance of data pre-processing biases in web-usage mining
    Zheng, ZQ
    Padmanabhan, B
    Kimbrough, SO
    INFORMS JOURNAL ON COMPUTING, 2003, 15 (02): 148-170
  • [9] Overview: Web log Mining, Privacy Issues and Application of Web Log Mining
    Singh, Amarjeet
    Sreeram, Y. Chaitanya
    2014 INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2014: 638-641
  • [10] STUDY ON DATA PRE-PROCESSING IN WEB MINING BASED E-COMMERCE RECOMMENDATION SYSTEMS
    Ya, Luo
    2011 3RD INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT (ICCTD 2011), VOL 3, 2012: 667-670