A data pre-processing method for web content mining based on XML

被引:0
|
作者
Zhang, Zhonglin [1 ]
Chen, Zhi [1 ]
机构
[1] Lanzhou Jiaotong Univ, Sch Elect & Informat Engn, Lanzhou 730070, Peoples R China
来源
2007 International Symposium on Computer Science & Technology, Proceedings | 2007年
关键词
Web content mining; XML; data pre-processing; Web documents;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Web mining is one of the hottest research topics in the field of data mining. As for the characteristics of Web documents, it is necessary to pre-process the data in order to analyze them efficiently. This paper argues a data pre-processing method that use XML as an agent, and discusses the process of that kind data pre-processing of Web content mining.
引用
收藏
页码:525 / 528
页数:4
相关论文
共 50 条
  • [11] An Enhanced Pre-Processing Technique for Web Log Mining by Removing Web Robots
    Nithya, P.
    Sumathi, P.
    2012 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2012, : 662 - 665
  • [12] Toward databases mining: Pre-processing collected data
    Yan, XW
    Zhang, CQ
    Zhang, SC
    APPLIED ARTIFICIAL INTELLIGENCE, 2003, 17 (5-6) : 545 - 561
  • [13] Survey of Pre-processing Techniques for Mining Big Data
    Hariharakrishnan, Jayaram
    Mohanavalli, S.
    Srividya
    Kumar, Sundhara K. B.
    2017 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND SIGNAL PROCESSING (ICCCSP), 2017, : 77 - 81
  • [14] A data pre-processing method based on multi-threshold
    Su-bida
    Wang-shuhua
    Wang-Jingfeng
    Zhong-Hua
    Deng-Rong
    Hua-Hao
    Yang-suhui
    INTERNATIONAL SYMPOSIUM ON OPTOELECTRONIC TECHNOLOGY AND APPLICATION 2014: OPTICAL REMOTE SENSING TECHNOLOGY AND APPLICATIONS, 2014, 9299
  • [15] Importance of Data Pre-processing in Credit Scoring Models Based on Data Mining Approaches
    Nalic, Jasmina
    Svraka, Amar
    2018 41ST INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2018, : 1046 - 1051
  • [16] IRPDP_HT2: a scalable data pre-processing method in web usage mining using Hadoop MapReduce
    Srivastava, Atul Kumar
    Srivastava, Mitali
    SOFT COMPUTING, 2023, 27 (12) : 7907 - 7923
  • [17] IRPDP_HT2: a scalable data pre-processing method in web usage mining using Hadoop MapReduce
    Atul Kumar Srivastava
    Mitali Srivastava
    Soft Computing, 2023, 27 : 7907 - 7923
  • [18] A survey on pre-processing and post-processing techniques in data mining
    Tomar, Divya
    Agarwal, Sonali
    International Journal of Database Theory and Application, 2014, 7 (04): : 99 - 128
  • [19] Research of Web Data Mining Based on XML
    Gu, LiFen
    Meng, JunXia
    THIRD INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY (ISCSCT 2010), 2010, : 382 - 385
  • [20] A pre-processing tool for web usage mining in the distance education domain
    Marquardt, CG
    Becker, K
    Ruiz, DD
    INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2004, : 78 - 87