Data Mining Techniques for Web Page Classification

被引:0
作者
Fiol-Roig, Gabriel [1 ]
Miro-Julia, Margaret [1 ]
Herraiz, Eduardo [1 ]
机构
[1] Univ Illes Balears, Math & Comp Sci Dept, Palma De Mallorca 07122, Spain
来源
HIGHLIGHTS IN PRACTICAL APPLICATIONS OF AGENTS AND MULTIAGENT SYSTEMS | 2011年 / 89卷
关键词
Data Mining; Artificial Intelligence; Decision Trees; Web page classification; TABLES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, the Web is an essential tool for most people. Internet provides millions of web pages for each and every search term. The Internet is a powerful medium for communication between computers and accessing online documents but it is not a tool for locating or organizing information. Tools like search engines assist users in locating information. The amount of daily searches on the web is broad and the task of getting interesting and required results quickly becomes very difficult. The use of an automatic web page classifier can simplify the process by assisting the search engine in getting relevant results. The web pages can present different and varied information depending on the characteristics of its content. The uncontrolled nature of web content presents additional challenges to web page classification as compared to traditional text classification, but the interconnected nature of hypertext also provides features that can assist the process. This paper analyses the feasibility of an automatic web page classifier, proposes several classifiers and studies their precision. In this sense, Data Mining techniques are of great importance and will be used to construct the classifiers.
引用
收藏
页码:61 / 68
页数:8
相关论文
共 8 条
  • [1] [Anonymous], 2003, Data Mining: Introductory and Advanced Topics
  • [2] FIOL G, 1999, LECT NOTES ARTIF INT, V1609, P601
  • [3] Fiol-Roig G, 2004, FRONT ARTIF INTEL AP, V113, P145
  • [4] Miró-Julià M, 2005, LECT NOTES COMPUT SC, V3643, P21, DOI 10.1007/11556985_4
  • [5] Miró-Juliá M, 2003, LECT NOTES COMPUT SC, V2652, P556
  • [6] Classification Using Intelligent Approaches. An Example in Social Assistance
    Miro-Julia, Margaret
    Fiol-Roig, Gabriel
    Vaquer-Ferrer, Damia
    [J]. ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2009, 202 : 138 - 146
  • [7] Web Page Classification: Features and Algorithms
    Qi, Xiaoguang
    Davison, Brian D.
    [J]. ACM COMPUTING SURVEYS, 2009, 41 (02)
  • [8] Witten I. H., 2005, DATA MINING, V2, P403