Web Page Classification with Social Annotations

被引:0
|
作者
Zubiaga, Arkaitz [1 ]
Martinez, Raquel [1 ]
Fresno, Victor [1 ]
机构
[1] Univ Nacl Educac Distan, C-Juan Rosal,16, Madrid 20840, Spain
来源
PROCESAMIENTO DEL LENGUAJE NATURAL | 2009年 / 43期
关键词
web page classification; social annotations; social bookmarking;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
User-generated annotations on social bookmarking sites can provide interesting and promising metadata for web page classification. These annotations include diverse types of information, such as tags and comments. Nonetheless, each kind of annotation has a different nature and popularity level. In this work, we analyze and evaluate the usefulness of each of these social annotations to classify web pages over a taxonomy like that by the Open Directory Project. We compare them separately to the content-based classification, and also combine the different types of data. Our experiments show encouraging results with the use of social annotations for this purpose, and we found that combining these metadata with web page content improves even more the classifier's performance.
引用
收藏
页码:225 / 233
页数:9
相关论文
共 50 条
  • [41] Web page classification: a survey of perspectives, gaps, and future directions
    Mahdi Hashemi
    Multimedia Tools and Applications, 2020, 79 : 11921 - 11945
  • [42] Knowledge Based Deep Inception Model for Web Page Classification
    Gupta, Amit
    Bhatia, Rajesh
    JOURNAL OF WEB ENGINEERING, 2021, 20 (07): : 2131 - 2167
  • [43] Research on Web Page Classification Method Based on Query Log
    叶飞跃
    马祎星
    JournalofShanghaiJiaotongUniversity(Science), 2018, 23 (03) : 404 - 410
  • [44] Entity-Based Classification of Web Page in Search Engine
    Liu, Yicen
    Liu, Mingrong
    Xiang, Liang
    Yang, Qing
    Digital Libraries: Universal and Ubiquitous Access to Information, Proceedings, 2008, 5362 : 410 - 411
  • [45] A Novel Approach for Web Page Classification using Optimum features
    Mangai, J. Alamelu
    Kumar, V. Santhosh
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2011, 11 (05): : 252 - 257
  • [46] Experimental Analysis of the Machine Learning Algorithms for Crime Web Page Classification
    Markkandeyan, S.
    Selvam, L.
    Tamizharasu, K.
    Aandi, Senthilkumar
    IETE JOURNAL OF RESEARCH, 2024, 70 (05) : 4890 - 4902
  • [47] CALA: An unsupervised URL-based web page classification system
    Hernandez, Inma
    Rivero, Carlos R.
    Ruiz, David
    Corchuelo, Rafael
    KNOWLEDGE-BASED SYSTEMS, 2014, 57 : 168 - 180
  • [48] Text-Based Web Page Classification with Use of Visual Information
    Bartik, Vladimir
    2010 INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2010), 2010, : 416 - 420
  • [49] Web Page Classification Using Relational Learning Algorithm and Unlabeled Data
    Li, Yanjuan
    Guo, Maozu
    JOURNAL OF COMPUTERS, 2011, 6 (03) : 474 - 479
  • [50] Web page classification based on heterogeneous features and a combination of multiple classifiers
    Deng, Li
    Du, Xin
    Shen, Ji-zhong
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (07) : 995 - 1004