Getting the Most Out of Social Annotations for Web Page Classification

被引:0
作者
Zubiaga, Arkaitz
Martinez, Raquel
Fresno, Victor
机构
来源
DOCENG'09: PROCEEDINGS OF THE 2009 ACM SYMPOSIUM ON DOCUMENT ENGINEERING | 2009年
关键词
social bookmarking; social annotations; web page classification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
User-generated annotations on social bookmarking sites can provide interesting and promising metadata for web document management tasks like web page classification. These user-generated annotations include diverse types of information, such as tags and comments. Nonetheless, each kind of annotation has a different nature and popularity level. In this work, we analyze and evaluate the usefulness of each of these social annotations to classify web pages over a taxonomy like that proposed by the Open Directory Project. We compare them separately to the content-based classification, and also combine the different types of data to augment performance. Our experiments show encouraging results with the use of social annotations for this purpose, and we found that combining these metadata with web page content improves even more the classifier's performance.
引用
收藏
页码:74 / 83
页数:10
相关论文
共 18 条
[1]  
[Anonymous], 2007, P 16 INT C WORLD WID
[2]  
[Anonymous], 2003, P SIGIR
[3]  
[Anonymous], 2002, P 11 WWW
[4]  
Au Yeung ChingMan., 2008, Proceedings of the Workshop on Exploiting Semantic Annotations in Information Retrieval (ESAIR 2008), co-located with ECIR 2008, Glasgow, United Kingdom, 31 March, 2008, P48
[5]  
Ben-Yitzhak Ori., 2008, P 2008 INT C WEB SEA, P44
[6]   Usage patterns of collaborative tagging systems [J].
Golder, SA ;
Huberman, BA .
JOURNAL OF INFORMATION SCIENCE, 2006, 32 (02) :198-208
[7]  
Hotho A, 2006, LECT NOTES COMPUT SC, V4011, P411
[8]  
Joachims T, 1999, ADVANCES IN KERNEL METHODS, P169
[9]  
Joachims T., 1998, MACHINE LEARNING ECM, P137, DOI DOI 10.1007/BFB0026683
[10]  
NOLL MG, 2008, WEB INT INT AG TECHN, V1, P640