AdaBoost ranking results improvement by pairwise classifiers for web page classification

被引:0
作者
Ga̧ciarz T. [1 ]
Czajkowski K. [1 ]
Niebylski M. [1 ]
机构
[1] Institute of Telecomputing, Faculty of Physics, Mathematics and Computer Science, Cracow University of Technology
来源
Advances in Intelligent and Soft Computing | 2011年 / 103卷
关键词
AdaBoost; classification; features extraction; Web page;
D O I
10.1007/978-3-642-23169-8_43
中图分类号
学科分类号
摘要
The article concerns the analysis of information describing the web pages. The aim of the analysis is to support the process of their classification. Pages belonging to the specific class are characterized by the similar 'style' in terms of the form or the type of content presentation. Various characteristics are taken into account including inter alia, structural, visual, text, web and links features. During the construction of classifiers the AdaBoost algorithm was applied to create ranking list of classifiers. Then the pairwise classifiers were used to improve final classification. The paper presents the implementation of this solution and the results of experiments. © 2011 Springer-Verlag Berlin Heidelberg.
引用
收藏
页码:393 / 400
页数:7
相关论文
共 12 条
[11]  
Xue W., Huang W., Lu Y., Application of svm in web page categorization, Proceedings of the IEEE International Conference on Granular Computing, pp. 469-472, (2006)
[12]  
Yin S., Wang F., Xie Z., Qiu Y., Study on web-page classification algorithm based on rough set theory, Proceedings of International Symposium on Information Processing (ISIP), pp. 202-206, (2008)