An effective approach to enhance centroid classifier for text categorization

被引:0
作者
Tan, Songbo
Cheng, Xueqi
机构
来源
KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2007, PROCEEDINGS | 2007年 / 4702卷
关键词
text categorization; information retrieval; data mining;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Centroid Classifier has been shown to be a simple and yet effective method for text categorization. However, it is often plagued with model misfit (or inductive bias) incurred by its assumption. To address this issue, a novel Model Adjustment algorithm was proposed. The basic idea is to make use of some criteria to adjust Centroid Classifier model. In this work, the criteria include training-set errors as well as training-set margins. The empirical assessment indicates that proposed method performs slightly better than SVM classifier in prediction accuracy, as well as beats it in running time.
引用
收藏
页码:581 / 588
页数:8
相关论文
共 8 条
[1]  
Joachims T., 1998, Lecture Notes in Computer Science, P137, DOI DOI 10.1007/BFB0026683
[2]  
Liu Y., 2002, Proceedings of the Eleventh International Conference on Information and Knowledge Management. CIKM 2002, P348, DOI 10.1145/584792.584850
[3]  
McCallum A., 1998, Workshop on Learning for Text Categorization, V752, P41, DOI DOI 10.1109/TSMC.1985.6313426
[4]  
MUN P, TEXT CLASSIFICATION
[5]   Machine learning in automated text categorization [J].
Sebastiani, F .
ACM COMPUTING SURVEYS, 2002, 34 (01) :1-47
[6]  
SHANKAR S, WEIGHT ADJUSMENT SCH
[7]  
TA S, 2005, ACM CIKM, P469
[8]  
WU H, 2002, P 8 ACM SIGKDD INT C, P207, DOI DOI 10.1145/775047.