A method of the feature selection in hierarchical text classification based on the category discrimination and position information

被引:10
作者
Song, Jia [1 ]
Zhang, Pengzhou [1 ]
Qin, Sijun [2 ]
Gong, Junpeng [1 ]
机构
[1] Commun Univ China, Fac Sci & Technol, Beijing, Peoples R China
[2] Commun Univ China, New Media Inst, Beijing, Peoples R China
来源
2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS - COMPUTING TECHNOLOGY, INTELLIGENT TECHNOLOGY, INDUSTRIAL INFORMATION INTEGRATION (ICIICII) | 2015年
关键词
feature selection; category discrimination; information gain; hierarchical text classification;
D O I
10.1109/ICIICII.2015.116
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Feature dimension reduction is an important part in text categorization, and it even becomes more important for child classification in hierarchical text classification. It is presented that Chinese text feature selection method based on category distinction and feature location information in this paper. Experimental results show that the proposed method has a higher precision and recall rate than the others. Therefore the effect of the feature selection is better.
引用
收藏
页码:132 / +
页数:5
相关论文
共 13 条
  • [1] [Anonymous], 1997, ICML
  • [2] On the optimality of the simple Bayesian classifier under zero-one loss
    Domingos, P
    Pazzani, M
    [J]. MACHINE LEARNING, 1997, 29 (2-3) : 103 - 130
  • [3] Galavotti L, 2000, LECT NOTES COMPUT SC, V1923, P59
  • [4] Some effective techniques for naive Bayes text classification
    Kim, Sang-Bum
    Han, Kyoung-Soo
    Rim, Hae-Chang
    Myaeng, Sung Hyon
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (11) : 1457 - 1466
  • [5] Liu H. F., 2008, APPL RES COMPUTERS, V25, P93
  • [6] [刘海峰 LIU Haifeng], 2007, [情报科学, Information Science], V25, P451
  • [7] Lu Ting, 2011, Computer Engineering and Applications, V47, P127, DOI 10.3778/j.issn.1002-8331.2011.02.040
  • [8] Mitchell T. M., 2003, Machine Learning
  • [9] [秦进 Qin Jin], 2003, [计算机应用, Computer Applications], V23, P45
  • [10] Performance standards and evaluations in IR test collections: Vector-space and other retrieval models
    Shaw, WM
    Burgin, R
    Howell, P
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1997, 33 (01) : 15 - 36