An improved method of term weighting for text classification

被引:5
作者
Jiang, Hua [1 ]
Li, Ping [1 ]
Hu, Xin [1 ]
Wang, Shuyan [1 ]
机构
[1] NE Normal Univ, Sch Comp Sci, Changchun, Jilin Province, Peoples R China
来源
2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 1 | 2009年
关键词
Text classification; tf-idf; term weighting; kNN;
D O I
10.1109/ICICISYS.2009.5357842
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In text classification, term weighting methods design appropriate weights to the given terms to improve the text classification performance Traditional algorithm of term weighting only considers about tf (term frequency), idf (Inverse document frequency) and so on, and this approach simply thinks low frequency terms are Important, high frequency terms are unimportant, so it designs higher weights to the rare terms frequently In this paper, we present an effective term weighting approach to avoid the deficiency of the traditional approach, and make use of kNN classifiers to classify over widely-used benchmark data set Reuters-21578 The experimental results prove that,the new approach can Improve the accuracy of classification
引用
收藏
页码:294 / 298
页数:5
相关论文
共 50 条
  • [21] Modified DFS-based term weighting scheme for text classification
    Chen, Long
    Jiang, Liangxiao
    Li, Chaoqun
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168
  • [22] Class-indexing-based term weighting for automatic text classification
    Ren, Fuji
    Sohrab, Mohammad Golam
    [J]. INFORMATION SCIENCES, 2013, 236 : 109 - 125
  • [23] A Term Weighting Scheme Approach for Vietnamese Text Classification
    Vu Thanh Nguyen
    Nguyen Tri Hai
    Nguyen Hoang Nghia
    Tuan Dinh Le
    [J]. FUTURE DATA AND SECURITY ENGINEERING, FDSE 2015, 2015, 9446 : 46 - 53
  • [24] An Improved Term Weighting Scheme for Sentiment Classification
    Zhang, Pu
    Wang, Yinghao
    Wang, Junxia
    Zeng, Xianhua
    Wang, Yong
    [J]. 2017 IEEE 2ND ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2017, : 462 - 466
  • [25] A Term Weighting Method for Identifying Emotions From Text Content
    De Silva, Jenomi
    Haddela, Prasanna S.
    [J]. 2013 8TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2013, : 381 - +
  • [26] Improving Term Weighting Schemes for Short Text Classification in Vector Space Model
    Samant, Surender Singh
    Murthy, N. L. Bhanu
    Malapati, Aruna
    [J]. IEEE ACCESS, 2019, 7 : 166578 - 166592
  • [27] Combining supervised term-weighting metrics for SVM text classification with extended term representation
    Mounia Haddoud
    Aïcha Mokhtari
    Thierry Lecroq
    Saïd Abdeddaïm
    [J]. Knowledge and Information Systems, 2016, 49 : 909 - 931
  • [28] Combining supervised term-weighting metrics for SVM text classification with extended term representation
    Haddoud, Mounia
    Mokhtari, Aicha
    Lecroq, Thierry
    Abdeddaim, Said
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 49 (03) : 909 - 931
  • [29] PU text classification enhanced by term frequency-inverse document frequency-improved weighting
    Peng, Tao
    Liu, Lu
    Zuo, Wanli
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (03) : 728 - 741
  • [30] A probabilistic model derived term weighting scheme for text classification
    Feng, Guozhong
    Li, Shaoting
    Sun, Tieli
    Zhang, Bangzuo
    [J]. PATTERN RECOGNITION LETTERS, 2018, 110 : 23 - 29