An improved method of term weighting for text classification

被引:5
作者
Jiang, Hua [1 ]
Li, Ping [1 ]
Hu, Xin [1 ]
Wang, Shuyan [1 ]
机构
[1] NE Normal Univ, Sch Comp Sci, Changchun, Jilin Province, Peoples R China
来源
2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 1 | 2009年
关键词
Text classification; tf-idf; term weighting; kNN;
D O I
10.1109/ICICISYS.2009.5357842
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In text classification, term weighting methods design appropriate weights to the given terms to improve the text classification performance Traditional algorithm of term weighting only considers about tf (term frequency), idf (Inverse document frequency) and so on, and this approach simply thinks low frequency terms are Important, high frequency terms are unimportant, so it designs higher weights to the rare terms frequently In this paper, we present an effective term weighting approach to avoid the deficiency of the traditional approach, and make use of kNN classifiers to classify over widely-used benchmark data set Reuters-21578 The experimental results prove that,the new approach can Improve the accuracy of classification
引用
收藏
页码:294 / 298
页数:5
相关论文
共 50 条
[21]   Modified DFS-based term weighting scheme for text classification [J].
Chen, Long ;
Jiang, Liangxiao ;
Li, Chaoqun .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168
[22]   Class-indexing-based term weighting for automatic text classification [J].
Ren, Fuji ;
Sohrab, Mohammad Golam .
INFORMATION SCIENCES, 2013, 236 :109-125
[23]   A Term Weighting Scheme Approach for Vietnamese Text Classification [J].
Vu Thanh Nguyen ;
Nguyen Tri Hai ;
Nguyen Hoang Nghia ;
Tuan Dinh Le .
FUTURE DATA AND SECURITY ENGINEERING, FDSE 2015, 2015, 9446 :46-53
[24]   An Improved Term Weighting Scheme for Sentiment Classification [J].
Zhang, Pu ;
Wang, Yinghao ;
Wang, Junxia ;
Zeng, Xianhua ;
Wang, Yong .
2017 IEEE 2ND ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2017, :462-466
[25]   A Term Weighting Method for Identifying Emotions From Text Content [J].
De Silva, Jenomi ;
Haddela, Prasanna S. .
2013 8TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2013, :381-+
[26]   Improving Term Weighting Schemes for Short Text Classification in Vector Space Model [J].
Samant, Surender Singh ;
Murthy, N. L. Bhanu ;
Malapati, Aruna .
IEEE ACCESS, 2019, 7 :166578-166592
[27]   Combining supervised term-weighting metrics for SVM text classification with extended term representation [J].
Mounia Haddoud ;
Aïcha Mokhtari ;
Thierry Lecroq ;
Saïd Abdeddaïm .
Knowledge and Information Systems, 2016, 49 :909-931
[28]   Combining supervised term-weighting metrics for SVM text classification with extended term representation [J].
Haddoud, Mounia ;
Mokhtari, Aicha ;
Lecroq, Thierry ;
Abdeddaim, Said .
KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 49 (03) :909-931
[29]   PU text classification enhanced by term frequency-inverse document frequency-improved weighting [J].
Peng, Tao ;
Liu, Lu ;
Zuo, Wanli .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (03) :728-741
[30]   A probabilistic model derived term weighting scheme for text classification [J].
Feng, Guozhong ;
Li, Shaoting ;
Sun, Tieli ;
Zhang, Bangzuo .
PATTERN RECOGNITION LETTERS, 2018, 110 :23-29