An improved method of term weighting for text classification

被引:5
作者
Jiang, Hua [1 ]
Li, Ping [1 ]
Hu, Xin [1 ]
Wang, Shuyan [1 ]
机构
[1] NE Normal Univ, Sch Comp Sci, Changchun, Jilin Province, Peoples R China
来源
2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 1 | 2009年
关键词
Text classification; tf-idf; term weighting; kNN;
D O I
10.1109/ICICISYS.2009.5357842
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In text classification, term weighting methods design appropriate weights to the given terms to improve the text classification performance Traditional algorithm of term weighting only considers about tf (term frequency), idf (Inverse document frequency) and so on, and this approach simply thinks low frequency terms are Important, high frequency terms are unimportant, so it designs higher weights to the rare terms frequently In this paper, we present an effective term weighting approach to avoid the deficiency of the traditional approach, and make use of kNN classifiers to classify over widely-used benchmark data set Reuters-21578 The experimental results prove that,the new approach can Improve the accuracy of classification
引用
收藏
页码:294 / 298
页数:5
相关论文
共 50 条
[31]   An Extension of Topic Models for Text Classification: a Term Weighting Approach [J].
Lee, Seonggyu ;
Kim, Jinho ;
Myaeng, Sung-Hyon .
2015 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2015, :217-224
[32]   Several alternative term weighting methods for text representation and classification [J].
Tang, Zhong ;
Li, Wenqiang ;
Li, Yan ;
Zhao, Wu ;
Li, Song .
KNOWLEDGE-BASED SYSTEMS, 2020, 207
[33]   Hybridized term-weighting method for Dark Web classification [J].
Sabbah, Thabit ;
Selamat, Ali ;
Selamat, Md. Hafiz ;
Ibrahim, Roliana ;
Fujita, Hamido .
NEUROCOMPUTING, 2016, 173 :1908-1926
[34]   Information Gain Based Term Weighting Method for Multi-label Text Classification Task [J].
Mazyad, Ahmad ;
Teytaud, Fabien ;
Fonlupt, Cyril .
INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2019, 868 :607-615
[35]   Chinese Text Classification with a KNN Classifier Using an Adjusted Feature Weighting Method [J].
Lin, Weiran ;
Wu, Zhihui ;
Feng, Lichao ;
Huang, Waibin .
INTELLIGENT STRUCTURE AND VIBRATION CONTROL, PTS 1 AND 2, 2011, 50-51 :700-+
[36]   Effective Text Classification Through Supervised Rough Set-Based Term Weighting [J].
Cekik, Rasim .
SYMMETRY-BASEL, 2025, 17 (01)
[37]   Grammatical Dependency-Based Relations for Term Weighting in Text Classification [J].
Dat Huynh ;
Dat Tran ;
Ma, Wanli ;
Sharma, Dharmendra .
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6634 :476-487
[38]   Term weighting scheme for short-text classification: Twitter corpuses [J].
Alsmadi, Issa ;
Hoon, Gan Keng .
NEURAL COMPUTING & APPLICATIONS, 2019, 31 (08) :3819-3831
[39]   A novel term weighting scheme for text classification: TF-MONO [J].
Dogan, Turgut ;
Uysal, Alper Kursat .
JOURNAL OF INFORMETRICS, 2020, 14 (04)
[40]   Term weighting scheme for short-text classification: Twitter corpuses [J].
Issa Alsmadi ;
Gan Keng Hoon .
Neural Computing and Applications, 2019, 31 :3819-3831