An improved method of term weighting for text classification

被引:5
作者
Jiang, Hua [1 ]
Li, Ping [1 ]
Hu, Xin [1 ]
Wang, Shuyan [1 ]
机构
[1] NE Normal Univ, Sch Comp Sci, Changchun, Jilin Province, Peoples R China
来源
2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 1 | 2009年
关键词
Text classification; tf-idf; term weighting; kNN;
D O I
10.1109/ICICISYS.2009.5357842
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In text classification, term weighting methods design appropriate weights to the given terms to improve the text classification performance Traditional algorithm of term weighting only considers about tf (term frequency), idf (Inverse document frequency) and so on, and this approach simply thinks low frequency terms are Important, high frequency terms are unimportant, so it designs higher weights to the rare terms frequently In this paper, we present an effective term weighting approach to avoid the deficiency of the traditional approach, and make use of kNN classifiers to classify over widely-used benchmark data set Reuters-21578 The experimental results prove that,the new approach can Improve the accuracy of classification
引用
收藏
页码:294 / 298
页数:5
相关论文
共 50 条
[41]   A Study of Applying Different Term Weighting Schemes on Arabic Text Classification [J].
Guru, D. S. ;
Ali, Mostafa ;
Suhil, Mahamad ;
Hazman, Maryam .
DATA ANALYTICS AND LEARNING, 2019, 43 :293-305
[42]   INVESTIGATING TERM WEIGHTING SCHEMES ON THE CLASSIFICATION PERFORMANCE FOR THE IMBALANCED TEXT DATA [J].
Al Manei, Afra ;
Al Hasani, Iman ;
Wesonga, Ronald .
ADVANCES AND APPLICATIONS IN STATISTICS, 2022, 78 :63-82
[43]   Two novel term weighting for text categorization [J].
Matsunaga, L. A. ;
Ebecken, N. F. F. .
DATA MINING IX: DATA MINING, PROTECTION, DETECTION AND OTHER SECURITY TECHNOLOGIES, 2008, 40 :105-114
[44]   A new term weighting scheme for text categorisation [J].
Barigou, Fatiha .
International Journal of Intelligent Systems Technologies and Applications, 2015, 14 (3-4) :256-272
[45]   Turning from TF-IDF to TF-IGM for term weighting in text classification [J].
Chen, Kewen ;
Zhang, Zuping ;
Long, Jun ;
Zhang, Hao .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 66 :245-260
[46]   A generic multi-level framework for building term-weighting schemes in text classification [J].
Tang, Zhong .
COMPUTER JOURNAL, 2024, 67 (11) :3042-3055
[47]   Supporting Text Retrieval by Typographical Term Weighting [J].
Werner, Lars ;
Boettcher, Stefan .
INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2007, 3 (02) :1-16
[48]   RETRACTED: Improved TFIDF weighting for imbalanced biomedical text classification (Retracted Article) [J].
Xu, Guixian ;
Gao, Xu ;
Zhang, Xin ;
Zhao, Xiaobing .
2011 INTERNATIONAL CONFERENCE ON ENERGY AND ENVIRONMENTAL SCIENCE-ICEES 2011, 2011, 11 :2360-2367
[49]   Improved Algorithm Based on TFIDF in Text Classification [J].
Jiang, Hao ;
Li, Wenqiang .
MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 :1791-1794
[50]   The Effects of Globalization Functions on Feature Weighting for Text Classification [J].
Dogan, Turgut ;
Uysal, Alper Kursat .
2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,