An improved method of term weighting for text classification

被引:5
|
作者
Jiang, Hua [1 ]
Li, Ping [1 ]
Hu, Xin [1 ]
Wang, Shuyan [1 ]
机构
[1] NE Normal Univ, Sch Comp Sci, Changchun, Jilin Province, Peoples R China
来源
2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 1 | 2009年
关键词
Text classification; tf-idf; term weighting; kNN;
D O I
10.1109/ICICISYS.2009.5357842
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In text classification, term weighting methods design appropriate weights to the given terms to improve the text classification performance Traditional algorithm of term weighting only considers about tf (term frequency), idf (Inverse document frequency) and so on, and this approach simply thinks low frequency terms are Important, high frequency terms are unimportant, so it designs higher weights to the rare terms frequently In this paper, we present an effective term weighting approach to avoid the deficiency of the traditional approach, and make use of kNN classifiers to classify over widely-used benchmark data set Reuters-21578 The experimental results prove that,the new approach can Improve the accuracy of classification
引用
收藏
页码:294 / 298
页数:5
相关论文
共 50 条
  • [1] An improved term weighting scheme for text classification
    Tang, Zhong
    Li, Wenqiang
    Li, Yan
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (09):
  • [2] An improved term weighting method based on relevance frequency for text classification
    Li, Chuanxiao
    Li, Wenqiang
    Tang, Zhong
    Li, Song
    Xiang, Hai
    SOFT COMPUTING, 2023, 27 (07) : 3563 - 3579
  • [3] An improved term weighting method based on relevance frequency for text classification
    Chuanxiao Li
    Wenqiang Li
    Zhong Tang
    Song Li
    Hai Xiang
    Soft Computing, 2023, 27 : 3563 - 3579
  • [4] RANDOM WALK TERM WEIGHTING FOR IMPROVED TEXT CLASSIFICATION
    Hassan, Samer
    Mihalcea, Rada
    Banea, Carmen
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2007, 1 (04) : 421 - 439
  • [5] Random-walk term weighting for improved text classification
    Hassan, Samer
    Mihalcea, Rada
    Banea, Carmen
    ICSC 2007: INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, PROCEEDINGS, 2007, : 242 - +
  • [6] An improved supervised term weighting scheme for text representation and classification
    Tang, Zhong
    Li, Wenqiang
    Li, Yan
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 189
  • [7] Improved inverse gravity moment term weighting for text classification
    Dogan, Turgut
    Uysal, Alper Kursat
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 130 : 45 - 59
  • [8] Supervised term-category feature weighting for improved text classification
    Attieh, Joseph
    Tekli, Joe
    KNOWLEDGE-BASED SYSTEMS, 2023, 261
  • [9] Adaptable Term Weighting Framework for Text Classification
    Huynh, Dat
    Dat Tran
    Ma, Wanli
    Sharma, Dharmendra
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT II, 2011, 6609 : 254 - 265
  • [10] A survey of term weighting schemes for text classification
    Alsaeedi, Abdullah
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2020, 12 (02) : 237 - 254