Improved Mutual Information Method For Text Feature Selection

被引:0
|
作者
Ding Xiaoming [1 ]
Tang Yan [1 ]
机构
[1] Southwest Univ, Coll Comp & Informat Sci, Chongqing 400715, Peoples R China
来源
PROCEEDINGS OF THE 2013 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2013) | 2013年
关键词
text classification; feature selection; mutual information;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Reducing the dimensions of high-dimensional feature set is one of the difficulties of text categorization. Feature selection has been effectively applied in text classification, because of its low complexity of computing. Research works show that mutual information is a good feature selection method but doesn't consider the term frequency in each category of the corpus and the connections between terms. To remedying the defects of traditional mutual information method, this article improved measure of mutual information by introducing the feature frequency in class and the dispersion of feature in class, and built a experimental platform by constructing a Chinese text classification system, and did a multi-set of experiments base on this system. The results show that the new feature selection approach has a more excellent effect in text categorization.
引用
收藏
页码:163 / 166
页数:4
相关论文
共 50 条
  • [11] A new feature selection method for handling redundant information in text classification
    Wang, You-wei
    Feng, Li-zhou
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2018, 19 (02) : 221 - 234
  • [12] Modified Pointwise Mutual Information-Based Feature Selection for Text Classification
    Georgieva-Trifonova, Tsvetanka
    PROCEEDINGS OF THE FUTURE TECHNOLOGIES CONFERENCE (FTC) 2021, VOL 2, 2022, 359 : 333 - 353
  • [13] An Improved Feature Selection Algorithm with Conditional Mutual Information for Classification Problems
    Palanichamy, Jaganathan
    Ramasamy, Kuppuchamy
    2013 INTERNATIONAL CONFERENCE ON HUMAN COMPUTER INTERACTIONS (ICHCI), 2013,
  • [14] A novel feature selection method based on normalized mutual information
    La The Vinh
    Lee, Sungyoung
    Park, Young-Tack
    d'Auriol, Brian J.
    APPLIED INTELLIGENCE, 2012, 37 (01) : 100 - 120
  • [15] An Effective Feature Selection Method via Mutual Information Estimation
    Yang, Jian-Bo
    Ong, Chong-Jin
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (06): : 1550 - 1559
  • [16] A feature selection method using a fuzzy mutual information measure
    Grande, Javier
    Suarez, Maria del Rosario
    Villar, Jose Ramon
    INNOVATIONS IN HYBRID INTELLIGENT SYSTEMS, 2007, 44 : 56 - +
  • [17] An Improved Feature Selection for Categorization Based on Mutual Information
    Liu, Haifeng
    Su, Zhan
    Yao, Zeqing
    Liu, Shousheng
    WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, 5854 : 80 - 87
  • [18] A novel feature selection method based on normalized mutual information
    La The Vinh
    Sungyoung Lee
    Young-Tack Park
    Brian J. d’Auriol
    Applied Intelligence, 2012, 37 : 100 - 120
  • [19] Feature selection with dynamic mutual information
    Liu, Huawen
    Sun, Jigui
    Liu, Lei
    Zhang, Huijie
    PATTERN RECOGNITION, 2009, 42 (07) : 1330 - 1339
  • [20] On Estimating Mutual Information for Feature Selection
    Schaffernicht, Erik
    Kaltenhaeuser, Robert
    Verma, Saurabh Shekhar
    Gross, Horst-Michael
    ARTIFICIAL NEURAL NETWORKS-ICANN 2010, PT I, 2010, 6352 : 362 - +