Two feature weighting approaches for naive Bayes text classifiers

被引:79
作者
Zhang, Lungan [1 ]
Jiang, Liangxiao [1 ,2 ]
Li, Chaoqun [3 ]
Kong, Ganggang [1 ]
机构
[1] China Univ Geosci, Dept Comp Sci, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430074, Peoples R China
[3] China Univ Geosci, Dept Math, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Naive Bayes text classifiers; Feature weighting; Gain ratio; Decision tree;
D O I
10.1016/j.knosys.2016.02.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper works on feature weighting approaches for naive Bayes text classifiers. Almost all existing feature weighting approaches for naive Bayes text classifiers have some defects: limited improvement to classification performance of naive Bayes text classifiers or sacrificing the simplicity and execution time of the final models. In fact, feature weighting is not new for machine learning community, and many researchers have made fruitful efforts in the field of feature weighting. This paper reviews some simple and efficient feature weighting approaches designed for standard naive Bayes classifiers, and adapts them for naive Bayes text classifiers. As a result, this paper proposes two adaptive feature weighting approaches for naive Bayes text classifiers. Experimental results based on benchmark and real-world data show that, compared to their competitors, our feature weighting approaches show higher classification accuracy, yet at the same time maintain the simplicity and lower execution time of the final models. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:137 / 144
页数:8
相关论文
共 36 条
  • [1] A New Feature Selection Approach to Naive Bayes Text Classifiers
    Zhang, Lungan
    Jiang, Liangxiao
    Li, Chaoqun
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2016, 30 (02)
  • [2] Integrating incremental feature weighting into Naive Bayes text classifier
    Kim, Han Joon
    Chang, Jaeyoung
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 1137 - 1143
  • [3] Advanced Naive Bayes Text Classifier with Embedded Feature Weighting Approach
    Kim, Han-joon
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2009, 12 (03): : 607 - 620
  • [4] Deep feature weighting for naive Bayes and its application to text classification
    Jiang, Liangxiao
    Li, Chaoqun
    Wang, Shasha
    Zhang, Lungan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 52 : 26 - 39
  • [5] A Dependent Feature Weighting Filter for Naive Bayes Classifier
    Chatip, Gieliz
    Yilmaz, Ferkan
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [6] A Correlation-Based Feature Weighting Filter for Naive Bayes
    Jiang, Liangxiao
    Zhang, Lungan
    Li, Chaoqun
    Wu, Jia
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (02) : 201 - 213
  • [7] AN INFORMATION-THEORETIC FILTER METHOD FOR FEATURE WEIGHTING IN NAIVE BAYES
    Lee, Chang-Hwan
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2014, 28 (05)
  • [8] Weakening Feature Independence of Naive Bayes Using Feature Weighting and Selection on Imbalanced Customer Review Data
    Cahya, Reiza Adi
    Bachtiar, Fitra A.
    2019 5TH INTERNATIONAL CONFERENCE ON SCIENCE ININFORMATION TECHNOLOGY (ICSITECH): EMBRACING INDUSTRY 4.0 - TOWARDS INNOVATION IN CYBER PHYSICAL SYSTEM, 2019, : 182 - 187
  • [9] Feature weighting for naive Bayes using multi objective artificial bee colony algorithm
    Chaudhuri, Abhilasha
    Sahu, Tirath Prasad
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2021, 24 (01) : 74 - 88
  • [10] Deep Feature Weighting Based on Genetic Algorithm and Naive Bayes for Twitter Sentiment Analysis
    Cahya, Reiza Adi
    Adimanggala, Dinda
    Supianto, Ahmad Afif
    PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY (SIET 2019), 2019, : 326 - 331