Two feature weighting approaches for naive Bayes text classifiers

被引:79
作者
Zhang, Lungan [1 ]
Jiang, Liangxiao [1 ,2 ]
Li, Chaoqun [3 ]
Kong, Ganggang [1 ]
机构
[1] China Univ Geosci, Dept Comp Sci, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430074, Peoples R China
[3] China Univ Geosci, Dept Math, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Naive Bayes text classifiers; Feature weighting; Gain ratio; Decision tree;
D O I
10.1016/j.knosys.2016.02.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper works on feature weighting approaches for naive Bayes text classifiers. Almost all existing feature weighting approaches for naive Bayes text classifiers have some defects: limited improvement to classification performance of naive Bayes text classifiers or sacrificing the simplicity and execution time of the final models. In fact, feature weighting is not new for machine learning community, and many researchers have made fruitful efforts in the field of feature weighting. This paper reviews some simple and efficient feature weighting approaches designed for standard naive Bayes classifiers, and adapts them for naive Bayes text classifiers. As a result, this paper proposes two adaptive feature weighting approaches for naive Bayes text classifiers. Experimental results based on benchmark and real-world data show that, compared to their competitors, our feature weighting approaches show higher classification accuracy, yet at the same time maintain the simplicity and lower execution time of the final models. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:137 / 144
页数:8
相关论文
共 36 条
  • [31] Feature selection method using improved CHI Square on Arabic text classifiers: analysis and application
    Alshaer, Hadeel N.
    Otair, Mohammed A.
    Abualigah, Laith
    Alshinwan, Mohammad
    Khasawneh, Ahmad M.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (07) : 10373 - 10390
  • [32] Feature selection method using improved CHI Square on Arabic text classifiers: analysis and application
    Hadeel N. Alshaer
    Mohammed A. Otair
    Laith Abualigah
    Mohammad Alshinwan
    Ahmad M. Khasawneh
    Multimedia Tools and Applications, 2021, 80 : 10373 - 10390
  • [33] ARRAY: Adaptive triple feature-weighted transfer Naive Bayes for cross-project defect prediction
    Tong, Haonan
    Lu, Wei
    Xing, Weiwei
    Wang, Shihai
    JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 202
  • [34] The Comparation of Text Mining With Naive Bayes Classifier, Nearest Neighbor, and Decision Tree to Detect Indonesian Swear Words on Twitter
    Zulfikar, Wildan Budiawan
    Irfan, Mohamad
    Alam, Cecep Nurul
    Indra, Muhammad
    2017 5TH INTERNATIONAL CONFERENCE ON CYBER AND IT SERVICE MANAGEMENT (CITSM 2017), 2017, : 50 - 54
  • [35] A novel hybrid artificial intelligence approach based on the rotation forest ensemble and naive Bayes tree classifiers for a landslide susceptibility assessment in Langao County, China
    Chen, Wei
    Shirzadi, Ataollah
    Shahabi, Himan
    Bin Ahmad, Baharin
    Zhang, Shuai
    Hong, Haoyuan
    Zhang, Ning
    GEOMATICS NATURAL HAZARDS & RISK, 2017, 8 (02) : 1955 - 1977
  • [36] Feature-Weighting-Based Prediction of Drought Occurrence via Two-Stage Particle Swarm Optimization
    Sundararajan, Karpagam
    Srinivasan, Kathiravan
    SUSTAINABILITY, 2023, 15 (02)