Self-adaptive attribute weighting for Naive Bayes classification

被引:80
作者
Wu, Jia [1 ,2 ]
Pan, Shirui [2 ]
Zhu, Xingquan [3 ]
Cai, Zhihua [1 ]
Zhang, Peng [2 ]
Zhang, Chengqi [2 ]
机构
[1] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[2] Univ Technol Sydney, Fac Engn & Informat Technol, Quantum Computat & Intelligent Syst QCIS Ctr, Sydney, NSW 2007, Australia
[3] Florida Atlantic Univ, Dept Comp & Elect Engn & Comp Sci, Boca Raton, FL 33431 USA
基金
澳大利亚研究理事会; 中国国家自然科学基金;
关键词
Naive Bayes; Self-adaptive; Attribute weighting; Artificial Immune Systems; Evolutionary computing; ARTIFICIAL IMMUNE-SYSTEM; NETWORK; OPTIMIZATION; CLASSIFIERS; ALGORITHM; EVOLUTION; RELIEFF; AREA;
D O I
10.1016/j.eswa.2014.09.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Naive Bayes (NB) is a popular machine learning tool for classification, due to its simplicity, high computational efficiency, and good classification accuracy, especially for high dimensional data such as texts. In reality, the pronounced advantage of NB is often challenged by the strong conditional independence assumption between attributes, which may deteriorate the classification performance. Accordingly, numerous efforts have been made to improve NB, by using approaches such as structure extension, attribute selection, attribute weighting, instance weighting, local learning and so on. In this paper, we propose a new Artificial Immune System (AIS) based self-adaptive attribute weighting method for Naive Bayes classification. The proposed method, namely AISWNB, uses immunity theory in Artificial Immune Systems to search optimal attribute weight values, where self-adjusted weight values will alleviate the conditional independence assumption and help calculate the conditional probability in an accurate way. One noticeable advantage of AISWNB is that the unique immune system based evolutionary computation process, including initialization, clone, section, and mutation, ensures that AISWNB can adjust itself to the data without explicit specification of functional or distributional forms of the underlying model. As a result, AISWNB can obtain good attribute weight values during the learning process. Experiments and comparisons on 36 machine learning benchmark data sets and six image classification data sets demonstrate that AISWNB significantly outperforms its peers in classification accuracy, class probability estimation, and class ranking performance. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1487 / 1502
页数:16
相关论文
共 54 条
[1]  
Aickelin U., 2013, CORR
[2]  
[Anonymous], 2005, MORGAN KAUFMANN SERI
[3]  
[Anonymous], 2014, P 2014 SIAM INT C DA
[4]  
Bache K, 2013, UCI machine learning repository
[5]   Feature selection for text classification with Naive Bayes [J].
Chen, Jingnian ;
Huang, Houkuan ;
Tian, Shengfeng ;
Qu, Youli .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) :5432-5435
[6]   iLike: Bridging the Semantic Gap in Vertical Image Search by Integrating Text and Visual Features [J].
Chen, Yuxin ;
Sampathkumar, Hariprasad ;
Luo, Bo ;
Chen, Xue-wen .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (10) :2257-2270
[7]   Automatic multiple circle detection based on artificial immune systems [J].
Cuevas, Erik ;
Osuna-Enciso, Valentin ;
Wario, Fernando ;
Zaldivar, Daniel ;
Perez-Cisneros, Marco .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) :713-722
[8]  
de Castro L., 2002, Artificial Neural Networks in Pattern Recognition, P67
[9]   Diagnosis of chest diseases using artificial immune system [J].
Er, Orhan ;
Yumusak, Nejat ;
Temurtas, Feyzullah .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (02) :1862-1868
[10]   Bayesian network classifiers [J].
Friedman, N ;
Geiger, D ;
Goldszmidt, M .
MACHINE LEARNING, 1997, 29 (2-3) :131-163