Self-adaptive attribute weighting for Naive Bayes classification

被引:80
作者
Wu, Jia [1 ,2 ]
Pan, Shirui [2 ]
Zhu, Xingquan [3 ]
Cai, Zhihua [1 ]
Zhang, Peng [2 ]
Zhang, Chengqi [2 ]
机构
[1] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[2] Univ Technol Sydney, Fac Engn & Informat Technol, Quantum Computat & Intelligent Syst QCIS Ctr, Sydney, NSW 2007, Australia
[3] Florida Atlantic Univ, Dept Comp & Elect Engn & Comp Sci, Boca Raton, FL 33431 USA
基金
中国国家自然科学基金; 澳大利亚研究理事会;
关键词
Naive Bayes; Self-adaptive; Attribute weighting; Artificial Immune Systems; Evolutionary computing; ARTIFICIAL IMMUNE-SYSTEM; NETWORK; OPTIMIZATION; CLASSIFIERS; ALGORITHM; EVOLUTION; RELIEFF; AREA;
D O I
10.1016/j.eswa.2014.09.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Naive Bayes (NB) is a popular machine learning tool for classification, due to its simplicity, high computational efficiency, and good classification accuracy, especially for high dimensional data such as texts. In reality, the pronounced advantage of NB is often challenged by the strong conditional independence assumption between attributes, which may deteriorate the classification performance. Accordingly, numerous efforts have been made to improve NB, by using approaches such as structure extension, attribute selection, attribute weighting, instance weighting, local learning and so on. In this paper, we propose a new Artificial Immune System (AIS) based self-adaptive attribute weighting method for Naive Bayes classification. The proposed method, namely AISWNB, uses immunity theory in Artificial Immune Systems to search optimal attribute weight values, where self-adjusted weight values will alleviate the conditional independence assumption and help calculate the conditional probability in an accurate way. One noticeable advantage of AISWNB is that the unique immune system based evolutionary computation process, including initialization, clone, section, and mutation, ensures that AISWNB can adjust itself to the data without explicit specification of functional or distributional forms of the underlying model. As a result, AISWNB can obtain good attribute weight values during the learning process. Experiments and comparisons on 36 machine learning benchmark data sets and six image classification data sets demonstrate that AISWNB significantly outperforms its peers in classification accuracy, class probability estimation, and class ranking performance. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1487 / 1502
页数:16
相关论文
共 54 条
[31]   Real-time computerized annotation of pictures [J].
Li, Jia ;
Wang, James Z. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (06) :985-1002
[32]  
Ling C. X., 2003, In:Ijcai, V3, P519
[33]   Content-based image retrieval using color difference histogram [J].
Liu, Guang-Hai ;
Yang, Jing-Yu .
PATTERN RECOGNITION, 2013, 46 (01) :188-198
[34]   Constructing the Bayesian network structure from dependencies implied in multiple relational schemas [J].
Liu, Wei-Yi ;
Yue, Kun ;
Li, Wei-Hua .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (06) :7123-7134
[35]   Multiview Vector-Valued Manifold Regularization for Multilabel Image Classification [J].
Luo, Yong ;
Tao, Dacheng ;
Xu, Chang ;
Xu, Chao ;
Liu, Hong ;
Wen, Yonggang .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (05) :709-722
[36]   Supporting ranked Boolean similarity queries in MARS [J].
Ortega, M ;
Rui, Y ;
Chakrabarti, K ;
Porkaew, K ;
Mehrotra, S ;
Huang, TS .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1998, 10 (06) :905-925
[37]   A Dual-Population Genetic Algorithm for Adaptive Diversity Control [J].
Park, Taejin ;
Ryu, Kwang Ryel .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2010, 14 (06) :865-884
[38]  
Quinlan J, 2014, C4.5: Programs for machine learning
[39]   Theoretical and empirical analysis of ReliefF and RReliefF [J].
Robnik-Sikonja, M ;
Kononenko, I .
MACHINE LEARNING, 2003, 53 (1-2) :23-69
[40]   Differential evolution - A simple and efficient heuristic for global optimization over continuous spaces [J].
Storn, R ;
Price, K .
JOURNAL OF GLOBAL OPTIMIZATION, 1997, 11 (04) :341-359