Ensemble classifier generation using non-uniform layered clustering and Genetic Algorithm

被引:42
作者
Rahman, Ashfaqur [1 ]
Verma, Brijesh [2 ]
机构
[1] CSIRO, Intelligent Sensing & Syst Lab, Hobart, Tas, Australia
[2] Cent Queensland Univ, Ctr Intelligent & Networked Syst, Rockhampton, Qld 4702, Australia
关键词
Ensemble classifier; Genetic Algorithm; Multiple Classifier Systems; Cluster Based Ensemble Classifiers; Diversity in Ensemble Classifiers; RANDOM SUBSPACE ENSEMBLES; DIVERSITY; MODEL;
D O I
10.1016/j.knosys.2013.01.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel cluster oriented ensemble classifier generation method and a Genetic Algorithm based approach to optimize the parameters. In the proposed method the data set is partitioned into a variable number of clusters at different layers. Base classifiers are trained on the clusters at different layers. Due to the variability of the number of clusters at different layers, the cluster compositions in one layer are different from that in another layer. Due to this difference in cluster contents, the base classifiers trained at different layers are diverse among each other. A test pattern is classified by the base classifier of the nearest cluster at each layer and the decisions from different layers are fused using majority voting. The accuracy of the proposed method depends on the number of layers and the number of clusters at the corresponding layer. A Genetic Algorithm based search is incorporated to obtain the optimal number of layers and clusters. The Genetic Algorithm is evaluated under three different objective functions: optimizing (i) accuracy, (ii) diversity, and (iii) accuracy x diversity. We have conducted a number of experiments to evaluate the effectiveness of the different objective functions. (C) 2013 Elsevier By. All rights reserved.
引用
收藏
页码:30 / 42
页数:13
相关论文
共 39 条
[1]  
[Anonymous], 2009, HIERARCHICAL CLUSTER
[2]  
[Anonymous], 2000, HDB PARAMETRIC NONPA
[3]   Bio-molecular cancer prediction with random subspace ensembles of support vector machines [J].
Bertoni, A ;
Folgieri, R ;
Valentini, G .
NEUROCOMPUTING, 2005, 63 :535-539
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Pasting small votes for classification in large databases and on-line [J].
Breiman, L .
MACHINE LEARNING, 1999, 36 (1-2) :85-103
[7]  
Brown G., 2005, Information Fusion, V6, P5, DOI 10.1016/j.inffus.2004.04.004
[8]   Multiobjective Neural Network Ensembles Based on Regularized Negative Correlation Learning [J].
Chen, Huanhuan ;
Yao, Xin .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (12) :1738-1751
[9]   Regularized Negative Correlation Learning for Neural Network Ensembles [J].
Chen, Huanhuan ;
Yao, Xin .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (12) :1962-1979
[10]  
Demsar J, 2006, J MACH LEARN RES, V7, P1