EVOLVING COEVOLUTIONARY CLASSIFIERS UNDER LARGE ATTRIBUTE SPACES

被引:1
作者
Doucette, John [1 ]
Lichodzijewski, Peter [1 ]
Heywood, Malcolm [1 ]
机构
[1] Dalhousie Univ, Fac Comp Sci, Halifax, NS B3H 1W5, Canada
来源
GENETIC PROGRAMMING THEORY AND PRACTICE VII | 2010年
关键词
Problem Decomposition; Bid-based Cooperative Behaviors; Symbiotic Coevolution; Subspace Classifier; Large Attribute Spaces; PROBLEM DECOMPOSITION; CLASSIFICATION; MODELS;
D O I
10.1007/978-1-4419-1626-6_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Model-building under the supervised learning domain potentially face a dual learning problem of identifying both the parameters of the model and the subset of (domain) attributes necessary to support the model, thus using an embedded as opposed to wrapper or filter based design. Genetic Programming (GP) has always addressed this dual problem, however, further implicit assumptions are made which potentially increase the complexity of the resulting solutions. In this work we are specifically interested in the case of classification under very large attribute spaces. As such it might be expected that multiple independent/overlapping attribute subspaces support the mapping to class labels; whereas GP approaches to classification generally assume a single binary classifier per class, forcing the model to provide a solution in terms of a single attribute subspace and single mapping to class labels. Supporting the more general goal is considered as a requirement for identifying a 'team' of classifiers with non-overlapping classifier behaviors, in which each classifier responds to different subsets of exemplars. Moreover, the subsets of attributes associated with each team member might utilize a unique 'subspace' of attributes. This work investigates the utility of coevolutionary model building for the case of classification problems with attribute vectors consisting of 650 to 100,000 dimensions. The resulting team based coevolutionary evolutionary method-Symbiotic Bid-based (SBB) GP-is compared to alternative embedded classifier approaches of C4.5 and Maximum Entropy Classification (MaxEnt). SSB solutions demonstrate up to an order of magnitude lower attribute count relative to C4.5 and up to two orders of magnitude lower attribute count than MaxEnt while retaining comparable or better classification performance. Moreover, relative to the attribute count of individual models participating within a team, no more than six attributes are ever utilized; adding a further level of simplicity to the resulting solutions.
引用
收藏
页码:37 / 54
页数:18
相关论文
共 25 条
[1]  
[Anonymous], P 10 ANN C GEN EV CO
[2]  
[Anonymous], UCI REPOSITORY MACHI
[3]   Accuracy-based Learning Classifier Systems:: Models, analysis and applications to classification tasks [J].
Bernadó-Mansilla, E ;
Garrell-Guiu, JM .
EVOLUTIONARY COMPUTATION, 2003, 11 (03) :209-238
[4]   Evolving Teams of Predictors with Linear Genetic Programming [J].
Markus Brameier ;
Wolfgang Banzhaf .
Genetic Programming and Evolvable Machines, 2001, 2 (4) :381-407
[5]  
Daume Hal., 2004, NOTES CG LM BFGS OPT
[6]   A monotonic archive for pareto-coevolution [J].
de Jong, Edwin D. .
EVOLUTIONARY COMPUTATION, 2007, 15 (01) :61-93
[7]  
DOUCETTE J, 2009, PROBLEM DECOMP UNPUB
[8]  
Doucette J, 2008, LECT NOTES COMPUT SC, V4971, P266, DOI 10.1007/978-3-540-78671-9_23
[9]   GP ensembles for large-scale data classification [J].
Folino, Gianluigi ;
Pizzuti, Clara ;
Spezzano, Giandomenico .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2006, 10 (05) :604-616
[10]   Scaling large margin classifiers for spoken language understanding [J].
Haffner, P .
SPEECH COMMUNICATION, 2006, 48 (3-4) :239-261