Applying Ant Colony Optimization to configuring stacking ensembles for data mining

Cited by: 56
Authors
Chen, Yijun [1]
Wong, Man-Leung [1]
Li, Haibing [1]
Affiliation
[1] Lingnan University, Department of Computing and Decision Sciences, Tuen Mun, Hong Kong, People's Republic of China
Keywords
ACO; Ensemble; Stacking; Metaheuristics; Data mining; Direct marketing; Classification; Selection
DOI
10.1016/j.eswa.2013.10.063
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
An ensemble is a collective decision-making system that applies a strategy to combine the predictions of learned classifiers into a single prediction for each new instance. Earlier research has shown, both empirically and theoretically, that an ensemble is in most cases more accurate than any of its individual component classifiers. Although many ensemble approaches have been proposed, finding a suitable ensemble configuration for a specific dataset remains difficult. In some early work, the ensemble was configured manually, based on the experience of specialists. Metaheuristic methods offer an alternative way to search for configurations, and Ant Colony Optimization (ACO) is a popular one. In this work, we propose a new ensemble construction method that applies ACO to the stacking ensemble construction process to generate domain-specific configurations. Experiments compare the proposed approach with several well-known ensemble methods on 18 benchmark data mining datasets. The approach is also applied to learning ensembles for a real-world cost-sensitive data mining problem. The experimental results show that the new approach can generate better stacking ensembles. (C) 2013 Elsevier Ltd. All rights reserved.
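The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of letting ants search over ensemble configurations, not the authors' method. It assumes a simplified binary pheromone scheme (one pheromone value per candidate base classifier, evaporation, reinforcement toward the best configuration found so far, and clamped pheromone bounds). The candidate names, the function `aco_select`, its parameters, and the toy objective `toy_score` are all hypothetical; in practice the objective would be something like the cross-validated accuracy of a stacking ensemble built from the selected base classifiers.

```python
import random

# Hypothetical pool of base classifiers an ant may include in the stack.
CANDIDATES = ["decision_tree", "naive_bayes", "knn", "logistic_regression", "svm"]


def toy_score(subset):
    """Stand-in objective (an assumption, not from the paper).

    Each candidate contributes a fixed gain and every member costs a flat
    penalty, so the objective rewards small, useful subsets.
    """
    gain = {0: 0.30, 1: 0.25, 2: 0.05, 3: 0.20, 4: 0.10}
    return sum(gain[i] for i in subset) - 0.12 * len(subset)


def aco_select(n_candidates, evaluate, n_ants=10, n_iters=30, rho=0.1, seed=0):
    """Search for a subset of candidates that maximizes evaluate(subset)."""
    rng = random.Random(seed)
    # tau[i] is the pheromone on "include candidate i", used as a probability.
    tau = [0.5] * n_candidates
    best_subset, best_score = None, float("-inf")

    for _ in range(n_iters):
        for _ in range(n_ants):
            # Each ant builds a configuration by sampling every component.
            subset = tuple(i for i in range(n_candidates) if rng.random() < tau[i])
            if not subset:
                # An ensemble needs at least one base classifier.
                subset = (rng.randrange(n_candidates),)
            score = evaluate(subset)
            if score > best_score:
                best_subset, best_score = subset, score

        # Evaporate all trails, then reinforce the best-so-far configuration.
        chosen = set(best_subset)
        for i in range(n_candidates):
            deposit = 1.0 if i in chosen else 0.0
            tau[i] = (1.0 - rho) * tau[i] + rho * deposit
            # Clamp so that no component becomes impossible or mandatory.
            tau[i] = min(0.95, max(0.05, tau[i]))

    return best_subset, best_score


if __name__ == "__main__":
    subset, score = aco_select(len(CANDIDATES), toy_score)
    print([CANDIDATES[i] for i in subset], round(score, 3))
```

Replacing `toy_score` with a function that trains and validates a stacking ensemble on the selected base classifiers would turn the sketch into a dataset-specific configuration search, at the cost of one model fit per ant per iteration.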
Pages: 2688-2702
Number of pages: 15