Convex ensemble learning with sparsity and diversity

被引:40
作者
Yin, Xu-Cheng [1 ,2 ]
Huang, Kaizhu [3 ]
Yang, Chun [1 ]
Hao, Hong-Wei [4 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Dept Comp Sci & Technol, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing Key Lab Mat Sci Knowledge Engn, Beijing 100083, Peoples R China
[3] Xian Jiaotong Liverpool Univ, Dept Elect & Elect Engn, Suzhou 215123, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Classifier ensemble; Sparsity; Diversity; Convex quadratic programming; MULTIPLE CLASSIFIERS; COMBINATION; REGRESSION; SELECTION;
D O I
10.1016/j.inffus.2013.11.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classifier ensemble has been broadly studied in two prevalent directions, i.e., to diversely generate classifier components, and to sparsely combine multiple classifiers. While most current approaches are emphasized on either sparsity or diversity only, we investigate classifier ensemble focused on both in this paper. We formulate the classifier ensemble problem with the sparsity and diversity learning in a general mathematical framework, which proves beneficial for grouping classifiers. In particular, derived from the error-ambiguity decomposition, we design a convex ensemble diversity measure. Consequently, accuracy loss, sparseness regularization, and diversity measure can be balanced and combined in a convex quadratic programming problem. We prove that the final convex optimization leads to a closed-form solution, making it very appealing for real ensemble learning problems. We compare our proposed novel method with other conventional ensemble methods such as Bagging, least squares combination, sparsity learning, and AdaBoost, extensively on a variety of UCI benchmark data sets and the Pascal Large Scale Learning Challenge 2008 webspam data. Experimental results confirm that our approach has very promising performance. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:49 / 59
页数:11
相关论文
共 63 条
[1]  
[Anonymous], 1973, Pattern Classification and Scene Analysis
[2]  
Banfield R. E., 2005, Information Fusion, V6, P49, DOI 10.1016/j.inffus.2004.04.005
[3]   The combination of multiple classifiers using an evidential reasoning approach [J].
Bi, Yaxin ;
Guan, Jiwen ;
Bell, David .
ARTIFICIAL INTELLIGENCE, 2008, 172 (15) :1731-1751
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]  
Brown G., 2005, Information Fusion, V6, P5, DOI 10.1016/j.inffus.2004.04.004
[6]   Local Classifier Weighting by Quadratic Programming [J].
Cevikalp, Hakan ;
Polikar, Robi .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (10) :1832-+
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]  
Chen H., 2009, IEEE T KNOWL DATA EN, V22, P1751
[9]   Regularized Negative Correlation Learning for Neural Network Ensembles [J].
Chen, Huanhuan ;
Yao, Xin .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (12) :1962-1979
[10]   Predictive Ensemble Pruning by Expectation Propagation [J].
Chen, Huanhuan ;
Tino, Peter ;
Yao, Xin .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (07) :999-1013