Error-correcting output codes based ensemble feature extraction

被引:27
作者
Zhong, Guoqiang [1 ]
Liu, Cheng-Lin [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit NLPR, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Ensemble learning; Error-correcting output codes (ECOC); Meta learner; NONLINEAR DIMENSIONALITY REDUCTION; CLASSIFIERS; COMBINATION; MANIFOLDS; DESIGN; ECOC;
D O I
10.1016/j.patcog.2012.10.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel feature extraction method based on ensemble learning. Using the error-correcting output codes (ECOC) to design binary classifiers (dichotomizers) for separating subsets of classes, the outputs of the dichotomizers are linear or nonlinear features that provide powerful separability in a new space. In this space, the vector quantization based meta classifier can be viewed as an ECOC decoder, where each learned prototype of a class can be seen as a codeword of the class in the new representation space. We conducted extensive experiments on 16 multi-class data sets from the UCI machine learning repository. The results demonstrate the superiority of the proposed method over both existing ECOC approaches and classic feature extraction approaches. In particular, the decoding strategy using a meta classifier is shown to be more computationally efficient than the linear loss-weighted decoding in state-of-the-art ECOC methods. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1091 / 1100
页数:10
相关论文
共 53 条
[1]   Reducing multiclass to binary: A unifying approach for margin classifiers [J].
Allwein, EL ;
Schapire, RE ;
Singer, Y .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :113-141
[2]   An information theoretic framework for weight estimation in the combination of probabilistic classifiers for speaker identification [J].
Altinçay, H ;
Demirekler, M .
SPEECH COMMUNICATION, 2000, 30 (04) :255-272
[3]   Generalized discriminant analysis using a kernel approach [J].
Baudat, G ;
Anouar, FE .
NEURAL COMPUTATION, 2000, 12 (10) :2385-2404
[4]   Laplacian eigenmaps for dimensionality reduction and data representation [J].
Belkin, M ;
Niyogi, P .
NEURAL COMPUTATION, 2003, 15 (06) :1373-1396
[5]  
Bengio Y., 2003, NIPS, P177, DOI DOI 10.5555/2981345.2981368
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]   Multiple classifiers applied to multisource remote sensing data [J].
Briem, GJ ;
Benediktsson, JA ;
Sveinsson, JR .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2002, 40 (10) :2291-2299
[8]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[9]   On the learnability and design of output codes for multiclass problems [J].
Crammer, K ;
Singer, Y .
MACHINE LEARNING, 2002, 47 (2-3) :201-233
[10]  
Dekel O., 2002, ADV NEURAL INFORM PR, V15, P945