A novel multi-objective genetic algorithm based error correcting output codes

被引:16
作者
Zhang, Yu-Ping [1 ]
Ye, Xiao-Na [1 ]
Liu, Kun-Hong [1 ]
Yao, Jun-Feng [1 ]
机构
[1] Xiamen Univ, Sch Informat, Xiamen, Fujian, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Multi-objective; Genetic algorithm (GA); Error correcting output codes (ECOC); Pairwise diversity; Multiclass classification; Heterogeneous ensemble; CANCER-DIAGNOSIS; DEPENDENT DESIGN; MULTICLASS; CLASSIFICATION; ENSEMBLE; PREDICTION; CLASSIFIERS; SELECTION; ECOC; OPTIMIZATION;
D O I
10.1016/j.swevo.2020.100709
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Up to now, different genetic algorithm (GA) based error correcting output codes (ECOC) algorithms have been proposed by setting accuracy as the optimization objective. However, it was demonstrated that diversity among learners is of great significance to a robust ensemble. In this paper, we propose a multi-objective GA with setting accuracy and diversity as two objectives. To further promote diversity in an ensemble, a new individual structure is designed to accommodate heterogeneous dichotomizers. Three multi-objective ranking strategies are deployed to balance two objectives respectively. A novel genetic operator is designed to produce ECOC-compatible offspring in the evolutionary process, and a local improvement algorithm is designed to promote individuals' fitness values. To verify the performance of our GA, a single objective ranking strategy and the design of homogeneous learner based GA are also adopted. Ten widely used ECOC algorithms and three famous ensemble algorithms are deployed for performance comparisons based on a set of the UCI data and microarray data sets. Results show that compared with other algorithms, our GA obtains higher performance in most cases due to the trade-off between performance and diversity. Besides, the accommodation of heterogeneous dichotomizers in an ensemble provides higher generalization ability compared with homogeneous ensembles.
引用
收藏
页数:19
相关论文
共 64 条
[1]  
Adair Pearl, 1940, AM J OBSTET GYNECOL, V39, P725
[2]   Reducing multiclass to binary: A unifying approach for margin classifiers [J].
Allwein, EL ;
Schapire, RE ;
Singer, Y .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :113-141
[3]   Minimal design of error-correcting output codes [J].
Angel Bautista, Miguel ;
Escalera, Sergio ;
Baro, Xavier ;
Radeva, Petia ;
Vitria, Jordi ;
Pujol, Oriol .
PATTERN RECOGNITION LETTERS, 2012, 33 (06) :693-702
[4]  
[Anonymous], ECOC J HARBIN ENG U
[5]  
[Anonymous], 2004, FORMAL PATTERN ANAL
[6]   MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia [J].
Armstrong, SA ;
Staunton, JE ;
Silverman, LB ;
Pieters, R ;
de Boer, ML ;
Minden, MD ;
Sallan, SE ;
Lander, ES ;
Golub, TR ;
Korsmeyer, SJ .
NATURE GENETICS, 2002, 30 (01) :41-47
[7]   A genetic-based subspace analysis method for improving Error-Correcting Output Coding [J].
Bagheri, Mohammad Ali ;
Gao, Qigang ;
Escalera, Sergio .
PATTERN RECOGNITION, 2013, 46 (10) :2830-2839
[8]   Gene-expression profiles predict survival of patients with lung adenocarcinoma [J].
Beer, DG ;
Kardia, SLR ;
Huang, CC ;
Giordano, TJ ;
Levin, AM ;
Misek, DE ;
Lin, L ;
Chen, GA ;
Gharib, TG ;
Thomas, DG ;
Lizyness, ML ;
Kuick, R ;
Hayasaka, S ;
Taylor, JMG ;
Iannettoni, MD ;
Orringer, MB ;
Hanash, S .
NATURE MEDICINE, 2002, 8 (08) :816-824
[9]   Multi-objective evolution of oblique decision trees for imbalanced data binary classification [J].
Chabbouh, Marwa ;
Bechikh, Slim ;
Hung, Chih-Cheng ;
Ben Said, Lamjed .
SWARM AND EVOLUTIONARY COMPUTATION, 2019, 49 :1-22
[10]   On the learnability and design of output codes for multiclass problems [J].
Crammer, K ;
Singer, Y .
MACHINE LEARNING, 2002, 47 (2-3) :201-233