A novel Error-Correcting Output Codes algorithm based on genetic programming

被引:15
作者
Li, Ke-Sen [1 ,2 ,3 ]
Wang, Han-Rui [1 ,4 ]
Liu, Kun-Hong [1 ]
机构
[1] Xiamen Univ, Sch Informat, Xiamen, Fujian, Peoples R China
[2] Chinese Acad Sci, Inst Software, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Fudan Univ, Sch Software, Shanghai, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Error-Correcting Output Codes (ECOC); Genetic programming (GP); Multiclass classification; Genetic operators; MULTICLASS; ENSEMBLE; CLASSIFICATION; DESIGN; CLASSIFIERS; ECOC; OPTIMIZATION; EQUATIONS; SELECTION; FOREST;
D O I
10.1016/j.swevo.2019.100564
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Error-Correcting Output Codes (ECOC) is widely used in the field of multiclass classification. As an optimal codematrix is key to the performance of an ECOC algorithm, this paper proposes a genetic programming (GP) based ECOC algorithm (GP-ECOC). In the design of individual of our GP, each terminal node represents a class, and nonterminal nodes combine the classes in their child nodes. In this way, an individual is a class combination tree, and represents an ECOC codematrix. A legality checking process is embedded in our algorithm to check each codematrix, so as to ensure each codematrix satisfying ECOC constraints. Those violating the constraints will be corrected by a proposed Guided Mutation operator. Before fitness evaluation, a local optimization algorithm is proposed to append new columns for tough classes, so as to improve the generalization ability of each individual and accelerate the evolutionary speed. In this way, our GP can evolve optimal codematrices through the evolutionary process. Experiments show that compared with other ensemble algorithms, our algorithm can achieve stable and high performances with relatively small ensemble scales on various UCI data sets. To the best of our knowledge, it is the first time that GP has been applied to implement the ECOC encoding algorithm. Our Python code is available at https://github.com/samuellees/gpecoc.
引用
收藏
页数:18
相关论文
共 55 条
[1]   Reducing multiclass to binary: A unifying approach for margin classifiers [J].
Allwein, EL ;
Schapire, RE ;
Singer, Y .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :113-141
[2]   Minimal design of error-correcting output codes [J].
Angel Bautista, Miguel ;
Escalera, Sergio ;
Baro, Xavier ;
Radeva, Petia ;
Vitria, Jordi ;
Pujol, Oriol .
PATTERN RECOGNITION LETTERS, 2012, 33 (06) :693-702
[3]  
[Anonymous], 2018, SWARM EVOLUTIONARY C
[4]  
[Anonymous], ECOC J HARBIN ENG U
[5]  
[Anonymous], 24 INT C PATT REC BE
[6]  
[Anonymous], OPTIMIZATION CLASSIF
[7]  
[Anonymous], 14 INT C INT COMP WU
[8]  
[Anonymous], 18 INT C PATT REC P
[9]   A genetic-based subspace analysis method for improving Error-Correcting Output Coding [J].
Bagheri, Mohammad Ali ;
Gao, Qigang ;
Escalera, Sergio .
PATTERN RECOGNITION, 2013, 46 (10) :2830-2839
[10]   Traffic Sign Recognition Using Evolutionary Adaboost Detection and Forest-ECOC Classification [J].
Baro, Xavier ;
Escalera, Sergio ;
Vitria, Jordi ;
Pujol, Oriol ;
Radeva, Petia .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2009, 10 (01) :113-126