Genetic programming for multiple-feature construction on high-dimensional classification

被引:70
作者
Binh Tran [1 ]
Xue, Bing [1 ]
Zhang, Mengjie [1 ]
机构
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, POB 600, Wellington 6140, New Zealand
关键词
Feature construction; Genetic programming; Classification; Class dependence; High-dimensional data; SELECTION;
D O I
10.1016/j.patcog.2019.05.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data representation is an important factor in deciding the performance of machine learning algorithms including classification. Feature construction (FC) can combine original features to form high-level ones that can help classification algorithms achieve better performance. Genetic programming (GP) has shown promise in FC due to its flexible representation. Most GP methods construct a single feature, which may not scale well to high-dimensional data. This paper aims at investigating different approaches to constructing multiple features and analysing their effectiveness, efficiency, and underlying behaviours to reveal the insight of multiple-feature construction using GP on high-dimensional data. The results show that multiple-feature construction achieves significantly better performance than single-feature construction. In multiple-feature construction, using multi-tree GP representation is shown to be more effective than using the single-tree GP thanks to the ability to consider the interaction of the newly constructed features during the construction process. Class-dependent constructed features achieve better performance than the class-independent ones. A visualisation of the constructed features also demonstrates the interpretability of the GP-based FC approach, which is important to many real-world applications. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:404 / 417
页数:14
相关论文
共 40 条
[1]   Deep Audio-Visual Speech Recognition [J].
Afouras, Triantafyllos ;
Chung, Joon Son ;
Senior, Andrew ;
Vinyals, Oriol ;
Zisserman, Andrew .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) :8717-8727
[2]  
Ahluwalla M, 1999, GECCO-99: PROCEEDINGS OF THE GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, P947
[3]  
Ahmed S, 2014, 2014 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), P2756, DOI 10.1109/CEC.2014.6900317
[4]   Automatically Evolving Rotation-Invariant Texture Image Descriptors by Genetic Programming [J].
Al-Sahaf, Harith ;
Al-Sahaf, Ausama ;
Xue, Bing ;
johnston, Mark ;
Zhang, Mengjie .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2017, 21 (01) :83-101
[5]  
[Anonymous], 2002, P GEN EV COMP C
[6]  
[Anonymous], PUBLIC FINANCE RES
[7]   Multi-channel multi-model feature learning for face recognition [J].
Aslan, Melih S. ;
Hailat, Zeyad ;
Alafif, Tarik K. ;
Chen, Xue-Wen .
PATTERN RECOGNITION LETTERS, 2017, 85 :79-83
[8]   Using Feature Clustering for GP-Based Feature Construction on High-Dimensional Data [J].
Binh Tran ;
Xue, Bing ;
Zhang, Mengjie .
GENETIC PROGRAMMING, EUROGP 2017, 2017, 10196 :210-226
[9]   Genetic programming for feature construction and selection in classification on high-dimensional data [J].
Binh Tran ;
Xue, Bing ;
Zhang, Mengjie .
MEMETIC COMPUTING, 2016, 8 (01) :3-15
[10]  
Cha S. H., 2007, International Journal of Mathematical Models and Methods in Applied Sciences, V1, P300