Multi-generation multi-criteria feature construction using Genetic Programming

Cited by: 5
Authors
Ma, Jianbin [1, 2]
Gao, Xiaoying [3 ]
Li, Ying [4 ]
Affiliations
[1] Hebei Agr Univ, Coll Informat Sci & Technol, Baoding 071001, Peoples R China
[2] Hebei Key Lab Agr Big Data, Baoding 071001, Peoples R China
[3] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington 6140, New Zealand
[4] Hebei Agr Univ, Coll Econ & Management, Baoding 071001, Peoples R China
Keywords
Feature construction; Genetic programming; Overfitting; Multi-generation; Multi-criteria; MULTIPLE FEATURE CONSTRUCTION; FEATURE-SELECTION; FEATURE-EXTRACTION; NEURAL-NETWORKS; CLASSIFICATION; EVOLUTIONARY; OPTIMIZATION; INFORMATION
DOI
10.1016/j.swevo.2023.101285
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Feature construction creates new high-level features from the original features. When Genetic Programming (GP) is applied to wrapper-based feature construction, especially when the sample size is small, GP generally overfits the training set and generalizes increasingly poorly as evolution proceeds. Overfitting has attracted wide attention for some classification models, but it is not commonly studied in the field of feature construction. In this paper, a Multi-Generation feature construction method (MG) is developed to preserve the solutions produced by multiple generations of GP. A Multi-Criteria feature construction method (MC) is introduced that uses a multi-criteria evaluation function to evaluate GP individuals. Combining these two methods yields a Multi-Generation Multi-Criteria feature construction method (MGMC). Experiments on fourteen datasets show that, in most cases, the proposed MG and MC methods improve classification performance and overcome the overfitting problems of traditional feature construction methods. The combined MGMC method further improves classification performance and achieves the best results.
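The abstract does not say which criteria MC combines or how MG uses the solutions it preserves, so the Python sketch below is only an illustration under stated assumptions, not the authors' method. It assumes a weighted sum as the multi-criteria score and an archive that keeps the best individual of every generation. The names `multi_criteria_score`, `GenerationArchive`, the criterion values, and the weights are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Sequence, Tuple


def multi_criteria_score(criteria: Sequence[float], weights: Sequence[float]) -> float:
    """Combine several criteria into one score.

    A weighted sum is an assumption made for illustration; the abstract
    does not state how the criteria are aggregated.
    """
    if len(criteria) != len(weights):
        raise ValueError("criteria and weights must have the same length")
    return sum(c * w for c, w in zip(criteria, weights))


@dataclass
class GenerationArchive:
    """Keep the best individual of every generation, not only the last one."""

    best_per_generation: List[Tuple[float, str]] = field(default_factory=list)

    def record(self, population: Sequence[str], score: Callable[[str], float]) -> None:
        # Store the highest-scoring individual of this generation.
        best = max(population, key=score)
        self.best_per_generation.append((score(best), best))

    def top(self, k: int) -> List[str]:
        # Return the k best archived individuals across all generations.
        ranked = sorted(self.best_per_generation, key=lambda pair: pair[0], reverse=True)
        return [individual for _, individual in ranked[:k]]


# Made-up toy data: each constructed feature (a string expression) has two
# hypothetical criterion values: (training accuracy, a second criterion).
CRITERIA = {
    "f1 + f2": (0.80, 0.60),
    "f1 * f3": (0.85, 0.40),
    "f2 - f3": (0.90, 0.20),
    "f1 / f2": (0.95, 0.10),
}
WEIGHTS = (0.5, 0.5)  # hypothetical equal weighting


def score(individual: str) -> float:
    return multi_criteria_score(CRITERIA[individual], WEIGHTS)


archive = GenerationArchive()
archive.record(["f1 + f2", "f1 * f3"], score)  # generation 0
archive.record(["f2 - f3", "f1 / f2"], score)  # generation 1
print(archive.top(2))
```

In this toy data, ranking by training accuracy alone would favour the latest individual, `"f1 / f2"`. The combined score and the archive instead retain an earlier-generation solution, which is the kind of behaviour the abstract attributes to MC and MG.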
Pages: 14