Multi-generation multi-criteria feature construction using Genetic Programming

Cited by: 5
Authors
Ma, Jianbin [1, 2]
Gao, Xiaoying [3 ]
Li, Ying [4 ]
Affiliations
[1] Hebei Agr Univ, Coll Informat Sci & Technol, Baoding 071001, Peoples R China
[2] Hebei Key Lab Agr Big Data, Baoding 071001, Peoples R China
[3] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington 6140, New Zealand
[4] Hebei Agr Univ, Coll Econ & Management, Baoding 071001, Peoples R China
Keywords
Feature construction; Genetic programming; Overfitting; Multi-generation; Multi-criteria; MULTIPLE FEATURE CONSTRUCTION; FEATURE-SELECTION; FEATURE-EXTRACTION; NEURAL-NETWORKS; CLASSIFICATION; EVOLUTIONARY; OPTIMIZATION; INFORMATION
DOI
10.1016/j.swevo.2023.101285
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The purpose of feature construction is to create new high-level features from the original features. When Genetic Programming (GP) is applied to wrapper-based feature construction, it generally overfits the training set and generalizes poorly as evolution proceeds, especially when the sample size is small. Overfitting has attracted wide attention in some classification models; however, it is not commonly studied in the field of feature construction. In this paper, a Multi-Generation feature construction method (MG) is developed to preserve the solutions produced by multiple generations of GP. A Multi-Criteria feature construction method (MC) is introduced, which uses a multi-criteria evaluation function to evaluate GP individuals. Combining these two methods, a Multi-Generation Multi-Criteria feature construction method (MGMC) is proposed. Experiments on fourteen datasets show that the proposed MG and MC methods improve classification performance and overcome the overfitting problems of traditional feature construction methods in most cases. The combined MGMC method further improves classification performance and achieves the best results.
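The two ideas named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state which criteria MC uses or how MG combines the preserved solutions, so the criterion names (`train_accuracy`, `class_separation`), the equal weights, the weighted-sum aggregation, and the toy scores below are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Candidate:
    """A GP individual (constructed feature) with its criterion scores."""
    name: str                 # hypothetical label for the individual
    generation: int
    scores: Dict[str, float]  # criterion -> value, higher is better


def multi_criteria_fitness(scores: Dict[str, float],
                           weights: Dict[str, float]) -> float:
    """Normalized weighted sum of criteria (an assumed aggregation rule)."""
    total = sum(weights.values())
    return sum(weights[k] * scores[k] for k in weights) / total


def preserve_best_per_generation(history: List[List[Candidate]],
                                 weights: Dict[str, float]) -> List[Candidate]:
    """Keep the best individual of every generation, not only the last one."""
    return [
        max(population, key=lambda c: multi_criteria_fitness(c.scores, weights))
        for population in history
    ]


# Toy scores (invented): training accuracy keeps rising across generations
# while the second, hypothetical criterion degrades, mimicking overfitting.
history = [
    [Candidate("A", 0, {"train_accuracy": 0.70, "class_separation": 0.60}),
     Candidate("B", 0, {"train_accuracy": 0.65, "class_separation": 0.50})],
    [Candidate("C", 1, {"train_accuracy": 0.80, "class_separation": 0.55}),
     Candidate("D", 1, {"train_accuracy": 0.85, "class_separation": 0.30})],
    [Candidate("E", 2, {"train_accuracy": 0.95, "class_separation": 0.10}),
     Candidate("F", 2, {"train_accuracy": 0.90, "class_separation": 0.40})],
]
weights = {"train_accuracy": 0.5, "class_separation": 0.5}  # assumed weights

# Archive built with the multi-criteria score.
archive = preserve_best_per_generation(history, weights)

# For contrast: the same archive built from training accuracy alone.
single = preserve_best_per_generation(history, {"train_accuracy": 1.0})

print([c.name for c in archive])
print([c.name for c in single])
```

In this toy setting the single-criterion archive follows training accuracy wherever it leads, while the multi-criteria archive prefers individuals that also score well on the second criterion. How the preserved individuals are then used by a downstream classifier is left open here, since the abstract does not specify it.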
Pages: 14