A comparison of fitness-case sampling methods for symbolic regression with genetic programming

被引:0
|
作者
Martínez, Yuliana [1 ]
Trujillo, Leonardo [1 ]
Naredo, Enrique [1 ]
Legrand, Pierrick [2 ,3 ]
机构
[1] TREE-LAB, Departamento de Ingeniería Eléctrica y Electrónica, Instituto Tecnológico de Tijuana, Blvd. Industrial y Av. ITR Tijuana S/N, Mesa Otay C.P. 22500, Tijuana, B.C
[2] Université Victor Segalen Bordeaux 2 and The Institut de Mathmatiques de Bordeaux
[3] ALEA Team, INRIA Bordeaux Sud-Ouest
来源
Advances in Intelligent Systems and Computing | 2014年 / 288卷
关键词
Fitness-case sampling; Performance evaluation; Symbolic regression;
D O I
10.1007/978-3-319-07494-8_14
中图分类号
学科分类号
摘要
The canonical approach towards fitness evaluation in Genetic Programming (GP) is to use a static training set to determine fitness, based on a cost function averaged over all fitness-cases. However, motivated by different goals, researchers have recently proposed several techniques that focus selective pressure on a subset of fitness-cases at each generation. These approaches can be described as fitness-case sampling techniques, where the training set is sampled, in some way, to determine fitness. This paper shows a comprehensive evaluation of some of the most recent sampling methods, using benchmark and real-world problems for symbolic regression. The algorithms considered here are Interleaved Sampling, Random Interleaved Sampling, Lexicase Selection and a new sampling technique is proposed called Keep-Worst Interleaved Sampling (KW-IS). The algorithms are extensively evaluated based on test performance, overfitting and bloat. Results suggest that sampling techniques can improve performance compared with standard GP. While on synthetic benchmarks the difference is slight or none at all, on real-world problems the differences are substantial. Some of the best results were achieved by Lexicase Selection and KeepWorse-Interleaved Sampling. Results also show that on real-world problems overfitting correlates strongly with bloating. Furthermore, the sampling techniques provide efficiency, since they reduce the number of fitness-case evaluations required over an entire run. © Springer International Publishing Switzerland 2014.
引用
收藏
页码:201 / 212
页数:11
相关论文
共 50 条
  • [11] Genetic programming with separability detection for symbolic regression
    Wei-Li Liu
    Jiaquan Yang
    Jinghui Zhong
    Shibin Wang
    Complex & Intelligent Systems, 2021, 7 : 1185 - 1194
  • [12] Genetic Programming-Based Selection of Imputation Methods in Symbolic Regression with Missing Values
    Al-Helali, Baligh
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    AI 2020: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 12576 : 163 - 175
  • [13] An Analysis of Exchanging Fitness Cases with Population Size in Symbolic Regression Genetic Programming with Respect to the Computational Model
    Applegate, Douglas
    Mayfield, Blayne
    2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2013, : 3111 - 3116
  • [14] Bingo: A Customizable Framework for Symbolic Regression with Genetic Programming
    Randall, David L.
    Townsend, Tyler S.
    Hochhalter, Jacob D.
    Bomarito, Geoffrey F.
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 2282 - 2288
  • [15] An efficient memetic genetic programming framework for symbolic regression
    Cheng, Tiantian
    Zhong, Jinghui
    MEMETIC COMPUTING, 2020, 12 (04) : 299 - 315
  • [16] An efficient memetic genetic programming framework for symbolic regression
    Tiantian Cheng
    Jinghui Zhong
    Memetic Computing, 2020, 12 : 299 - 315
  • [17] Semantic schema based genetic programming for symbolic regression
    Zojaji, Zahra
    Ebadzadeh, Mohammad Mehdi
    Nasiri, Hamid
    APPLIED SOFT COMPUTING, 2022, 122
  • [18] Investigation of Linear Genetic Programming Techniques for Symbolic Regression
    Dal Piccol Sotto, Leo Francoso
    de Melo, Vinicius Veloso
    2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 146 - 151
  • [19] Population diversity and inheritance in genetic programming for symbolic regression
    Burlacu, Bogdan
    Yang, Kaifeng
    Affenzeller, Michael
    NATURAL COMPUTING, 2024, 23 (03) : 531 - 566
  • [20] Symbolic Regression via Control Variable Genetic Programming
    Jiang, Nan
    Xue, Yexiang
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV, 2023, 14172 : 178 - 195