A comparison of fitness-case sampling methods for symbolic regression with genetic programming

被引:0
|
作者
Martínez, Yuliana [1 ]
Trujillo, Leonardo [1 ]
Naredo, Enrique [1 ]
Legrand, Pierrick [2 ,3 ]
机构
[1] TREE-LAB, Departamento de Ingeniería Eléctrica y Electrónica, Instituto Tecnológico de Tijuana, Blvd. Industrial y Av. ITR Tijuana S/N, Mesa Otay C.P. 22500, Tijuana, B.C
[2] Université Victor Segalen Bordeaux 2 and The Institut de Mathmatiques de Bordeaux
[3] ALEA Team, INRIA Bordeaux Sud-Ouest
来源
Advances in Intelligent Systems and Computing | 2014年 / 288卷
关键词
Fitness-case sampling; Performance evaluation; Symbolic regression;
D O I
10.1007/978-3-319-07494-8_14
中图分类号
学科分类号
摘要
The canonical approach towards fitness evaluation in Genetic Programming (GP) is to use a static training set to determine fitness, based on a cost function averaged over all fitness-cases. However, motivated by different goals, researchers have recently proposed several techniques that focus selective pressure on a subset of fitness-cases at each generation. These approaches can be described as fitness-case sampling techniques, where the training set is sampled, in some way, to determine fitness. This paper shows a comprehensive evaluation of some of the most recent sampling methods, using benchmark and real-world problems for symbolic regression. The algorithms considered here are Interleaved Sampling, Random Interleaved Sampling, Lexicase Selection and a new sampling technique is proposed called Keep-Worst Interleaved Sampling (KW-IS). The algorithms are extensively evaluated based on test performance, overfitting and bloat. Results suggest that sampling techniques can improve performance compared with standard GP. While on synthetic benchmarks the difference is slight or none at all, on real-world problems the differences are substantial. Some of the best results were achieved by Lexicase Selection and KeepWorse-Interleaved Sampling. Results also show that on real-world problems overfitting correlates strongly with bloating. Furthermore, the sampling techniques provide efficiency, since they reduce the number of fitness-case evaluations required over an entire run. © Springer International Publishing Switzerland 2014.
引用
收藏
页码:201 / 212
页数:11
相关论文
共 50 条
  • [41] Adaptive Weighted Splines - A New Representation to Genetic Programming for Symbolic Regression
    Raymond, Christian
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    GECCO'20: PROCEEDINGS OF THE 2020 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2020, : 1003 - 1011
  • [42] Decomposition based cross-parallel multiobjective genetic programming for symbolic regression
    Fan, Lei
    Su, Zhaobing
    Liu, Xiyang
    Wang, Yuping
    APPLIED SOFT COMPUTING, 2024, 167
  • [43] Customized prediction of attendance to soccer matches based on symbolic regression and genetic programming
    Yamashita, Gabrielli H.
    Fogliatto, Flavio S.
    Anzanello, Michel J.
    Tortorella, Guilherme L.
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
  • [44] Improving Model-Based Genetic Programming for Symbolic Regression of Small Expressions
    Virgolin, M.
    Alderliesten, T.
    Witteveen, C.
    Bosman, P. A. N.
    EVOLUTIONARY COMPUTATION, 2021, 29 (02) : 211 - 237
  • [45] Preserving Population Diversity Based on Transformed Semantics in Genetic Programming for Symbolic Regression
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2021, 25 (03) : 433 - 447
  • [46] Differential Evolution for Instance based Transfer Learning in Genetic Programming for Symbolic Regression
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 161 - 162
  • [47] Predicting Friction System Performance with Symbolic Regression and Genetic Programming with Factor Variables
    Kronberger, Gabriel
    Kommenda, Michael
    Promberger, Andreas
    Nickel, Falk
    GECCO'18: PROCEEDINGS OF THE 2018 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2018, : 1278 - 1285
  • [48] EFFECTS OF MUTATION BEFORE AND AFTER OFFSPRING SELECTION IN GENETIC PROGRAMMING FOR SYMBOLIC REGRESSION
    Kronberger, Gabriel K.
    Winkler, Stephan M.
    Affenzeller, Michael
    Kommenda, Michael
    Wagner, Stefan
    22ND EUROPEAN MODELING AND SIMULATION SYMPOSIUM (EMSS 2010), 2010, : 37 - 42
  • [49] SYMBOLIC REGRESSION VIA GENETIC PROGRAMMING AS A DISCOVERY ENGINE: INSIGHTS ON OUTLIERS AND PROTOTYPES
    Kotanchek, Mark E.
    Vladislavleva, Ekaterina Y.
    Smits, Guido F.
    GENETIC PROGRAMMING THEORY AND PRACTICE VII, 2010, : 55 - +
  • [50] Improving Genetic Programming Based Symbolic Regression Using Deterministic Machine Learning
    Icke, Ilknur
    Bongard, Joshua C.
    2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2013, : 1763 - 1770