Symbolic Regression via Control Variable Genetic Programming

Cited by: 2
Authors
Jiang, Nan [1]
Xue, Yexiang [1]
Affiliations
[1] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
Source
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT IV | 2023, Vol. 14172
Keywords
Control Variable Experiment; Symbolic Regression; ALGORITHMS; DISCOVERY;
DOI
10.1007/978-3-031-43421-1_11
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Learning symbolic expressions directly from experiment data is a vital step in AI-driven scientific discovery. Nevertheless, state-of-the-art approaches are limited to learning simple expressions. Regressing expressions involving many independent variables still remains out of reach. Motivated by the control variable experiments widely utilized in science, we propose Control Variable Genetic Programming (CVGP) for symbolic regression over many independent variables. CVGP expedites symbolic expression discovery via customized experiment design, rather than learning from a fixed dataset collected a priori. CVGP starts by fitting simple expressions involving a small set of independent variables using genetic programming, under controlled experiments where the other variables are held constant. It then extends the expressions learned in previous generations by adding new independent variables, using new control variable experiments in which these variables are allowed to vary. Theoretically, we show that CVGP, as an incremental building approach, can yield an exponential reduction in the search space when learning a class of expressions. Experimentally, CVGP outperforms several baselines in learning symbolic expressions involving multiple independent variables.
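The control variable experiment described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the ground-truth function `oracle`, the helper names, and the sampling ranges are all hypothetical, and the genetic programming search is replaced by a plain least-squares fit of a fixed template `a*x1 + b`, purely to show what data collected with one variable held constant looks like.

```python
import numpy as np

def oracle(x):
    """Hypothetical ground truth, unknown to the learner: y = 3*x1*x2 + x2."""
    return 3.0 * x[:, 0] * x[:, 1] + x[:, 1]

def controlled_experiment(free, n_vars, n_samples, rng):
    """Sample inputs in which only the `free` variables vary.

    Every other variable is held at one random constant for the whole trial.
    """
    x = np.tile(rng.uniform(1.0, 2.0, size=n_vars), (n_samples, 1))
    x[:, free] = rng.uniform(1.0, 2.0, size=(n_samples, len(free)))
    return x, oracle(x)

def fit_linear_in_x1(x, y):
    """Fit y ~ a*x1 + b by least squares (an assumed stand-in for the GP search)."""
    design = np.column_stack([x[:, 0], np.ones(len(x))])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef

rng = np.random.default_rng(0)
trials = []
for _ in range(3):
    # Each trial holds x2 at a different constant and lets only x1 vary.
    x, y = controlled_experiment([0], n_vars=2, n_samples=20, rng=rng)
    a, b = fit_linear_in_x1(x, y)
    trials.append((x[0, 1], a, b))

for x2_const, a, b in trials:
    print(f"x2 held at {x2_const:.3f}: a = {a:.3f}, b = {b:.3f}")
```

In this toy setting the fitted coefficients change from trial to trial, which indicates that they depend on the controlled variable `x2`. A later stage would then release `x2` and extend the expression, in the spirit of the incremental procedure the abstract describes.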
Pages: 178-195
Page count: 18
Related papers
50 in total
  • [41] Evolvability Degeneration in Multi-Objective Genetic Programming for Symbolic Regression
    Liu, Dazhuang
    Virgolin, Marco
    Alderliesten, Tanja
    Bosman, Peter A. N.
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'22), 2022, : 973 - 981
  • [42] Denoising Autoencoder Genetic Programming for Real-World Symbolic Regression
    Wittenberg, David
    Rothlauf, Franz
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 612 - 614
  • [43] Improving Generalisation of Genetic Programming for Symbolic Regression with Structural Risk Minimisation
    Chen, Qi
    Xue, Bing
    Shang, Lin
    Zhang, Mengjie
    GECCO'16: PROCEEDINGS OF THE 2016 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2016, : 709 - 716
  • [44] Adaptive Weighted Splines - A New Representation to Genetic Programming for Symbolic Regression
    Raymond, Christian
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    GECCO'20: PROCEEDINGS OF THE 2020 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2020, : 1003 - 1011
  • [45] Multivariate linear regression versus symbolic regression from genetic programming. Application to the spectroscopic characterisation of urban wastewater
    Carreres-Prieto, Daniel
    Garcia, Juan T.
    Castillo, Luis G.
    Carrillo, Jose M.
    Vigueras-Rodriguez, Antonio
    INGENIERIA DEL AGUA, 2022, 26 (04): : 261 - 277
  • [46] Decomposition based cross-parallel multiobjective genetic programming for symbolic regression
    Fan, Lei
    Su, Zhaobing
    Liu, Xiyang
    Wang, Yuping
    APPLIED SOFT COMPUTING, 2024, 167
  • [47] Customized prediction of attendance to soccer matches based on symbolic regression and genetic programming
    Yamashita, Gabrielli H.
    Fogliatto, Flavio S.
    Anzanello, Michel J.
    Tortorella, Guilherme L.
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
  • [48] A comparison of fitness-case sampling methods for symbolic regression with genetic programming
    Martínez, Yuliana
    Trujillo, Leonardo
    Naredo, Enrique
    Legrand, Pierrick
    Advances in Intelligent Systems and Computing, 2014, 288 : 201 - 212
  • [49] Improving Model-Based Genetic Programming for Symbolic Regression of Small Expressions
    Virgolin, M.
    Alderliesten, T.
    Witteveen, C.
    Bosman, P. A. N.
    EVOLUTIONARY COMPUTATION, 2021, 29 (02) : 211 - 237
  • [50] Preserving Population Diversity Based on Transformed Semantics in Genetic Programming for Symbolic Regression
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2021, 25 (03) : 433 - 447