Adaptive Weighted Splines - A New Representation to Genetic Programming for Symbolic Regression

Cited by: 5
Authors
Raymond, Christian [1]
Chen, Qi [1]
Xue, Bing [1]
Zhang, Mengjie [1]
Affiliations
[1] Victoria Univ Wellington, Wellington, New Zealand
Source
GECCO'20: PROCEEDINGS OF THE 2020 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE | 2020
Keywords
Genetic Programming; Symbolic Regression; Generalization; Representation; Spline
DOI
10.1145/3377930.3390244
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Genetic Programming for Symbolic Regression is often prone to overfitting the training data, resulting in poor generalization on unseen data. To address this issue, much research has been devoted to regularization via controlling model complexity. However, due to the unstructured tree-based representation of individuals, the model complexity cannot be computed directly; instead, it must be approximated. This paper proposes a novel representation called Adaptive Weighted Splines, which enables explicit control over the complexity of individuals using splines. The experimental results confirm that this new representation is significantly better than the tree-based representation at avoiding overfitting and generalizing to unseen data, demonstrating notably better and far more consistent generalization performance on all the benchmark problems. Further analysis also shows that, in most cases, the new Genetic Programming method outperforms classical regression techniques such as Linear Regression, Support Vector Regression, K-Nearest Neighbour and Decision Tree Regression, and performs competitively with the state-of-the-art ensemble regression methods Random Forests and Gradient Boosting.
Pages: 1003-1011
Page count: 9