Transfer learning in constructive induction with Genetic Programming

Cited by: 24
Authors
Munoz, Luis [1]
Trujillo, Leonardo [1]
Silva, Sara [2]
Affiliations
[1] Tecnol Nacl Mexico IT Tijuana, Dept Ingn Elect & Elect, Ciencias Ingn, Blvd Ind & Ave ITR Tijuana S-N, Tijuana 22500, BC, Mexico
[2] Univ Lisbon, Fac Ciencias, LASIGE, Lisbon, Portugal
Keywords
Transfer learning; Constructive induction of features; Genetic Programming; SYMBOLIC REGRESSION; CLASSIFICATION; PERFORMANCE; ALGORITHMS; TOOL
DOI
10.1007/s10710-019-09368-y
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Transfer learning (TL) is the process by which some aspects of a machine learning model generated on a source task are transferred to a target task, to simplify the learning required to solve the target. TL in Genetic Programming (GP) has not received much attention, since it is normally assumed that an evolved symbolic expression is specifically tailored to a problem's data and thus cannot be used in other problems. The goal of this work is to present a broad and diverse study of TL in GP, considering a varied set of source and target tasks, and dealing with questions that have received little or no attention in previous GP literature. In particular, this work studies the performance of transferred solutions when the source and target tasks are from different domains, and when they do not share a similar input feature space. Additionally, the relationship between the success and failure of transferred solutions is studied, considering different source and target tasks. Finally, the predictability of TL performance is analyzed for the first time in GP literature. The study is carried out using GP-based constructive induction of features, a wrapper-based approach in which GP constructs feature transformations and an additional learning algorithm fits the final model. The experimental work presents several notable results and contributions. First, TL is capable of generating solutions that outperform, in many cases, baseline methods in classification and regression tasks. Second, it is shown that some problems are good source problems while others are good targets in a TL system. Third, the transferability of solutions is not necessarily symmetric between two problems. Finally, results show that it is possible to predict the success of TL in some cases, particularly in classification tasks.
Pages: 529-569
Number of pages: 41