Semantic genetic programming for fast and accurate data knowledge discovery

被引:14
作者
Castelli, Mauro [1 ]
Vanneschi, Leonardo [1 ]
Manzoni, Luca [2 ]
Popovic, Ales [1 ,3 ]
机构
[1] Univ Nova Lisboa, NOVA IMS, P-1070312 Lisbon, Portugal
[2] Univ Milano Bicocca, DISCo, I-20126 Milan, Italy
[3] Univ Ljubljana, Fac Econ, Ljubljana 1000, Slovenia
关键词
Genetic programming; Semantics; Knowledge discovery;
D O I
10.1016/j.swevo.2015.07.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big data knowledge discovery emerged as an important factor contributing to advancements in society at large. Still, researchers continuously seek to advance existing methods and provide novel ones for analysing vast data sets to make sense of the data, extract useful information, and build knowledge to inform decision making. In the last few years, a very promising variant of genetic programming was proposed: geometric semantic genetic programming. Its difference with the standard version of genetic programming consists in the fact that it uses new genetic operators, called geometric semantic operators, that, acting directly on the semantics of the candidate solutions, induce by definition a unimodal error surface on any supervised learning problem, independently from the complexity and size of the underlying data set. This property should improve the evolvability of genetic programming in presence of big data and thus makes geometric semantic genetic programming an extremely promising method for mining vast amounts of data. Nevertheless, to the best of our knowledge, no contribution has appeared so far to employ this new technology to big data problems. This paper intends to fill this gap. For the first time, in fact, we show the effectiveness of geometric semantic genetic programming on several complex real-life problems, characterized by vast amounts of data, coming from several different application domains. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 7
页数:7
相关论文
共 27 条
[1]   A modification to classical evolutionary programming by shifting strategy parameters [J].
Alipouri, Yousef ;
Poshtan, Javad ;
Alipouri, Yagub .
APPLIED INTELLIGENCE, 2013, 38 (02) :175-192
[2]  
[Anonymous], 2013, P 7 ACM C RECOMMENDE
[3]  
[Anonymous], 2015, GENET PROGRAM EVOL M, DOI [DOI 10.1007/S10710-014-9218-0, 10.1007/s10710-014-9218-0]
[4]  
Brown Eric, 2015, COMMUNICATION
[5]  
Castelli Mauro, 2015, INT J BIOIN IN PRESS
[6]   Comparing extended classifier system and genetic programming for financial forecasting: an empirical study [J].
Chen, Mu-Yen ;
Chen, Kuang-Ku ;
Chiang, Heien-Kun ;
Huang, Hwa-Shan ;
Huang, Mu-Jung .
SOFT COMPUTING, 2007, 11 (12) :1173-1183
[7]  
Condie T., 2013, SIGMOD '13 Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, P939, DOI DOI 10.1145/2463676.2465338
[8]  
Cozzone AJ., 2010, PROTEINS FUNDAMENTAL
[9]  
Cristianini N., 2000, An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods
[10]   A Survey on the Application of Genetic Programming to Classification [J].
Espejo, Pedro G. ;
Ventura, Sebastian ;
Herrera, Francisco .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2010, 40 (02) :121-144