Genetic Programming-based Construction of Features for Machine Learning and Knowledge Discovery Tasks

被引:118
作者
Krzysztof Krawiec
机构
[1] Poznań University of Technology,Institute of Computing Science
关键词
genetic programming; machine learning; change of representation; feature construction; feature selection;
D O I
10.1023/A:1020984725014
中图分类号
学科分类号
摘要
In this paper we use genetic programming for changing the representation of the input data for machine learners. In particular, the topic of interest here is feature construction in the learning-from-examples paradigm, where new features are built based on the original set of attributes. The paper first introduces the general framework for GP-based feature construction. Then, an extended approach is proposed where the useful components of representation (features) are preserved during an evolutionary run, as opposed to the standard approach where valuable features are often lost during search. Finally, we present and discuss the results of an extensive computational experiment carried out on several reference data sets. The outcomes show that classifiers induced using the representation enriched by the GP-constructed features provide better accuracy of classification on the test set. In particular, the extended approach proposed in the paper proved to be able to outperform the standard approach on some benchmark problems on a statistically significant level.
引用
收藏
页码:329 / 343
页数:14
相关论文
共 26 条
[1]  
Brameier M.(2001)Evolving teams of predictors with linear genetic programming Genetic Programming and Evolvable Machines 2 381-407
[2]  
Banzhaf W.(1997)Feature selection for classification Intelligent Data Analysis 1 131-156
[3]  
Dash M.(1993)Using genetic algorithms for concept learning Machine Learning 13 161-188
[4]  
Liu H.(2000)Application of genetic programming for multicategory pattern classification IEEE Trans. Evolutionary Comp. 4 242-258
[5]  
De Jong K. A.(2000)Evolutionary weighting of image features for diagnosing of CNS tumors Artif. Intell. in Medicine 19 25-38
[6]  
Spears W. M.(1997)Wrappers for feature subset selection Artif. Intell. Journal 1 273-324
[7]  
Gordon D. F.(2000)Acomparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms Machine Learning 40 203-228
[8]  
Kishore J. K.(1983)Atheory and methodology of inductive learning Artificial Intelligence 20 111-161
[9]  
Patnaik L. M.(2000)Dimensionality reduction using genetic algorithm IEEE Trans. on Evolutionary Computation 4 164-171
[10]  
Mani V.(1998)Feature subset selection using a genetic algorithm IEEE Intelligent Systems (Special Issue on Feature Transformation and Subset Selection) 13 44-49