GPPE: a method to generate ad-hoc feature extractors for prediction in financial domains

Cited by: 8
Authors
Estebanez, Cesar [1]
Valls, Jose M. [1]
Aler, Ricardo [1]
Affiliations
[1] Univ Carlos III Madrid, Madrid 28911, Spain
Keywords
genetic programming; projections; attribute construction; dimensionality reduction
DOI
10.1007/s10489-007-0048-0
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
When dealing with classification and regression problems, there is a strong need for high-quality attributes. This is a capital issue not only in financial problems, but in many Data Mining domains. Constructive Induction methods help to overcome this problem by mapping the original representation into a new one, where prediction becomes easier. In this work we present GPPE: a GP-based method that projects data from an original data space into another one where data approaches linear behavior (linear separability or linear regression). Also, GPPE is able to reduce the dimensionality of the problem by recombining related attributes and discarding irrelevant ones. We have applied GPPE to two financial domains: Bankruptcy prediction and IPO Underpricing prediction. In both cases GPPE automatically generated a new data representation that obtained competitive prediction rates and drastically reduced the dimensionality of the problem.
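The projection idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the genetic programming search is replaced by a hand-written list of candidate projections, and the synthetic data, the `threshold_accuracy` fitness function, and all names are assumptions made for illustration. Each candidate maps two original attributes to one new attribute and is scored by how well a linear rule (a single threshold) separates the classes in the projected space.

```python
import random


def threshold_accuracy(values, labels):
    """Best accuracy of a single threshold on 1-D values, in either orientation."""
    n = len(values)
    best = 0.0
    for t in values:
        # Predict class 1 when the projected value is at or above the threshold.
        correct = sum((v >= t) == (y == 1) for v, y in zip(values, labels))
        acc = correct / n
        # 1 - acc is the accuracy of the same threshold with the classes swapped.
        best = max(best, acc, 1.0 - acc)
    return best


# Synthetic two-attribute data (assumed): class 1 lies inside the unit circle,
# so no straight line separates the classes in the original space.
rng = random.Random(0)
points = [(rng.uniform(-2.0, 2.0), rng.uniform(-2.0, 2.0)) for _ in range(200)]
labels = [1 if x * x + y * y < 1.0 else 0 for x, y in points]

# Hypothetical candidate projections; a GP system would evolve such expressions.
candidates = {
    "x0": lambda p: p[0],
    "x0 + x1": lambda p: p[0] + p[1],
    "x0*x0 + x1*x1": lambda p: p[0] * p[0] + p[1] * p[1],
}

# Fitness of a projection: separability of the projected data by a linear rule.
fitness = {
    name: threshold_accuracy([f(p) for p in points], labels)
    for name, f in candidates.items()
}
best_name = max(fitness, key=fitness.get)
print(best_name, fitness[best_name])
```

In this sketch the winning projection also replaces two attributes with one, which mirrors, in a very reduced form, the dimensionality reduction the abstract describes.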
Pages: 174-185
Number of pages: 12