Evaluation of a novel GA-based methodology for model structure selection: The GA-PARSIMONY

被引:15
作者
Urraca, R. [1 ]
Sodupe-Ortega, E. [1 ]
Antonanzas, J. [1 ]
Antonanzas-Torres, F. [1 ]
Martinez-de-Pison, F. J. [1 ]
机构
[1] Univ La Rioja, Dept Mech Engn, EDMANS Grp, Logrono 26004, Spain
关键词
Genetic algorithms; Parameter tuning; Feature selection; Parsimony criterion; Model comparative; ANT COLONY OPTIMIZATION; PARAMETER OPTIMIZATION; SOLAR IRRADIATION; FUZZY CONTROL; CLASSIFICATION; ALGORITHMS; MACHINES; SYSTEM; LOGIC; PSO;
D O I
10.1016/j.neucom.2016.08.154
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most proposed metaheuristics for feature selection and model parameter optimization are based on a two-termed Loss + Penalty function. Their main drawback is the need of a manual set of the parameter that balances between the loss and the penalty term. In this paper, a novel methodology referred as the GA-PARSIMONY and specifically designed to overcome this issue is evaluated in detail in thirteen public databases with five regression techniques. It is a GA-based meta-heuristic that splits the classic two-termed minimization functions by making two consecutive ranks of individuals. The first rank is based solely on the generalization error, while the second (named ReRank) is based on the complexity of the models, giving a special weight to the complexity entailed by large number of inputs. For each database, models with lowest testing RMSE and without statistical difference among them were referred as winner models. Within this group, the number of features selected was below 50%, which proves an optimal balance between error minimization and parsimony. Particularly, the most complex algorithms (MLP and SVR) were mostly selected in the group of winner models, while using around 40-45% of the available attributes. The most basic IBk, ridge regression (LIN) and M5P were only classified as winner models in the simpler databases, but using less number of features in those cases (up to a 20-25% of the initial inputs). (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:9 / 17
页数:9
相关论文
共 45 条
  • [1] AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
  • [2] An integrated PSO for parameter determination and feature selection of ELM and its application in classification of power system disturbances
    Ahila, R.
    Sadasivam, V.
    Manimala, K.
    [J]. APPLIED SOFT COMPUTING, 2015, 32 : 23 - 37
  • [3] [Anonymous], 2009, SIGKDD Explorations, DOI DOI 10.1145/1656274.1656278
  • [4] [Anonymous], 2014, INT JOINT C SOCO13 C
  • [5] Solar irradiation mapping with exogenous data from support vector regression machines estimations
    Antonanzas, J.
    Urraca, R.
    Martinez-de-Pison, F. J.
    Antonanzas-Torres, F.
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2015, 100 : 380 - 390
  • [6] Generation of daily global solar irradiation with support vector machines for regression
    Antonanzas-Torres, F.
    Urraca, R.
    Antonanzas, J.
    Fernandez-Ceniceros, J.
    Martinez-de-Pison, F. J.
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2015, 96 : 277 - 286
  • [7] Evolutionary algorithm characterization in real parameter optimization problems
    Caamano, Pilar
    Bellas, Francisco
    Becerra, Jose A.
    Duro, Richard J.
    [J]. APPLIED SOFT COMPUTING, 2013, 13 (04) : 1902 - 1921
  • [8] A new approach for dynamic fuzzy logic parameter tuning in Ant Colony Optimization and its application in fuzzy control of a mobile robot
    Castillo, Oscar
    Neyoy, Hector
    Soria, Jose
    Melin, Patricia
    Valdez, Fevrier
    [J]. APPLIED SOFT COMPUTING, 2015, 28 : 150 - 159
  • [9] New approach using ant colony optimization with ant set partition for fuzzy control design applied to the ball and beam system
    Castillo, Oscar
    Lizarraga, Evelia
    Soria, Jose
    Melin, Patricia
    Valdez, Fevrier
    [J]. INFORMATION SCIENCES, 2015, 294 : 203 - 215
  • [10] Genetic algorithms tuned expert model for detection of epileptic seizures from EEG signatures
    Dhiman, Rohtash
    Saini, J. S.
    Priyanka
    [J]. APPLIED SOFT COMPUTING, 2014, 19 : 8 - 17