An Optimization-Based Method for Feature Ranking in Nonlinear Regression Problems

被引:9
作者
Bravi, Luca [1 ]
Piccialli, Veronica [2 ]
Sciandrone, Marco [1 ]
机构
[1] Univ Florence, Dipartimento Ingn Informaz, I-50139 Florence, Italy
[2] Univ Roma Tor Vergata, Dipartimento Ingn Civile & Ingn Informat, I-00173 Rome, Italy
关键词
Concave approximation of the zero-norm function; feature ranking; global optimization; inversion of a neural network; FEEDFORWARD NEURAL-NETWORKS; FEATURE-SELECTION; CLASSIFICATION;
D O I
10.1109/TNNLS.2015.2504957
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this brief, we consider the feature ranking problem, where, given a set of training instances, the task is to associate a score with the features in order to assess their relevance. Feature ranking is a very important tool for decision support systems, and may be used as an auxiliary step of feature selection to reduce the high dimensionality of real-world data. We focus on regression problems by assuming that the process underlying the generated data can be approximated by a continuous function (for instance, a feedforward neural network). We formally state the notion of relevance of a feature by introducing a minimum zero-norm inversion problem of a neural network, which is a nonsmooth, constrained optimization problem. We employ a concave approximation of the zero-norm function, and we define a smooth, global optimization problem to be solved in order to assess the relevance of the features. We present the new feature ranking method based on the solution of instances of the global optimization problem depending on the available training data. Computational experiments on both artificial and real data sets are performed, and point out that the proposed feature ranking method is a valid alternative to existing methods in terms of effectiveness. The obtained results also show that the method is costly in terms of CPU time, and this may be a limitation in the solution of large-dimensional problems.
引用
收藏
页码:1005 / 1010
页数:6
相关论文
共 32 条
  • [1] GMDH-based feature ranking and selection for improved classification of medical data
    Abdel-Aal, RE
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2005, 38 (06) : 456 - 468
  • [2] Toward a gold standard for promoter prediction evaluation
    Abeel, Thomas
    Van de Peer, Yves
    Saeys, Yvan
    [J]. BIOINFORMATICS, 2009, 25 (12) : I313 - I320
  • [3] On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems
    Amaldi, E
    Kann, V
    [J]. THEORETICAL COMPUTER SCIENCE, 1998, 209 (1-2) : 237 - 260
  • [4] [Anonymous], 1997, P 14 INT C MACH LEAR
  • [5] Feature selection for nonlinear models with extreme learning machines
    Benoit, Frenay
    van Heeswijk, Mark
    Miche, Yoan
    Verleysen, Michel
    Lendasse, Amaury
    [J]. NEUROCOMPUTING, 2013, 102 : 111 - 124
  • [6] Bertsekas, 1982, COMPUTER SCI APPL MA
  • [7] Bi J., 2003, Journal of Machine Learning Research, V3, P1229, DOI 10.1162/153244303322753643
  • [8] Bishop CM, 1995, Neural Networks for Pattern Recognition
  • [9] Selection of relevant features and examples in machine learning
    Blum, AL
    Langley, P
    [J]. ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) : 245 - 271
  • [10] A review of feature selection methods on synthetic data
    Bolon-Canedo, Veronica
    Sanchez-Marono, Noelia
    Alonso-Betanzos, Amparo
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 34 (03) : 483 - 519