An Optimization-Based Method for Feature Ranking in Nonlinear Regression Problems

被引：9

作者：

Bravi, Luca ^{[1
]}

Piccialli, Veronica ^{[2
]}

Sciandrone, Marco ^{[1
]}

机构：

[1] Univ Florence, Dipartimento Ingn Informaz, I-50139 Florence, Italy

[2] Univ Roma Tor Vergata, Dipartimento Ingn Civile & Ingn Informat, I-00173 Rome, Italy

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2017年 / 28卷 / 04期

关键词：

Concave approximation of the zero-norm function; feature ranking; global optimization; inversion of a neural network; FEEDFORWARD NEURAL-NETWORKS; FEATURE-SELECTION; CLASSIFICATION;

D O I：

10.1109/TNNLS.2015.2504957

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this brief, we consider the feature ranking problem, where, given a set of training instances, the task is to associate a score with the features in order to assess their relevance. Feature ranking is a very important tool for decision support systems, and may be used as an auxiliary step of feature selection to reduce the high dimensionality of real-world data. We focus on regression problems by assuming that the process underlying the generated data can be approximated by a continuous function (for instance, a feedforward neural network). We formally state the notion of relevance of a feature by introducing a minimum zero-norm inversion problem of a neural network, which is a nonsmooth, constrained optimization problem. We employ a concave approximation of the zero-norm function, and we define a smooth, global optimization problem to be solved in order to assess the relevance of the features. We present the new feature ranking method based on the solution of instances of the global optimization problem depending on the available training data. Computational experiments on both artificial and real data sets are performed, and point out that the proposed feature ranking method is a valid alternative to existing methods in terms of effectiveness. The obtained results also show that the method is costly in terms of CPU time, and this may be a limitation in the solution of large-dimensional problems.

引用

页码：1005 / 1010

页数：6

共 32 条

[1] GMDH-based feature ranking and selection for improved classification of medical data
Abdel-Aal, RE
[J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2005, 38 (06) : 456 - 468
[2] Toward a gold standard for promoter prediction evaluation
Abeel, Thomas
Van de Peer, Yves
Saeys, Yvan
[J]. BIOINFORMATICS, 2009, 25 (12) : I313 - I320
[3] On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems
Amaldi, E
Kann, V
[J]. THEORETICAL COMPUTER SCIENCE, 1998, 209 (1-2) : 237 - 260
[4] [Anonymous], 1997, P 14 INT C MACH LEAR
[5] Feature selection for nonlinear models with extreme learning machines
Benoit, Frenay
van Heeswijk, Mark
Miche, Yoan
Verleysen, Michel
Lendasse, Amaury
[J]. NEUROCOMPUTING, 2013, 102 : 111 - 124
[6] Bertsekas, 1982, COMPUTER SCI APPL MA
[7] Bi J., 2003, Journal of Machine Learning Research, V3, P1229, DOI 10.1162/153244303322753643
[8] Bishop CM, 1995, Neural Networks for Pattern Recognition
[9] Selection of relevant features and examples in machine learning
Blum, AL
Langley, P
[J]. ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) : 245 - 271
[10] A review of feature selection methods on synthetic data
Bolon-Canedo, Veronica
Sanchez-Marono, Noelia
Alonso-Betanzos, Amparo
[J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 34 (03) : 483 - 519

← 1 2 3 4 →