Recent advances in feature selection and its applications

被引:1
作者
Yun Li
Tao Li
Huan Liu
机构
[1] Nanjing University of Posts and Telecommunications,School of Computer Science and Technology
[2] Nanjing University of Posts and Telecommunications,Jiangsu Key Laboratory of Big Data Security and Intelligent Processing
[3] Florida International University,School of Computer Science
[4] Arizona State University,School of Computing, Informatics, and Decision Systems Engineering
来源
Knowledge and Information Systems | 2017年 / 53卷
关键词
Feature selection; Survey; Data mining;
D O I
暂无
中图分类号
学科分类号
摘要
Feature selection is one of the key problems for machine learning and data mining. In this review paper, a brief historical background of the field is given, followed by a selection of challenges which are of particular current interests, such as feature selection for high-dimensional small sample size data, large-scale data, and secure feature selection. Along with these challenges, some hot topics for feature selection have emerged, e.g., stable feature selection, multi-view feature selection, distributed feature selection, multi-label feature selection, online feature selection, and adversarial feature selection. Then, the recent advances of these topics are surveyed in this paper. For each topic, the existing problems are analyzed, and then, current solutions to these problems are presented and discussed. Besides the topics, some representative applications of feature selection are also introduced, such as applications in bioinformatics, social media, and multimedia retrieval.
引用
收藏
页码:551 / 577
页数:26
相关论文
共 148 条
  • [1] Guyon I(2003)An introduction to variable and feature selection J Mach Learn Res 31 1157-1182
  • [2] Elisseeff A(2005)Toward integrating feature selection algorithms for classification and clustering IEEE Trans Knowl Data Eng 17 494-502
  • [3] Liu H(1968)On the mean accuracy of statistical pattern recognizers IEEE Trans Inf Theory 14 55-63
  • [4] Yu L(1984)Selection of subsets of regression variables J R Stat Soc 147 389-425
  • [5] Hughes GF(1997)Wrappers for feature subset selection Artif Intell 97 273-324
  • [6] Miller AJ(2004)Filter versus wrapper gene selection approaches in DNA microarray domains Artif Intell Med 31 91-103
  • [7] Kohavi R(2003)An extensive empirical study of feature selection metrics for text classification J Mach Learn Res 3 1289-1305
  • [8] John G(1992)Training a 3-node neural networks is NP-complete Neural Netw 5 117-127
  • [9] Inza I(1999)Molecular classification of cancer: class discovery and class prediction by gene expression monitoring Science 286 531-537
  • [10] Larranaga P(2002)Gene expression correlates of clinical prostate cancer behavior Cancer Cell 2 203-209