Recent advances in feature selection and its applications

被引:269
作者
Li, Yun [1 ,2 ]
Li, Tao [1 ,2 ,3 ]
Liu, Huan [4 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci & Technol, Nanjing, Jiangsu, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing, Jiangsu, Peoples R China
[3] Florida Int Univ, Sch Comp Sci, Miami, FL 33199 USA
[4] Arizona State Univ, Sch Comp Informat & Decis Syst Engn, Tempe, AZ USA
关键词
Feature selection; Survey; Data mining; ONLINE FEATURE-SELECTION; GENE SELECTION; CLASSIFICATION; CANCER; REGRESSION; RELEVANCE; SECURITY;
D O I
10.1007/s10115-017-1059-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is one of the key problems for machine learning and data mining. In this review paper, a brief historical background of the field is given, followed by a selection of challenges which are of particular current interests, such as feature selection for high-dimensional small sample size data, large-scale data, and secure feature selection. Along with these challenges, some hot topics for feature selection have emerged, e.g., stable feature selection, multi-view feature selection, distributed feature selection, multi-label feature selection, online feature selection, and adversarial feature selection. Then, the recent advances of these topics are surveyed in this paper. For each topic, the existing problems are analyzed, and then, current solutions to these problems are presented and discussed. Besides the topics, some representative applications of feature selection are also introduced, such as applications in bioinformatics, social media, and multimedia retrieval.
引用
收藏
页码:551 / 577
页数:27
相关论文
共 104 条
  • [1] Robust biomarker identification for cancer diagnosis with ensemble feature selection methods
    Abeel, Thomas
    Helleputte, Thibault
    Van de Peer, Yves
    Dupont, Pierre
    Saeys, Yvan
    [J]. BIOINFORMATICS, 2010, 26 (03) : 392 - 398
  • [2] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [3] [Anonymous], 2010, P 13 INT C ART INT S
  • [4] [Anonymous], P INT C MACH LEARN
  • [5] [Anonymous], 2014, ADV NEURAL INFORM PR
  • [6] [Anonymous], 2012, P 18 ACM SIGKDD INT
  • [7] [Anonymous], P 32 INT C MACH LEAR
  • [8] [Anonymous], 2014, Advances in Neural Information Processing Systems.
  • [9] [Anonymous], 2006, Journal of the Royal Statistical Society, Series B
  • [10] [Anonymous], P SIAM INT C DAT MIN