Feature Selection in High Dimensional Data: A Review

被引:1
作者
Silaich, Sarita [1 ]
Gupta, Suneet [2 ]
机构
[1] Govt Polytech Coll Jhunjhunu, Dept Comp Sci & Engn, Jhunjhunu, India
[2] Mody Univ Laxmangarh, CSE Dept, Sikar, India
来源
THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1 | 2023年 / 608卷
关键词
Feature selection; Filter; Wrapper; Embedded; High dimensional data; Machine learning;
D O I
10.1007/978-981-19-9225-4_51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
By choosing an ideal subset of the total features, feature selection in machine learning is essential to reducing the quantity of the data and increasing classifier performance. Nowadays, the size of data is increasing exponentially in fields like text classification, microarray data, bioinformatics, gene expression, information retrieval, etc. In high dimensional or big data, the learning model's predictions are not accurate because of noisy or irrelevant features, so there is a challenge to reduce the data dimensionality. This paper introduces the concepts of feature relevance, relevant feature selection, and evaluation criteria. An overview and comparison of existing feature selection methods for various application domains are also done.
引用
收藏
页码:703 / 717
页数:15
相关论文
共 24 条
[1]   Metaheuristic Algorithms on Feature Selection: A Survey of One Decade of Research (2009-2019) [J].
Agrawal, Prachi ;
Abutarboush, Hattan F. ;
Ganesh, Talari ;
Mohamed, Ali Wagdy .
IEEE ACCESS, 2021, 9 :26766-26791
[2]   Approaches to Multi-Objective Feature Selection: A Systematic Literature Review [J].
Al-Tashi, Qasem ;
Abdulkadir, Said Jadid ;
Rais, Helmi Md ;
Mirjalili, Seyedali ;
Alhussian, Hitham .
IEEE ACCESS, 2020, 8 :125076-125096
[3]  
Aman G, 2020, FEATURE SELECTION TE
[4]   Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection [J].
Ang, Jun Chin ;
Mirzal, Andri ;
Haron, Habibollah ;
Hamed, Haza Nuzly Abdull .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (05) :971-989
[5]   Benchmark for filter methods for feature selection in high-dimensional classification data [J].
Bommert, Andrea ;
Sun, Xudong ;
Bischl, Bernd ;
Rahnenfuehrer, Joerg ;
Lang, Michel .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 143
[6]   Selecting critical features for data classification based on machine learning methods [J].
Chen, Rung-Ching ;
Dewi, Christine ;
Huang, Su-Wen ;
Caraka, Rezzy Eko .
JOURNAL OF BIG DATA, 2020, 7 (01)
[7]   Gene selection for tumor classification using a novel bio-inspired multi-objective approach [J].
Dashtban, M. ;
Balafar, Mohammadali ;
Suravajhala, Prashanth .
GENOMICS, 2018, 110 (01) :10-17
[8]   Feature Selection Based on Structured Sparsity: A Comprehensive Study [J].
Gui, Jie ;
Sun, Zhenan ;
Ji, Shuiwang ;
Tao, Dacheng ;
Tan, Tieniu .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (07) :1490-1507
[9]   Pareto front feature selection based on artificial bee colony optimization [J].
Hancer, Emrah ;
Xue, Bing ;
Zhang, Mengjie ;
Karaboga, Dervis ;
Akay, Bahriye .
INFORMATION SCIENCES, 2018, 422 :462-479
[10]   MIFS-ND: A mutual information-based feature selection method [J].
Hoque, N. ;
Bhattacharyya, D. K. ;
Kalita, J. K. .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (14) :6371-6385