A comprehensive survey on feature selection in the various fields of machine learning

Cited by: 225
Authors
Dhal, Pradip [1]
Azad, Chandrashekhar [1]
Affiliations
[1] Natl Inst Technol, Dept Computer Applicat, Jamshedpur, Bihar, India
Funding
UK Research and Innovation (UKRI)
Keywords
Feature selection; Classification; Machine learning; Particle swarm optimization; 2-stage feature selection; Gene-expression data; Feature extraction; Colony optimization; Information gain; Big data; Recognition; Speech
DOI
10.1007/s10489-021-02550-9
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In Machine Learning (ML), Feature Selection (FS) plays a crucial part in reducing the dimensionality of data and enhancing the performance of any proposed framework. In real-world applications, however, FS work must contend with high dimensionality, computational and storage complexity, noisy or ambiguous data, the demand for high performance, etc. The area of FS is vast and inherently challenging, and a large body of work on FS has been reported across various application areas. This paper discusses the framework of FS and its multiple models with detailed descriptions. We also classify the various FS algorithms with respect to the data, i.e., structured or labeled data and unstructured data, for the different applications of ML. We further discuss what essential features are, the commonly used FS methods, the widely used datasets, and the widely referenced work done in the various ML fields for the FS task. We also review comparative experimental results of FS work in separate result discussions. The paper thus offers a descriptive survey of FS together with its associated real-world problem domains. Its main objective is to convey the main idea of FS work and to identify how FS can be applied in various problem domains.
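As a rough illustration of the filter-style FS that the abstract and keywords (information gain) allude to, the sketch below ranks discrete features by information gain and keeps the top k. It is not taken from the paper: the function names (`entropy`, `information_gain`, `select_top_k`) and the toy data are hypothetical, chosen only to show how dropping uninformative features reduces dimensionality.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of discrete labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(feature_values, labels):
    """Reduction in label entropy after grouping samples by a discrete feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def select_top_k(rows, labels, k):
    """Return (indices of the k highest-scoring features, all scores)."""
    n_features = len(rows[0])
    scores = [
        information_gain([row[j] for row in rows], labels)
        for j in range(n_features)
    ]
    ranked = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return ranked[:k], scores


# Hypothetical toy data: feature 0 determines the label,
# feature 1 is unrelated to the label, feature 2 is constant.
rows = [
    (0, 0, 1),
    (0, 1, 1),
    (1, 0, 1),
    (1, 1, 1),
]
labels = [0, 0, 1, 1]

selected, scores = select_top_k(rows, labels, k=1)
print("selected feature indices:", selected)
print("information gain per feature:", scores)
```

A filter method like this scores each feature independently of any classifier, which keeps it cheap on high-dimensional data but means it can miss features that are only useful in combination.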
Pages: 4543-4581
Number of pages: 39