Hybridization of feature selection and feature weighting for high dimensional data

被引:0
作者
Dalwinder Singh
Birmohan Singh
机构
[1] Department of Computer Science and Engineering,
[2] Sant Longowal Institute of Engineering and Technology,undefined
[3] Longowal,undefined
来源
Applied Intelligence | 2019年 / 49卷
关键词
Feature selection; Feature weighting; Hybrid method; Optimization algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
The classification of high dimensional data is a challenging problem due to the presence of redundant and irrelevant features in a higher amount. These unwanted features degrade accuracy and increase the computational complexity of machine learning algorithms. In this paper, we propose a hybrid method that integrates the complementary strengths of feature selection and feature weighting approaches for improving the classification of high dimensional data on the Nearest Neighbor classifier. Specifically, we suggest four strategies that combine filter and wrapper methods of feature selection and feature weighting. Experiments are performed on 12 high dimensional datasets and outcomes are supported by Friedman as well as Holm statistical tests for validation. Extended Adjusted Ratio of Ratios is used to recognize the best method considering accuracy, feature selection, and runtime. The results show that two proposed strategies outperform other well-known methods in accuracy and features reduction. The hybrid feature selection-feature weighting wrapper method is the best among all in accuracy while the hybrid feature selection filter-feature weighting wrapper method is the most suitable for reducing features and runtime. Thus, the promising outcomes validate the importance of hybridizing feature selection and feature weighting while dealing with high dimensional data.
引用
收藏
页码:1580 / 1596
页数:16
相关论文
共 170 条
  • [1] Jain AK(2000)Statistical pattern recognition: a review IEEE Trans Pattern Anal Mach Intell 22 4-37
  • [2] Duin RPW(1968)On the mean accuracy of statistical pattern recognizers IEEE Trans Inform Theory 14 55-63
  • [3] Mao J(2004)Efficient feature selection via analysis of relevance and redundancy J Mach Learn Res 5 1205-1224
  • [4] Hughes G(2018)High-dimensional hybrid feature selection using interaction information-guided search Knowl-Based Syst 145 59-66
  • [5] Yu L(2015)Simultaneous instance and feature selection and weighting using evolutionary computation: proposal and study Appl Soft Comput 37 416-443
  • [6] Liu H(2013)Online feature selection with streaming features IEEE Trans Pattern Anal Mach Intell 35 1178-1192
  • [7] Nakariyakul S(2016)Lofs: a library of online streaming feature selection Knowl-Based Syst 113 1-3
  • [8] Pérez-Rodríguez J(2000)Dimensionality reduction using genetic algorithms IEEE Trans Evol Comput 4 164-171
  • [9] Arroyo-Peña AG(1997)A review and empirical evaluation of feature weighting methods for a class of lazy learning algorithms Artif Intell Rev 11 273-314
  • [10] García-Pedrajas N(2007)Iterative RELIEF for feature weighting: algorithms, theories, and applications IEEE Trans Pattern Anal Mach Intell 29 1035-1051