Constrained class-wise feature selection (CCFS)

被引:0
|
作者
Syed Fawad Hussain
Fatima Shahzadi
Badre Munir
机构
[1] G.I.K. Institute of Engineering Sciences and Technology,Machine Learning and Data Science Lab (MDS)
[2] G.I.K. Institute,undefined
来源
International Journal of Machine Learning and Cybernetics | 2022年 / 13卷
关键词
Feature selection; Information theory; Classification; Class-wise feature selection;
D O I
暂无
中图分类号
学科分类号
摘要
Feature selection plays a vital role as a preprocessing step for high dimensional data in machine learning. The basic purpose of feature selection is to avoid “curse of dimensionality” and reduce time and space complexity of training data. Several techniques, including those that use information theory, have been proposed in the literature as a means to measure the information content of a feature. Most of them incrementally select features with max dependency with the category but minimum redundancy with already selected features. A key missing idea in these techniques is the fair representation of features with max dependency among the different categories, i.e., skewed selection of features having high mutual information (MI) with a particular class. This can result in a biased classification in favor of that particular class while other classes have low matching scores during classification. We propose a novel approach based on information theory that selects features in a class-wise fashion rather than based on their global max dependency. In addition, a constrained search is used instead of a global sequential forward search. We prove that our proposed approach enhances Maximum Relevance while keeping Minimum Redundancy under a constrained search. Results on multiple benchmark datasets show that our proposed method improves accuracy as compared to other state-of-the-art feature selection algorithms while having a lower time complexity.
引用
收藏
页码:3211 / 3224
页数:13
相关论文
共 50 条
  • [31] Online Feature Selection of Class Imbalance via PA Algorithm
    Han, Chao
    Tan, Yun-Kun
    Zhu, Jin-Hui
    Guo, Yong
    Chen, Jian
    Wu, Qing-Yao
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2016, 31 (04) : 673 - 682
  • [32] Resolving class imbalance and feature selection in customer churn dataset
    Hanif, Aamer
    Azhar, Noor
    2017 INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY (FIT), 2017, : 82 - 86
  • [33] A novel feature selection method to predict protein structural class
    Yuan, Mingshun
    Yang, Zijiang
    Huang, Guangzao
    Ji, Guoli
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2018, 76 : 118 - 129
  • [34] Cost-Sensitive Feature Selection for Class Imbalance Problem
    Bach, Malgorzata
    Werner, Aleksandra
    INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, PT I, 2018, 655 : 182 - 194
  • [35] Multi-Class Feature Selection Using Pairwise-class and All-class Techniques
    Chen, Bo
    Li, Guo-Zheng
    You, Mingyu
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2010, : 644 - 647
  • [36] A binary-constrained Geometric Semantic Genetic Programming for feature selection purposes
    Papa, Joao Paulo
    Rosa, Gustavo Henrique
    Papa, Luciene Patrici
    PATTERN RECOGNITION LETTERS, 2017, 100 : 59 - 66
  • [37] Feature selection considering the composition of feature relevancy
    Gao, Wanfu
    Hu, Liang
    Zhang, Ping
    He, Jialong
    PATTERN RECOGNITION LETTERS, 2018, 112 : 70 - 74
  • [38] A New Feature Selection Based on Class Dependency and Feature Dissimilarity
    Claypo, Niphat
    Jaiyen, Saichon
    2015 2ND INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS: CONCEPTS, THEORY AND APPLICATIONS ICAICTA, 2015,
  • [39] Budget constrained non-monotonic feature selection
    Yang, Haiqin
    Xu, Zenglin
    Lyu, Michael R.
    King, Irwin
    NEURAL NETWORKS, 2015, 71 : 214 - 224
  • [40] Towards an accurate sleep apnea detection based on ECG signal: The quintessential of a wise feature selection
    Pinho, Andre
    Pombo, Nuno
    Silva, Bruno M. C.
    Bousson, Kouamana
    Garcia, Nuno
    APPLIED SOFT COMPUTING, 2019, 83