Identifying (Quasi) Equally Informative Subsets in Feature Selection Problems for Classification: A Max-Relevance Min-Redundancy Approach

被引:46
|
作者
Karakaya, Gulsah [1 ]
Galelli, Stefano [1 ]
Ahipasaoglu, Selin Damla [1 ]
Taormina, Riccardo [1 ]
机构
[1] Singapore Univ Technol & Design, Pillar Engn Syst & Design, Singapore 487372, Singapore
关键词
Classification algorithms; extreme learning machine; feature selection; multiobjective optimization; neural networks; redundancy; relevance; EXTREME LEARNING-MACHINE; VARIABLE SELECTION; MUTUAL INFORMATION; ALGORITHM; OPTIMIZATION; DEPENDENCY;
D O I
10.1109/TCYB.2015.2444435
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An emerging trend in feature selection is the development of two-objective algorithms that analyze the tradeoff between the number of features and the classification performance of the model built with these features. Since these two objectives are conflicting, a typical result stands in a set of Pareto-efficient subsets, each having a different cardinality and a corresponding discriminating power. However, this approach overlooks the fact that, for a given cardinality, there can be several subsets with similar information content. The study reported here addresses this problem, and introduces a novel multiobjective feature selection approach conceived to identify: 1) a subset that maximizes the performance of a given classifier and 2) a set of subsets that are quasi equally informative, i.e., have almost same classification performance, to the performance maximizing subset. The approach consists of a wrapper [Wrapper for Quasi Equally Informative Subset Selection (W-QEISS)] built on the formulation of a four-objective optimization problem, which is aimed at maximizing the accuracy of a classifier, minimizing the number of features, and optimizing two entropy-based measures of relevance and redundancy. This allows conducting the search in a larger space, thus enabling the wrapper to generate a large number of Pareto-efficient solutions. The algorithm is compared against the mRMR algorithm, a two-objective wrapper and a computationally efficient filter [Filter for Quasi Equally Informative Subset Selection (F-QEISS)] on 24 University of California, Irvine, (UCI) datasets including both binary and multiclass classification. Experimental results show that W-QEISS has the capability of evolving a rich and diverse set of Pareto-efficient solutions, and that their availability helps in: 1) studying the tradeoff between multiple measures of classification performance and 2) understanding the relative importance of each feature. The quasi equally informative subsets are identified at the cost of a marginal increase in the computational time thanks to the adoption of Borg Multiobjective Evolutionary Algorithm and Extreme Learning Machine as global optimization and learning algorithms, respectively.
引用
收藏
页码:1424 / 1437
页数:14
相关论文
共 20 条
  • [1] Fuzzy Mutual Information Based min-Redundancy and Max-Relevance Heterogeneous Feature Selection
    Yu D.
    An S.
    Hu Q.
    International Journal of Computational Intelligence Systems, 2011, 4 (4) : 619 - 633
  • [2] Fuzzy Mutual Information Based min-Redundancy and Max-Relevance Heterogeneous Feature Selection
    Yu, Daren
    An, Shuang
    Hu, Qinghua
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2011, 4 (04) : 619 - 633
  • [3] Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy
    Peng, HC
    Long, FH
    Ding, C
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (08) : 1226 - 1238
  • [4] A Novel Gene Selection Method Based on Sparse Representation and Max-Relevance and Min-Redundancy
    Chen, Min
    He, Xiaoming
    Duan, ShaoBin
    Deng, YingWei
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2017, 20 (02) : 158 - 163
  • [5] The max-relevance and min-redundancy greedy Bayesian network learning algorithm
    Liu, Feng
    Zhu, QiLiang
    BIO-INSPIRED MODELING OF COGNITIVE TASKS, PT 1, PROCEEDINGS, 2007, 4527 : 346 - +
  • [6] Selective classifier chains based on max-relevance and min-redundancy for multi-label classification
    Huang, Ge
    Yang, Youlong
    Bai, Jing
    IAENG International Journal of Computer Science, 2017, 44 (03) : 327 - 336
  • [7] Wavelength selection method for near-infrared spectroscopy based on Max-Relevance Min-Redundancy
    Ma, Xiao-Hui
    Chen, Zheng-Guang
    Liu, Jin-Ming
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2024, 310
  • [8] Research on PSO-SVM base wine grade recognition based on Max-Relevance and Min-Redundancy feature selection
    Yang, Jiaping
    Ren, Dongping
    Liu, Yong
    Zhou, Hailong
    Sun, Yunquan
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (17):
  • [9] On fuzzy-rough attribute selection: Criteria of Max-Dependency, Max-Relevance, Min-Redundancy, and Max-Significance
    Maji, Pradipta
    Garai, Partha
    APPLIED SOFT COMPUTING, 2013, 13 (09) : 3968 - 3980
  • [10] A novel sub-models selection algorithm based on max-relevance and min-redundancy neighborhood mutual information
    Xiao, Ling
    Wang, Chen
    Dong, Yunxuan
    Wang, Jianzhou
    INFORMATION SCIENCES, 2019, 486 : 310 - 339