Identifying (Quasi) Equally Informative Subsets in Feature Selection Problems for Classification: A Max-Relevance Min-Redundancy Approach

被引：46

作者：

Karakaya, Gulsah ^{[1
]}

Galelli, Stefano ^{[1
]}

Ahipasaoglu, Selin Damla ^{[1
]}

Taormina, Riccardo ^{[1
]}

机构：

[1] Singapore Univ Technol & Design, Pillar Engn Syst & Design, Singapore 487372, Singapore

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2016年 / 46卷 / 06期

关键词：

Classification algorithms; extreme learning machine; feature selection; multiobjective optimization; neural networks; redundancy; relevance; EXTREME LEARNING-MACHINE; VARIABLE SELECTION; MUTUAL INFORMATION; ALGORITHM; OPTIMIZATION; DEPENDENCY;

D O I：

10.1109/TCYB.2015.2444435

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

An emerging trend in feature selection is the development of two-objective algorithms that analyze the tradeoff between the number of features and the classification performance of the model built with these features. Since these two objectives are conflicting, a typical result stands in a set of Pareto-efficient subsets, each having a different cardinality and a corresponding discriminating power. However, this approach overlooks the fact that, for a given cardinality, there can be several subsets with similar information content. The study reported here addresses this problem, and introduces a novel multiobjective feature selection approach conceived to identify: 1) a subset that maximizes the performance of a given classifier and 2) a set of subsets that are quasi equally informative, i.e., have almost same classification performance, to the performance maximizing subset. The approach consists of a wrapper [Wrapper for Quasi Equally Informative Subset Selection (W-QEISS)] built on the formulation of a four-objective optimization problem, which is aimed at maximizing the accuracy of a classifier, minimizing the number of features, and optimizing two entropy-based measures of relevance and redundancy. This allows conducting the search in a larger space, thus enabling the wrapper to generate a large number of Pareto-efficient solutions. The algorithm is compared against the mRMR algorithm, a two-objective wrapper and a computationally efficient filter [Filter for Quasi Equally Informative Subset Selection (F-QEISS)] on 24 University of California, Irvine, (UCI) datasets including both binary and multiclass classification. Experimental results show that W-QEISS has the capability of evolving a rich and diverse set of Pareto-efficient solutions, and that their availability helps in: 1) studying the tradeoff between multiple measures of classification performance and 2) understanding the relative importance of each feature. The quasi equally informative subsets are identified at the cost of a marginal increase in the computational time thanks to the adoption of Borg Multiobjective Evolutionary Algorithm and Extreme Learning Machine as global optimization and learning algorithms, respectively.

引用

页码：1424 / 1437

页数：14

共 20 条

[1] Fuzzy Mutual Information Based min-Redundancy and Max-Relevance Heterogeneous Feature Selection
Yu D.
An S.
Hu Q.
International Journal of Computational Intelligence Systems, 2011, 4 (4) : 619 - 633
[2] Fuzzy Mutual Information Based min-Redundancy and Max-Relevance Heterogeneous Feature Selection
Yu, Daren
An, Shuang
Hu, Qinghua
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2011, 4 (04) : 619 - 633
[3] Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy
Peng, HC
Long, FH
Ding, C
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (08) : 1226 - 1238
[4] A Novel Gene Selection Method Based on Sparse Representation and Max-Relevance and Min-Redundancy
Chen, Min
He, Xiaoming
Duan, ShaoBin
Deng, YingWei
COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2017, 20 (02) : 158 - 163
[5] The max-relevance and min-redundancy greedy Bayesian network learning algorithm
Liu, Feng
Zhu, QiLiang
BIO-INSPIRED MODELING OF COGNITIVE TASKS, PT 1, PROCEEDINGS, 2007, 4527 : 346 - +
[6] Selective classifier chains based on max-relevance and min-redundancy for multi-label classification
Huang, Ge
Yang, Youlong
Bai, Jing
IAENG International Journal of Computer Science, 2017, 44 (03) : 327 - 336
[7] Wavelength selection method for near-infrared spectroscopy based on Max-Relevance Min-Redundancy
Ma, Xiao-Hui
Chen, Zheng-Guang
Liu, Jin-Ming
SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2024, 310
[8] Research on PSO-SVM base wine grade recognition based on Max-Relevance and Min-Redundancy feature selection
Yang, Jiaping
Ren, Dongping
Liu, Yong
Zhou, Hailong
Sun, Yunquan
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (17):
[9] On fuzzy-rough attribute selection: Criteria of Max-Dependency, Max-Relevance, Min-Redundancy, and Max-Significance
Maji, Pradipta
Garai, Partha
APPLIED SOFT COMPUTING, 2013, 13 (09) : 3968 - 3980
[10] A novel sub-models selection algorithm based on max-relevance and min-redundancy neighborhood mutual information
Xiao, Ling
Wang, Chen
Dong, Yunxuan
Wang, Jianzhou
INFORMATION SCIENCES, 2019, 486 : 310 - 339

← 1 2 →