Fast Bi-Objective Feature Selection Using Entropy Measures and Bayesian Inference

被引:3
|
作者
Mei, Yi [1 ]
Xue, Bing [1 ]
Zhang, Mengjie [1 ]
机构
[1] Victoria Univ Wellington, Sch Engn & CS, Wellington, New Zealand
来源
GECCO'16: PROCEEDINGS OF THE 2016 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE | 2016年
关键词
Feature Selection; Multi-Objective Computation; Generalization; MUTUAL INFORMATION; OPTIMIZATION; ALGORITHM; CLASSIFICATION; SIMILARITY;
D O I
10.1145/2908812.2908823
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The entropy measures have been used in feature selection for decades, and showed competitive performance. In general, the problem aims at minimizing the conditional entropy of the class label on the selected features. However, the generalization of the entropy measures has been neglected in literature. Specifically, the use of conditional entropy has two critical issues. First, the empirical conditional distribution of the class label may have a low confidence and thus is unreliable. Second, there may not be enough training instances for the selected features, and it is highly likely to encounter new examples in the test set. To address these issues, a bi-objective optimization model with a modified entropy measure called the Bayesian entropy is proposed. This model considers the confidence of the optimized conditional entropy value as well as the conditional entropy value itself. As a result, it produces multiple feature subsets with different trade-offs between the entropy value and its confidence. The experimental results demonstrate that by solving the proposed optimization model with the new entropy measure, the number of features can be dramatically reduced within a much shorter time than the existing algorithms. Furthermore, similar or even better classification accuracy was achieved for most test problems.
引用
收藏
页码:469 / 476
页数:8
相关论文
共 50 条
  • [21] A binary individual search strategy-based bi-objective evolutionary algorithm for high-dimensional feature selection
    Li, Tao
    Zhan, Zhi-Hui
    Xu, Jiu-Cheng
    Yang, Qiang
    Ma, Yuan-Yuan
    INFORMATION SCIENCES, 2022, 610 : 651 - 673
  • [22] Bayesian inference for infinite asymmetric Gaussian mixture with feature selection
    Song, Ziyang
    Ali, Samr
    Bouguila, Nizar
    SOFT COMPUTING, 2021, 25 (08) : 6043 - 6053
  • [23] Analysis of the GRNs Inference by Using Tsallis Entropy and a Feature Selection Approach
    Lopes, Fabricio M.
    de Oliveira, Evaldo A.
    Cesar-, Roberto M., Jr.
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, PROCEEDINGS, 2009, 5856 : 473 - 480
  • [24] A Population Initialization Method Based on Similarity and Mutual Information in Evolutionary Algorithm for Bi-Objective Feature Selection
    Cai, Xu
    Xue, Yu
    ACM Transactions on Evolutionary Learning and Optimization, 2024, 4 (03):
  • [25] A nonparametric Bayesian learning model using accelerated variational inference and feature selection
    Fan, Wentao
    Bouguila, Nizar
    Liu, Xin
    PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (01) : 63 - 74
  • [26] A nonparametric Bayesian learning model using accelerated variational inference and feature selection
    Wentao Fan
    Nizar Bouguila
    Xin Liu
    Pattern Analysis and Applications, 2019, 22 : 63 - 74
  • [27] Feature selection using rough entropy-based uncertainty measures in incomplete decision systems
    Sun, Lin
    Xu, Jiucheng
    Tian, Yun
    KNOWLEDGE-BASED SYSTEMS, 2012, 36 : 206 - 216
  • [28] Multi-objective feature selection using a Bayesian artificial immune system
    Castro, Pablo A. D.
    Von Zuben, Fernando J.
    INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2010, 3 (02) : 235 - 256
  • [29] Feature Selection Using Fuzzy Objective Functions
    Vieira, Susana M.
    Sousa, Joao M. C.
    Kaymak, Uzay
    PROCEEDINGS OF THE JOINT 2009 INTERNATIONAL FUZZY SYSTEMS ASSOCIATION WORLD CONGRESS AND 2009 EUROPEAN SOCIETY OF FUZZY LOGIC AND TECHNOLOGY CONFERENCE, 2009, : 1673 - 1678
  • [30] Distributed multi-label feature selection using individual mutual information measures
    Gonzalez-Lopez, Jorge
    Ventura, Sebastian
    Cano, Alberto
    KNOWLEDGE-BASED SYSTEMS, 2020, 188 (188)