Semi-greedy heuristics for feature selection with test cost constraints

被引:34
|
作者
Min F. [1 ]
Xu J. [1 ]
机构
[1] School of Computer Science, Southwest Petroleum University, Chengdu
基金
中国国家自然科学基金;
关键词
Feature selection; Granular computing; Semi-greedy; Test cost constraint;
D O I
10.1007/s41066-016-0017-2
中图分类号
学科分类号
摘要
In real-world applications, the test cost of data collection should not exceed a given budget. The problem of selecting an informative feature subset under this budget is referred to as feature selection with test cost constraints. Greedy heuristics are a natural and efficient method for this kind of combinatorial optimization problem. However, the recursive selection of locally optimal choices means that the global optimum is often missed. In this paper, we present a three-step semi-greedy heuristic method that directly forms a population of candidate solutions to obtain better results. In the first step, we design the heuristic function. The second step involves the random selection of a feature from the current best k features at each iteration. This is the major difference from conventional greedy heuristics. In the third step, we obtain p candidate solutions and select the best one. Through a series of experiments on four datasets, we compare our algorithm with a classic greedy heuristic approach and an information gain-based λ-weighted greedy heuristic method. The results show that the new approach is more likely to obtain optimal solutions. © 2016, Springer International Publishing Switzerland.
引用
收藏
页码:199 / 211
页数:12
相关论文
共 50 条
  • [1] Application of Greedy Heuristics for Feature Characterisation and Selection: A Case Study in Stylometric Domain
    Stanczyk, Urszula
    Zielosko, Beata
    Zabinski, Krzysztof
    ROUGH SETS, IJCRS 2018, 2018, 11103 : 350 - 362
  • [2] Feature selection with test cost constraint
    Min, Fan
    Hu, Qinghua
    Zhu, William
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2014, 55 (01) : 167 - 179
  • [3] A semi-parallel framework for greedy information-theoretic feature selection
    Liu, Heng
    Ditzler, Gregory
    INFORMATION SCIENCES, 2019, 492 : 13 - 28
  • [4] Feature Selection with Test Cost Constraint through a Simulated Annealing Algorithm
    Niu, Junxia
    Zhao, Hong
    Zhu, William
    JOURNAL OF INTERNET TECHNOLOGY, 2016, 17 (06): : 1133 - 1140
  • [5] Neural Greedy Pursuit for Feature Selection
    Das, Sandipan
    Javid, Alireza M.
    Gohain, Prakash Borpatra
    Eldar, Yonina C.
    Chatterjee, Saikat
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [6] Cost-Constrained feature selection in binary classification: adaptations for greedy forward selection and genetic algorithms
    Rudolf Jagdhuber
    Michel Lang
    Arnulf Stenzl
    Jochen Neuhaus
    Jörg Rahnenführer
    BMC Bioinformatics, 21
  • [7] Cost-Constrained feature selection in binary classification: adaptations for greedy forward selection and genetic algorithms
    Jagdhuber, Rudolf
    Lang, Michel
    Stenzl, Arnulf
    Neuhaus, Jochen
    Rahnenfuehrer, Joerg
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [8] Greedy Binary Search and Feature Subset Selection
    Han, Myung-Mook
    Li, Dong-hui
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2009, 12 (06): : 1379 - 1395
  • [9] Efficient greedy feature selection for unsupervised learning
    Ahmed K. Farahat
    Ali Ghodsi
    Mohamed S. Kamel
    Knowledge and Information Systems, 2013, 35 : 285 - 310
  • [10] Wrapper Framework for Test-Cost-Sensitive Feature Selection
    Jiang, Liangxiao
    Kong, Ganggang
    Li, Chaoqun
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (03): : 1747 - 1756