Semi-greedy heuristics for feature selection with test cost constraints

Cited by: 34
Authors
Min F. [1]
Xu J. [1]
Affiliation
[1] School of Computer Science, Southwest Petroleum University, Chengdu
Funding
National Natural Science Foundation of China
Keywords
Feature selection; Granular computing; Semi-greedy; Test cost constraint
DOI
10.1007/s41066-016-0017-2
Abstract
In real-world applications, the test cost of data collection should not exceed a given budget. The problem of selecting an informative feature subset under this budget is referred to as feature selection with test cost constraints. Greedy heuristics are a natural and efficient method for this kind of combinatorial optimization problem. However, the recursive selection of locally optimal choices means that the global optimum is often missed. In this paper, we present a three-step semi-greedy heuristic method that directly forms a population of candidate solutions to obtain better results. In the first step, we design the heuristic function. The second step involves the random selection of a feature from the current best k features at each iteration. This is the major difference from conventional greedy heuristics. In the third step, we obtain p candidate solutions and select the best one. Through a series of experiments on four datasets, we compare our algorithm with a classic greedy heuristic approach and an information gain-based λ-weighted greedy heuristic method. The results show that the new approach is more likely to obtain optimal solutions. © 2016, Springer International Publishing Switzerland.
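The three steps described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `semi_greedy_select`, the `quality` callback, the heuristic `gain * cost ** lam` (with `lam <= 0` penalising expensive tests), and the toy data are all hypothetical choices made for this example. Test costs are assumed to be strictly positive.

```python
import random
from typing import Callable, Dict, FrozenSet, List, Optional, Tuple


def semi_greedy_select(
    costs: Dict[str, float],
    budget: float,
    quality: Callable[[FrozenSet[str]], float],
    k: int = 3,
    p: int = 20,
    lam: float = -1.0,
    seed: Optional[int] = None,
) -> FrozenSet[str]:
    """Build p candidate subsets semi-greedily and return the best one."""
    rng = random.Random(seed)

    def build_one() -> FrozenSet[str]:
        selected: FrozenSet[str] = frozenset()
        spent = 0.0
        while True:
            base = quality(selected)
            scored: List[Tuple[float, str]] = []
            for feature, cost in costs.items():
                # Skip features already chosen or not affordable.
                if feature in selected or spent + cost > budget:
                    continue
                gain = quality(selected | {feature}) - base
                if gain > 0:
                    # Step 1 (assumed heuristic): gain weighted by cost ** lam.
                    scored.append((gain * cost ** lam, feature))
            if not scored:
                return selected
            # Step 2: pick at random among the current best k features.
            scored.sort(reverse=True)
            _, pick = rng.choice(scored[:k])
            selected = selected | {pick}
            spent += costs[pick]

    # Step 3: generate p candidate solutions and keep the best.
    candidates = [build_one() for _ in range(p)]
    return max(candidates, key=quality)


# Hypothetical toy data: additive feature values and test costs.
VALUES = {"a": 0.6, "b": 0.4, "c": 0.4, "d": 0.05}
COSTS = {"a": 4.0, "b": 3.0, "c": 3.0, "d": 1.0}


def toy_quality(subset: FrozenSet[str]) -> float:
    return sum(VALUES[f] for f in subset)


if __name__ == "__main__":
    # k=1, p=1 reduces to a plain greedy heuristic.
    print(sorted(semi_greedy_select(COSTS, 6.0, toy_quality, k=1, p=1, seed=0)))
    print(sorted(semi_greedy_select(COSTS, 6.0, toy_quality, k=3, p=50, seed=0)))
```

In this toy setting, the plain greedy run commits to the feature with the best gain-to-cost ratio and can no longer afford the better pair, whereas sampling among the top k features gives some candidates the chance to skip that first choice. Larger k and p widen the search at the cost of more evaluations of `quality`.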
Pages: 199-211
Page count: 12