Wrapper Framework for Test-Cost-Sensitive Feature Selection

Cited by: 41
Authors
Jiang, Liangxiao [1]
Kong, Ganggang [2]
Li, Chaoqun [3]
Affiliations
[1] China Univ Geosci, Dept Comp Sci, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430074, Peoples R China
[3] China Univ Geosci, Dept Math, Wuhan 430074, Peoples R China
Source
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2021, Vol. 51, No. 3
Keywords
Feature extraction; Optimization; Support vector machines; Geology; Training; Medical diagnosis; Data mining; Classification accuracy; decision making; feature selection; test cost; test-cost-sensitive learning
DOI
10.1109/TSMC.2019.2904662
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Feature selection is an optional preprocessing step that is frequently used to improve the classification accuracy of a machine learning algorithm by removing irrelevant and/or redundant features. In many real-world applications, however, making optimal decisions requires accounting for the test cost as well as the classification accuracy. To the best of our knowledge, few studies have so far addressed test-cost-sensitive feature selection (TCSFS). TCSFS has two objectives: 1) to improve the classification accuracy and 2) to decrease the test cost. It therefore constitutes a multiobjective optimization problem. In this paper, we transform this multiobjective problem into a single-objective optimization problem by means of a new evaluation function, and we propose a new general wrapper framework for TCSFS. Specifically, our framework adds a new term to the evaluation function of a wrapper feature selection method so that the test cost of measuring features is taken into account. We tested the framework experimentally on 36 classification problems from the University of California at Irvine (UCI) repository and compared it with other state-of-the-art feature selection frameworks. The experimental results show that our framework allows users to select an optimal feature subset with minimal test cost while maintaining high classification accuracy.
Pages: 1747-1756
Number of pages: 10
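The abstract describes adding a test-cost term to a wrapper's evaluation function but does not give its exact form. The following is a minimal Python sketch of that general idea, not the authors' method. It assumes a linear penalty (accuracy minus a weight times the selected features' share of the total test cost), and it uses a leave-one-out 1-nearest-neighbour classifier and a greedy forward search as stand-ins. All function names, the `weight` parameter, and the toy data are hypothetical.

```python
from typing import List, Sequence, Tuple


def loo_1nn_accuracy(rows: Sequence[Sequence[float]],
                     labels: Sequence[int],
                     subset: Sequence[int]) -> float:
    """Leave-one-out accuracy of a 1-NN classifier restricted to `subset`."""
    if not subset:
        return 0.0  # no features measured, so no prediction is possible
    correct = 0
    for i, row in enumerate(rows):
        best_j, best_d = -1, float("inf")
        for j, other in enumerate(rows):
            if i == j:
                continue
            d = sum((row[f] - other[f]) ** 2 for f in subset)
            if d < best_d:
                best_d, best_j = d, j
        correct += labels[best_j] == labels[i]
    return correct / len(rows)


def evaluate(rows, labels, subset, costs, weight: float) -> float:
    """Assumed single-objective score: accuracy minus a weighted cost share."""
    cost_share = sum(costs[f] for f in subset) / sum(costs)
    return loo_1nn_accuracy(rows, labels, subset) - weight * cost_share


def forward_select(rows, labels, costs, weight: float) -> Tuple[List[int], float]:
    """Greedy forward search: add the best feature while the score improves."""
    selected: List[int] = []
    best = evaluate(rows, labels, selected, costs, weight)
    improved = True
    while improved:
        improved = False
        for f in range(len(costs)):
            if f in selected:
                continue
            score = evaluate(rows, labels, selected + [f], costs, weight)
            if score > best:
                best, candidate, improved = score, f, True
        if improved:
            selected.append(candidate)
    return selected, best


# Toy data (hypothetical): features 0 and 1 both separate the classes,
# feature 2 is noise. Feature 0 is ten times as expensive to measure.
rows = [
    [0.0, 0.1, 5.0],
    [0.1, 0.0, 1.0],
    [0.2, 0.2, 3.0],
    [1.0, 1.1, 1.1],
    [1.1, 1.0, 4.9],
    [1.2, 1.2, 3.1],
]
labels = [0, 0, 0, 1, 1, 1]
costs = [10.0, 1.0, 1.0]

print(forward_select(rows, labels, costs, weight=0.0))  # cost ignored
print(forward_select(rows, labels, costs, weight=0.5))  # cost penalized
```

With the penalty switched off, the search settles on the first fully accurate feature it meets, which here is the expensive one. With the penalty on, it prefers the equally accurate but cheaper feature. The weight that trades accuracy against cost is a tuning choice in this sketch.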