Cost-sensitive selection of variables by ensemble of model sequences

被引:0
|
作者
Yan, Donghui [1 ,2 ]
Qin, Zhiwei [3 ]
Gu, Songxiang [4 ]
Xu, Haiping [5 ]
Shao, Ming [5 ]
机构
[1] Univ Massachusetts, Dept Math, Dartmouth, MA 02747 USA
[2] Univ Massachusetts, Program Data Sci, Dartmouth, MA 02747 USA
[3] DiDi Res Amer, Mountain View, CA USA
[4] JD Digital, Mountain View, CA USA
[5] Univ Massachusetts, Dept Comp & Informat Sci, Dartmouth, MA USA
关键词
Metrics selection; Cost-sensitive; Budget; Ensemble; Model schedule; Classification; REGULARIZATION; REGRESSION;
D O I
10.1007/s10115-021-01551-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many applications require the collection of data on different variables or measurements over many system performance metrics. We term those broadly as measures or variables. Often data collection along each measure incurs a cost, thus it is desirable to consider the cost of measures in modeling. This is a fairly new class of problems in the area of cost-sensitive learning. A few attempts have been made to incorporate costs in combining and selecting measures. However, existing studies either do not strictly enforce a budget constraint, or are not the 'most' cost effective. With a focus on classification problems, we propose a computationally efficient approach that could find a near optimal model under a given budget by exploring the most 'promising' part of the solution space. Instead of outputting a single model, we produce a model schedule-a list of models, sorted by model costs and expected predictive accuracy. This could be used to choose the model with the best predictive accuracy under a given budget, or to trade off between the budget and the predictive accuracy. Experiments on some benchmark datasets show that our approach compares favorably to competing methods.
引用
收藏
页码:1069 / 1092
页数:24
相关论文
共 50 条
  • [1] Cost-sensitive selection of variables by ensemble of model sequences
    Donghui Yan
    Zhiwei Qin
    Songxiang Gu
    Haiping Xu
    Ming Shao
    Knowledge and Information Systems, 2021, 63 : 1069 - 1092
  • [2] Evolutionary Cost-Sensitive Ensemble for Malware Detection
    Krawczyk, Bartosz
    Wozniak, Michal
    INTERNATIONAL JOINT CONFERENCE SOCO'14-CISIS'14-ICEUTE'14, 2014, 299 : 433 - 442
  • [3] Cost-sensitive and ensemble-based prediction model for outsourced software project risk prediction
    Hu, Yong
    Feng, Bin
    Mo, Xizhu
    Zhang, Xiangzhou
    Ngai, E. W. T.
    Fan, Ming
    Liu, Mei
    DECISION SUPPORT SYSTEMS, 2015, 72 : 11 - 23
  • [4] A hybrid cost-sensitive ensemble for heart disease prediction
    Qi Zhenya
    Zhang, Zuoru
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
  • [5] A hybrid cost-sensitive ensemble for heart disease prediction
    Qi Zhenya
    Zuoru Zhang
    BMC Medical Informatics and Decision Making, 21
  • [6] Cost-Sensitive Sequences of Bregman Divergences
    Santos-Rodriguez, Raul
    Cid-Sueiro, Jesus
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (12) : 1896 - 1904
  • [7] Cost-sensitive ensemble learning: a unifying framework
    Petrides, George
    Verbeke, Wouter
    DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 36 (01) : 1 - 28
  • [8] Cost-sensitive ensemble learning: a unifying framework
    George Petrides
    Wouter Verbeke
    Data Mining and Knowledge Discovery, 2022, 36 : 1 - 28
  • [9] Cost-Sensitive Active Learning for Incomplete Data
    Wang, Min
    Yang, Chunyu
    Zhao, Fei
    Min, Fan
    Wang, Xizhao
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (01): : 405 - 416
  • [10] A cost-sensitive constrained Lasso
    Blanquero, Rafael
    Carrizosa, Emilio
    Ramirez-Cobo, Pepa
    Remedios Sillero-Denamiel, M.
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2021, 15 (01) : 121 - 158