Cost-sensitive selection of variables by ensemble of model sequences

被引:0
|
作者
Yan, Donghui [1 ,2 ]
Qin, Zhiwei [3 ]
Gu, Songxiang [4 ]
Xu, Haiping [5 ]
Shao, Ming [5 ]
机构
[1] Univ Massachusetts, Dept Math, Dartmouth, MA 02747 USA
[2] Univ Massachusetts, Program Data Sci, Dartmouth, MA 02747 USA
[3] DiDi Res Amer, Mountain View, CA USA
[4] JD Digital, Mountain View, CA USA
[5] Univ Massachusetts, Dept Comp & Informat Sci, Dartmouth, MA USA
关键词
Metrics selection; Cost-sensitive; Budget; Ensemble; Model schedule; Classification; REGULARIZATION; REGRESSION;
D O I
10.1007/s10115-021-01551-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many applications require the collection of data on different variables or measurements over many system performance metrics. We term those broadly as measures or variables. Often data collection along each measure incurs a cost, thus it is desirable to consider the cost of measures in modeling. This is a fairly new class of problems in the area of cost-sensitive learning. A few attempts have been made to incorporate costs in combining and selecting measures. However, existing studies either do not strictly enforce a budget constraint, or are not the 'most' cost effective. With a focus on classification problems, we propose a computationally efficient approach that could find a near optimal model under a given budget by exploring the most 'promising' part of the solution space. Instead of outputting a single model, we produce a model schedule-a list of models, sorted by model costs and expected predictive accuracy. This could be used to choose the model with the best predictive accuracy under a given budget, or to trade off between the budget and the predictive accuracy. Experiments on some benchmark datasets show that our approach compares favorably to competing methods.
引用
收藏
页码:1069 / 1092
页数:24
相关论文
共 50 条
  • [41] A Cost-Sensitive Feature Selection Method for High-Dimensional Data
    An, Chaojie
    Zhou, Qifeng
    14TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND EDUCATION (ICCSE 2019), 2019, : 1089 - 1094
  • [42] Cost-Sensitive Face Recognition
    Zhang, Yin
    Zhou, Zhi-Hua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (10) : 1758 - 1769
  • [43] An Adaptive Cost-sensitive Classifier
    Chen, Xiaolin
    Song, Enming
    Ma, Guangzhi
    2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 1, 2010, : 699 - 701
  • [44] Cost-sensitive meta-learning framework
    Shilbayeh, Samar Ali
    Vadera, Sunil
    JOURNAL OF MODELLING IN MANAGEMENT, 2021, : 987 - 1007
  • [45] Spam detection on social networks using cost-sensitive feature selection and ensemble-based regularized deep neural networks
    Aliaksandr Barushka
    Petr Hajek
    Neural Computing and Applications, 2020, 32 : 4239 - 4257
  • [46] Spam detection on social networks using cost-sensitive feature selection and ensemble-based regularized deep neural networks
    Barushka, Aliaksandr
    Hajek, Petr
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (09): : 4239 - 4257
  • [47] A cost-sensitive rotation forest algorithm for gene expression data classification
    Lu, Huijuan
    Yang, Lei
    Yan, Ke
    Xue, Yu
    Gao, Zhigang
    NEUROCOMPUTING, 2017, 228 : 270 - 276
  • [48] Cost-Sensitive Online Classification
    Wang, Jialei
    Zhao, Peilin
    Hoi, Steven C. H.
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 1140 - 1145
  • [49] An Efficient Predictive Model for Myocardial Infarction Using Cost-sensitive J48 Model
    Daraei, Atefeh
    Hamidi, Hodjat
    IRANIAN JOURNAL OF PUBLIC HEALTH, 2017, 46 (05) : 682 - 692
  • [50] Cost-sensitive continuous ensemble kernel learning for imbalanced data streams with concept drift
    Chen, Yingying
    Yang, Xiaowei
    Dai, Hong-Liang
    KNOWLEDGE-BASED SYSTEMS, 2024, 284