Class Imbalance and Cost-Sensitive Decision Trees: A Unified Survey Based on a Core Similarity

Cited by: 12
Authors
Siers, Michael J. [1]
Islam, Md Zahidul [1]
Affiliations
[1] Charles Sturt Univ, Sch Comp & Math, Panorama Ave, Bathurst, NSW 2795, Australia
Keywords
Class imbalance; cost sensitivity; classification; undersampling method; SMOTE; prediction
DOI
10.1145/3415156
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Class imbalance treatment methods and cost-sensitive classification algorithms are typically treated as two independent research areas. However, many of these techniques have properties in common. After providing a background to the two fields of research, this article identifies the fundamental mechanism which is common to both. Using this mechanism, a taxonomy is created which encompasses approaches to both class imbalance treatment and cost-sensitive classification. Through this survey, we aim to bridge the gap between the two fields such that lessons from one field may be applied to the other. Many data mining tasks are naturally both class imbalanced and cost-sensitive. This survey is useful for researchers and practitioners approaching these tasks as it provides a detailed overview of approaches in both fields. Many of the surveyed techniques are classifier independent. However, we chose to focus on techniques which were either decision tree-based or compatible with decision trees. This choice was based on the popularity and novelty of their application to both fields.
Pages: 31
Related papers
50 in total
  • [1] A Benefit-Cost Based Method for Cost-Sensitive Decision Trees
    Liu, Xingyi
    PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL III, 2009, : 463 - 467
  • [2] Cost-sensitive decision trees with multiple cost scales
    Qin, ZX
    Zhang, SC
    Zhang, CQ
    AI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3339 : 380 - 390
  • [3] Evolutionary induction of cost-sensitive decision trees
    Kretowski, Marek
    Grzes, Marek
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2006, 4203 : 121 - 126
  • [4] Test strategies for cost-sensitive decision trees
    Ling, Charles X.
    Sheng, Victor S.
    Yang, Qiang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (08) : 1055 - 1067
  • [5] Cost-Sensitive Pattern-Based classification for Class Imbalance problems
    Loyola-Gonzalez, Octavio
    Fco Martinez-Trinidad, Jose
    Ariel Carrasco-Ochoa, Jesus
    Garcia-Borroto, Milton
    IEEE ACCESS, 2019, 7 : 60411 - 60427
  • [6] Cost-Sensitive Feature Selection for Class Imbalance Problem
    Bach, Malgorzata
    Werner, Aleksandra
    INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, PT I, 2018, 655 : 182 - 194
  • [7] Building cost-sensitive decision trees for medical applications
    Freitas, Alberto
    AI COMMUNICATIONS, 2011, 24 (03) : 285 - 287
  • [8] Example-dependent cost-sensitive decision trees
    Bahnsen, Alejandro Correa
    Aouada, Djamila
    Ottersten, Bjoern
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (19) : 6609 - 6619
  • [9] Simple test strategies for cost-sensitive decision trees
    Sheng, SL
    Ling, CX
    Yang, Q
    MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 365 - +
  • [10] A Cost-Sensitive Sparse Representation Based Classification for Class-Imbalance Problem
    Liu, Zhenbing
    Gao, Chunyang
    Yang, Huihua
    He, Qijia
    SCIENTIFIC PROGRAMMING, 2016, 2016