Evaluation Measures of the Classification Performance of Imbalanced Data Sets

被引:185
作者
Gu, Qiong [1 ,2 ]
Zhu, Li [2 ]
Cai, Zhihua [2 ]
机构
[1] Xiangfan Univ, Fac Math & Comp Sci, Xiangfan 441053, Hubei, Peoples R China
[2] China Univ Geosci, Sch Comp, Wuhan 430074, Peoples R China
来源
COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS | 2009年 / 51卷
关键词
Evaluation; classification performance; imbalanced data sets;
D O I
10.1007/978-3-642-04962-0_53
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discriminant Measures for Classification Performance play a critical role in guiding the design of classifiers, assessment methods and evaluation measures are at least as important as algorithm and are the first key stage to a successful data mining. We systematically summarized the evaluation measures of Imbalanced Data Sets (IDS). Several different type measures, such as commonly performance evaluation measures and visualizing classifier performance measures have been analyzed and compared. The problems of these measures towards IDS may lead to misunderstanding of classification results and even wrong strategy decision. Beside that, a series of complex numerical evaluation measures were also investigated which can also serve for evaluating classification performance of IDS.
引用
收藏
页码:461 / +
页数:2
相关论文
共 50 条
  • [31] Random Forests lithology prediction method for imbalanced data sets
    Wang G.
    Song J.
    Xu F.
    Zhang W.
    Liu J.
    Chen F.
    [J]. Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2021, 56 (04): : 679 - 687
  • [32] A hierarchical genetic fuzzy system based on genetic programming for addressing classification with highly imbalanced and borderline data-sets
    Lopez, Victoria
    Fernandez, Alberto
    Jose del Jesus, Maria
    Herrera, Francisco
    [J]. KNOWLEDGE-BASED SYSTEMS, 2013, 38 : 85 - 104
  • [33] Imbalanced Data Classification Algorithm Based on Clustering and SVM
    Huang, Bo
    Zhu, Yimin
    Wang, Zhongzhen
    Fang, Zhijun
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2021, 30 (02)
  • [34] An Adaptive Sampling Ensemble Classifier for Learning from Imbalanced Data Sets
    Geiler, Ordonez Jon
    Hong, Li
    Yue-Jian, Guo
    [J]. INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 513 - 517
  • [35] A new robust fuzzy clustering validity index for imbalanced data sets
    Liu, Yun
    Jiang, Yanfang
    Hou, Tao
    Liu, Fu
    [J]. INFORMATION SCIENCES, 2021, 547 (547) : 579 - 591
  • [36] A study on the use of the fuzzy reasoning method based on the winning rule vs. voting procedure for classification with imbalanced data sets
    Fernandez, Alberto
    Garcia, Salvador
    Jos del Jesus, Maria
    Herrera, Francisco
    [J]. COMPUTATIONAL AND AMBIENT INTELLIGENCE, 2007, 4507 : 375 - +
  • [37] Studying the Behavior of a Multiobjective Genetic Algorithm to design Fuzzy Rule-Based Classification Systems for Imbalanced Data-Sets
    Villar, Pedro
    Fernandez, Alberto
    Herrera, Francisco
    [J]. IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 1239 - 1246
  • [38] FEATURE SELECTION AND GRANULARITY LEARNING IN GENETIC FUZZY RULE-BASED CLASSIFICATION SYSTEMS FOR HIGHLY IMBALANCED DATA-SETS
    Villar, Pedro
    Fernandez, Alberto
    Carrasco, Ramon A.
    Herrera, Francisco
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2012, 20 (03) : 369 - 397
  • [39] An approach for classification of highly imbalanced data using weighting and undersampling
    Ashish Anand
    Ganesan Pugalenthi
    Gary B. Fogel
    P. N. Suganthan
    [J]. Amino Acids, 2010, 39 : 1385 - 1391
  • [40] An improved weighted extreme learning machine for imbalanced data classification
    Lu, Chengbo
    Ke, Haifeng
    Zhang, Gaoyan
    Mei, Ying
    Xu, Huihui
    [J]. MEMETIC COMPUTING, 2019, 11 (01) : 27 - 34