Evaluation Measures of the Classification Performance of Imbalanced Data Sets

被引:199
作者
Gu, Qiong [1 ,2 ]
Zhu, Li [2 ]
Cai, Zhihua [2 ]
机构
[1] Xiangfan Univ, Fac Math & Comp Sci, Xiangfan 441053, Hubei, Peoples R China
[2] China Univ Geosci, Sch Comp, Wuhan 430074, Peoples R China
来源
COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS | 2009年 / 51卷
关键词
Evaluation; classification performance; imbalanced data sets;
D O I
10.1007/978-3-642-04962-0_53
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discriminant Measures for Classification Performance play a critical role in guiding the design of classifiers, assessment methods and evaluation measures are at least as important as algorithm and are the first key stage to a successful data mining. We systematically summarized the evaluation measures of Imbalanced Data Sets (IDS). Several different type measures, such as commonly performance evaluation measures and visualizing classifier performance measures have been analyzed and compared. The problems of these measures towards IDS may lead to misunderstanding of classification results and even wrong strategy decision. Beside that, a series of complex numerical evaluation measures were also investigated which can also serve for evaluating classification performance of IDS.
引用
收藏
页码:461 / +
页数:2
相关论文
共 50 条
[11]   A LEARNING METHOD FOR IMBALANCED DATA SETS [J].
de la Calleja, Jorge ;
Fuentes, Olac ;
Gonzalez, Jesus ;
Aceves-Perez, Rita M. .
KDIR 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2009, :307-+
[12]   On the 2-tuples based genetic tuning performance for fuzzy rule based classification systems in imbalanced data-sets [J].
Fernandez, Alberto ;
Jose del Jesus, Maria ;
Herrera, Francisco .
INFORMATION SCIENCES, 2010, 180 (08) :1268-1291
[13]   A First Study on the Use of Interval-Valued Fuzzy Sets with Genetic Tuning for Classification with Imbalanced Data-Sets [J].
Sanz, J. ;
Fernandez, A. ;
Bustince, H. ;
Herrera, F. .
HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, 2009, 5572 :581-+
[14]   (1+ε)-class Classification: an Anomaly Detection Method for Highly Imbalanced or Incomplete Data Sets [J].
Borisyak, Maxim ;
Ryzhikov, Artem ;
Ustyuzhanin, Andrey ;
Derkach, Denis ;
Ratnikov, Fedor ;
Mineeva, Olga .
JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
[15]   An Optimized Random Forest Classification Method for Processing Imbalanced Data Sets of Alzheimer's Disease [J].
Sun, Haijing ;
Wang, Anna ;
Feng, Yun ;
Liu, Chen .
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, :1670-1673
[16]   Improving the Performance of Fuzzy Rule Based Classification Systems for Highly Imbalanced Data-Sets Using an Evolutionary Adaptive Inference System [J].
Fernandez, Alberto ;
Jose del Jesus, Maria ;
Herrera, Francisco .
BIO-INSPIRED SYSTEMS: COMPUTATIONAL AND AMBIENT INTELLIGENCE, PT 1, 2009, 5517 :294-+
[17]   Applying MASI Algorithm to Improve the Classification Performance of Imbalanced Data in Fraud Detection [J].
Thi-Lich Nghiem ;
Thi-Toan Nghiem .
ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING (ICCSAMA 2019), 2020, 1121 :150-162
[18]   On the influence of an adaptive inference system in fuzzy rule based classification systems for imbalanced data-sets [J].
Fernandez, Alberto ;
Jose del Jesus, Maria ;
Herrera, Francisco .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (06) :9805-9812
[19]   A study of the behaviour of linguistic fuzzy rule based classification systems in the framework of imbalanced data-sets [J].
Fernandez, Alberto ;
Garcia, Salvador ;
Jose del Jesus, Maria ;
Herrera, Francisco .
FUZZY SETS AND SYSTEMS, 2008, 159 (18) :2378-2398
[20]   Hierarchical fuzzy rule based classification systems with genetic rule selection for imbalanced data-sets [J].
Fernandez, Alberto ;
del Jesus, Maria Jose ;
Herrera, Francisco .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2009, 50 (03) :561-577