The Impact of Local Data Characteristics on Learning from Imbalanced Data

被引:0
作者
Stefanowski, Jerzy [1 ]
机构
[1] Poznan Univ Tech, Inst Comp Sci, PL-60965 Poznan, Poland
来源
ROUGH SETS AND INTELLIGENT SYSTEMS PARADIGMS, RSEISP 2014 | 2014年 / 8537卷
关键词
RULE INDUCTION; CLASSIFICATION; CLASSIFIERS; SMOTE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Problems of learning classifiers from imbalanced data are discussed. First, we look at different data difficulty factors corresponding to complex distributions of the minority class and show that they could be approximated by analysing the neighbourhood of the learning examples from the minority class. We claim that the results of this analysis could be a basis for developing new algorithms. In this paper we show such possibilities by discussing modifications of informed pre-processing method LN-SMOTE as well as by incorporating types of examples into rule induction algorithm BRACID.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 44 条
[1]   Learning classification rules from data [J].
An, A .
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2003, 45 (4-5) :737-748
[2]  
[Anonymous], 2012, Foundations of Rule Learning
[3]  
[Anonymous], 2004, ACM SIGKDD EXPLORATI, DOI DOI 10.1145/1007730.1007737
[4]  
[Anonymous], 1997, P 14 INT C ONMACHINE
[5]  
Anyfamis D, 2007, INT FED INFO PROC, P21
[6]  
Batista G. E., 2004, ACM SIGKDD Explor. Newslett., P20, DOI [10.1145/1007730.1007735, DOI 10.1145/1007730.1007735]
[7]  
Blaszczynski J., 2013, P COPEM 2013 SOLV CO, P10
[8]  
Blaszczynski J., 2013, CORES 2013 AISC, V226, P273
[9]  
Bunkhumpornpat C, 2009, LECT NOTES ARTIF INT, V5476, P475, DOI 10.1007/978-3-642-01307-2_43
[10]  
Chawla NV, 2005, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, P853, DOI 10.1007/0-387-25465-X_40