Development of a Neighborhood Based Adaptive Heterogeneous Oversampling Ensemble Classifier for Imbalanced Binary Class Datasets

被引:0
作者
Subbulaxmi, S. Santha [1 ]
Arumugam, G. [2 ]
机构
[1] Madurai Kamaraj Univ, Dept Comp Sci, Madurai, Tamil Nadu, India
[2] Madurai Kamaraj Univ, Sch Informat Technol, Dept Comp Sci, Madurai, Tamil Nadu, India
来源
PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2022 | 2023年 / 475卷
关键词
Imbalanced data; Classification; Ensemble classifier; Heterogeneous ensemble; Multiple classifiers;
D O I
10.1007/978-981-19-2840-6_28
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Class imbalance prevails in many real-word datasets. In this paper, a Neighborhood based Adaptive Heterogeneous Oversampling Ensemble Classifier method is proposed to handle class imbalance in datasets. The proposed method adopts an oversampling approach to create a set of balanced representative training datasets. Several base classifiers are built based on those training datasets, and an adaptive heterogeneous ensemble classifier is created. The proposed method is examined with five datasets, and examination results are compared with popular oversampling algorithms. The comparison revealed that proposed method is able to achieve better performance results.
引用
收藏
页码:353 / 361
页数:9
相关论文
共 22 条
[1]  
Alcalá-Fdez J, 2011, J MULT-VALUED LOG S, V17, P255
[2]  
Branco P., 2017, Proc. Mach. Learn. Res, P36
[3]  
Branco P, 2016, Arxiv, DOI arXiv:1604.08079
[4]   Weighted Data Gravitation Classification for Standard and Imbalanced Data [J].
Cano, Alberto ;
Zafra, Amelia ;
Ventura, Sebastian .
IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (06) :1672-1687
[5]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[6]   Hellinger distance decision trees are robust and skew-insensitive [J].
Cieslak, David A. ;
Hoens, T. Ryan ;
Chawla, Nitesh V. ;
Kegelmeyer, W. Philip .
DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 24 (01) :136-158
[7]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[8]   Evolutionary Sampling and Software Quality Modeling of High-Assurance Systems [J].
Drown, Dennis J. ;
Khoshgoftaar, Taghi M. ;
Seliya, Naeem .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2009, 39 (05) :1097-1107
[9]  
GATES GW, 1972, IEEE T INFORM THEORY, V18, P431, DOI 10.1109/TIT.1972.1054809
[10]   ADASYN: Adaptive Synthetic Sampling Approach for Imbalanced Learning [J].
He, Haibo ;
Bai, Yang ;
Garcia, Edwardo A. ;
Li, Shutao .
2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, :1322-1328