Swift Imbalance Data Classification using SMOTE and Extreme Learning Machine

被引:9
作者
Rustogi, Rishabh [1 ]
Prasad, Ayush [1 ]
机构
[1] Shiv Nadar Univ, Dept Comp Sci, Greater Noida, Uttar Pradesh, India
来源
2019 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS 2019) | 2019年
关键词
Imbalanced Data; Data Classification; Extreme Learning Machine; SMOTE; Condensed Nearest-Neighbor; Tomek Links;
D O I
10.1109/iccids.2019.8862112
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Continuous expansion in the fields of science and technology has led to the immense availability and attainability of data in every field. Fundamentally understanding and analyzing this data is a critical job in the decision-making process. Although, great success has been achieved by the prevailing data engineering and mining techniques, the problem of swift classification of the imbalanced data still exists in academia and industry. A potential solution to the problem of skewness in data can be resolved by data upsampling or downsampling. There exists a few techniques that firstly remove skewness and then perform classification, however, these methods suffer from hurdles like abortive precision or slower learning rate. In this paper, a hybrid method to classify binary imbalanced data using Synthetic Minority Over-sampling Technique followed by Extreme Learning Machine is proposed. Our method along with swift learning rate is efficacious to predict the desired class. We verified our model using five standard imbalance dataset and obtained higher F-measure, G-mean and ROC score for all the dataset.
引用
收藏
页数:6
相关论文
共 29 条
  • [1] Bunke H., 1997, IEEE T PATTERN ANAL
  • [2] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)
  • [3] Chen Y., 2013, NEUROCOMPUTING
  • [4] Deng Wan-Yu, 2010, Chinese Journal of Computers, V33, P279, DOI 10.3724/SP.J.1016.2010.00279
  • [5] Elkan C., 2001, P 17 INT JOINT C ART, P973
  • [6] A multiple resampling method for learning from imbalanced data sets
    Estabrooks, A
    Jo, TH
    Japkowicz, N
    [J]. COMPUTATIONAL INTELLIGENCE, 2004, 20 (01) : 18 - 36
  • [7] Garcia E. A., 2008, IEEE T KNOWLEDGE DAT
  • [8] Gustavo E. A., 2004, SIGKDD Explor., V200, P20, DOI DOI 10.1145/1007730.1007735
  • [9] Haixiang G., 2017, EXPERT SYSTEMS APPL, V73
  • [10] Herrera F., 2008, FUZZY SETS SYSTEMS