IA-SUWO: An Improving Adaptive semi-unsupervised weighted oversampling for imbalanced classification problems

被引:24
作者
Wei Jianan [1 ]
Huang Haisong [1 ]
Yao Liguo [1 ,2 ]
Hu Yao [1 ,3 ]
Fan Qingsong [1 ]
Huang Dong [1 ]
机构
[1] Guizhou Univ, Key Lab Adv Mfg Technol, Minist Educ, Guiyang 550025, Guizhou, Peoples R China
[2] Yuan Ze Univ, Dept Ind Engn & Management, Taoyuan 32003, Taiwan
[3] Guizhou Renhe Zhiyuan Data Serv Co Ltd, Guiyang 550025, Guizhou, Peoples R China
关键词
Imbalanced classification; Least squares support numerical spectrum; Minority samples weights; Oversampling; k* information nearest neighbors; SMOTE; MACHINE;
D O I
10.1016/j.knosys.2020.106116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the essence of machine learning, classification is widely used in real life, however, imbalanced data has brought great challenges to classification problems. This is because standard classifiers tend to favor the majority instances and ignore the minority instances. The new oversampling algorithms (e.g. A-SUWO) based on the improving majority weighted minority oversampling (IMWMO) method assign weights through the Euclidean distances from majority instances to hard-to-learn minority instances, and then guide the synthesis of minority samples according to the weights to address the offset of the classification hyperplanes. A-SUWO has achieved better results than traditional oversampling algorithms (e.g. SMOTE and MWMOTE, etc.), when its parameters are well adjusted. However, A-SUWO may give minority training samples inappropriate weights in some irregularly distributed scenarios and make learning tasks even more harder. Additionally, A-SUWO's knn synthesizing method may not obtain wider and more effective instances. Therefore, we propose an improving adaptive semi-unsupervised weighted oversampling (IA-SUWO) technique to address the imbalanced classification problems more effectively. The improvement of IA-SUWO mainly focuses on the following two aspects: (1) comprehensively considering the least squares support numerical spectrum values and the IMWMO method to assign weights to minority instances, and (2) synthesizing new instances using the k* information nearest neighbors (k*INN) method. IA-SUWO aims to maximize the probability that all important minority samples will be drawn and generates more efficient (more scattered) boundary instances. Results demonstrate that IA-SUWO achieves significantly better results in most datasets compared with other 10 oversampling algorithms and 2 ensemble algorithms. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:19
相关论文
共 5 条
  • [1] Adaptive semi-unsupervised weighted oversampling (A-SUWO) for imbalanced datasets
    Nekooeimehr, Iman
    Lai-Yuen, Susana K.
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 46 : 405 - 416
  • [2] Improved Adaptive Semi-Unsupervised Weighted Oversampling using Sparsity Factor for Imbalanced Datasets
    Ali, Haseeb
    Salleh, Mohd Najib Mohd
    Hussain, Kashif
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (11) : 372 - 383
  • [3] An Improving Majority Weighted Minority Oversampling Technique for Imbalanced Classification Problem
    Wang, Chao-Ran
    Shao, Xin-Hui
    IEEE ACCESS, 2021, 9 : 5069 - 5082
  • [4] NI-MWMOTE: An improving noise-immunity majority weighted minority oversampling technique for imbalanced classification problems
    Wei, Jianan
    Huang, Haisong
    Yao, Liguo
    Hu, Yao
    Fan, Qingsong
    Huang, Dong
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 158
  • [5] Self-adaptive Weighted Extreme Learning Machine for Imbalanced Classification Problems
    Long, Hao
    He, Yulin
    Huang, Joshua Zhexue
    Wang, Qiang
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, 2017, 2017, 10526 : 116 - 128