Classification of Imbalanced Data Using Deep Learning with Adding Noise

被引:6
作者
Fan, Wan-Wei [1 ]
Lee, Ching-Hung [1 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu, Taiwan
关键词
SURFACE DEFECT DETECTION;
D O I
10.1155/2021/1735386
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a method to treat the classification of imbalanced data by adding noise to the feature space of convolutional neural network (CNN) without changing a data set (ratio of majority and minority data). Besides, a hybrid loss function of crossentropy and KL divergence is proposed. The proposed approach can improve the accuracy of minority class in the testing data. In addition, a simple design method for selecting structure of CNN is first introduced and then, we add noise in feature space of CNN to obtain proper features by a training process and to improve the classification results. From comparison results, we can find that the proposed method can extract the suitable features to improve the accuracy of minority class. Finally, illustrated examples of multiclass classification problems and the corresponding discussion in balance ratio are presented. Our approach performs well with smaller network structure compared with other deep models. In addition, the performance is improved over 40% in defective accuracy by adding noise approach. Finally, the accuracy is higher than 96%; even the imbalanced ratio (IR) is one hundred.
引用
收藏
页数:18
相关论文
共 54 条
  • [51] Yap B. W., 2014, P 1 INT C ADV DAT IN, V285, P13, DOI DOI 10.1007/978-981-4585-18-7_2
  • [52] Yin J, 2020, AAAI CONF ARTIF INTE, V34, P6680
  • [53] CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features
    Yun, Sangdoo
    Han, Dongyoon
    Oh, Seong Joon
    Chun, Sanghyuk
    Choe, Junsuk
    Yoo, Youngjoon
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6022 - 6031
  • [54] Zhang H, 2018, P INT C LEARN REPR I, DOI DOI 10.48550/ARXIV.1710.09412