Borderline-margin loss based deep metric learning framework for imbalanced data

Cited by: 8
Authors
Yan, Mi [1, 2]
Li, Ning [1, 2]
Affiliations
[1] Shanghai Jiao Tong Univ, Minist Educ China, Dept Automat, Key Lab Syst Control & Informat Proc, Shanghai 200240, Peoples R China
[2] Shanghai Engn Res Ctr Intelligent Control & Manag, Shanghai 200240, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
Imbalanced classification; Class imbalance; Class overlap; Deep metric framework; Borderline-margin loss; SMOTE; CLASSIFICATION; MACHINE; COST;
DOI
10.1007/s10489-022-03494-4
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Imbalanced data suffer from the problem that the minority class is under-represented compared with the majority classes. Traditional imbalanced learning algorithms consider only class imbalance and ignore class overlap, which leads to poor accuracy on minority samples in overlapping regions. To address this issue, we propose a deep metric framework with borderline-margin loss (DMFBML) that improves intra-class coherence and inter-class difference in overlapping regions. First, a flexible borderline margin is designed for each minority sample and is adaptively adjusted according to the labels of its neighborhood. The proposed margin makes it possible to discriminate minority samples with varying degrees of overlap, which preserves the valuable information at the classification boundary. The input data are then reconstructed into a set of training triplets to generate more metric constraints for minority samples, thereby increasing the inter-class difference in overlapping regions. Finally, a neural network with DMFBML is presented to achieve better classification performance on imbalanced data. The proposed method is verified by comparative experiments on six synthetic datasets and eleven real-world datasets.
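The abstract does not state the margin formula, so the following is only an illustrative sketch of the general idea, not the authors' DMFBML. It assumes a hypothetical margin of the form `base_margin * (1 + overlap)`, where `overlap` is the fraction of majority-class points among the `k` nearest neighbours of a minority sample, and plugs that margin into a standard triplet hinge loss. The function names, the parameters `k` and `base_margin`, and the margin formula itself are assumptions made for illustration.

```python
import numpy as np

def borderline_margins(X, y, minority_label=1, k=3, base_margin=1.0):
    """Assumed per-sample margin for each minority sample (hypothetical formula).

    margin = base_margin * (1 + fraction of non-minority points among
    the k nearest neighbours), so samples deeper in an overlapping
    region receive a larger margin.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    margins = {}
    for i in np.flatnonzero(y == minority_label):
        dist = np.linalg.norm(X - X[i], axis=1)
        dist[i] = np.inf                      # exclude the sample itself
        neighbours = np.argsort(dist)[:k]     # indices of the k nearest points
        overlap = np.mean(y[neighbours] != minority_label)
        margins[int(i)] = base_margin * (1.0 + overlap)
    return margins

def triplet_loss(anchor, positive, negative, margin):
    """Standard triplet hinge loss with a caller-supplied margin."""
    d_pos = np.linalg.norm(np.asarray(anchor) - np.asarray(positive))
    d_neg = np.linalg.norm(np.asarray(anchor) - np.asarray(negative))
    return max(0.0, d_pos - d_neg + margin)

# Toy data: three minority points in a clean cluster, one minority
# point (index 5) surrounded by majority points.
X = [[0.0], [0.1], [0.2], [3.0], [3.1], [3.2], [3.3]]
y = [1, 1, 1, 0, 0, 1, 0]
margins = borderline_margins(X, y, minority_label=1, k=2)
```

In this toy setup the isolated minority point at index 5 has only majority neighbours and therefore receives a larger margin than the points in the clean minority cluster, which is the qualitative behaviour the abstract describes for samples with a higher degree of overlap.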
Pages: 1487-1504
Number of pages: 18