IRDA: Implicit data augmentation for deep imbalanced regression

被引:1
|
作者
Zhu, Weiyao [1 ]
Wu, Ou [1 ]
Yang, Nan [1 ]
机构
[1] Tianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
关键词
Deep imbalanced regression; Implicit data augmentation; Regularization; Regression loss;
D O I
10.1016/j.ins.2024.120873
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imbalanced data distributions are prevalent in real -world classification and regression tasks. Data augmentation is a commonly employed technique to mitigate this issue, with implicit methods gaining attention for their effectiveness and efficiency. However, implicit data augmentation methods have not been extensively explored in the context of regression tasks. To address this gap, we introduce IRDA, a novel learning method for regression that incorporates implicit data augmentation. Our approach includes developing a new augmentation strategy specifically tailored for deep imbalanced regression tasks, and a regression loss function that is suitable for real -world data with imbalanced label distributions and non -uniformly distributed features. We derive an easily computable surrogate loss and propose two implicit data augmentation algorithms, one incorporating meta -learning and one without. Additionally, we provide regularization perspective to offer a deeper understanding of IRDA. We evaluate IRDA on five datasets, including a large-scale dataset, demonstrating its effectiveness in mitigating the adverse effects of imbalanced data distribution and its adaptability to various regression tasks.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] A Survey of Mix-based Data Augmentation: Taxonomy, Methods, Applications, and Explainability
    Cao, Chengtai
    Zhou, Fan
    Dai, Yurou
    Wang, Jianping
    Zhang, Kunpeng
    ACM COMPUTING SURVEYS, 2025, 57 (02)
  • [42] DL-Reg: A deep learning regularization technique using linear regression
    Dialameh, Maryam
    Hamzeh, Ali
    Rahmani, Hossein
    Dialameh, Safoura
    Kwon, Hyock Ju
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 247
  • [43] Semi-supervised regression with manifold: A Bayesian deep kernel learning approach
    Xu, Lu
    Hu, Chen
    Mei, Kuizhi
    NEUROCOMPUTING, 2022, 497 : 76 - 85
  • [44] Prediction-based regularization using data augmented regression
    Giles Hooker
    Saharon Rosset
    Statistics and Computing, 2012, 22 : 237 - 249
  • [45] Incorporating Predictor Network in Penalized Regression with Application to Microarray Data
    Pan, Wei
    Xie, Benhuai
    Shen, Xiaotong
    BIOMETRICS, 2010, 66 (02) : 474 - 484
  • [46] Bootstrap Enhanced Penalized Regression for Variable Selection with Neuroimaging Data
    Abram, Samantha V.
    Helwig, Nathaniel E.
    Moodie, Craig A.
    DeYoung, Colin G.
    MacDonald, Angus W., III
    Waller, Niels G.
    FRONTIERS IN NEUROSCIENCE, 2016, 10
  • [47] Multi-task ordinal regression with labeled and unlabeled data
    Xiao, Yanshan
    Zhang, Liangwang
    Liu, Bo
    Cai, Ruichu
    Hao, Zhifeng
    INFORMATION SCIENCES, 2023, 649
  • [48] Prediction-based regularization using data augmented regression
    Hooker, Giles
    Rosset, Saharon
    STATISTICS AND COMPUTING, 2012, 22 (01) : 237 - 249
  • [49] Optimized application of penalized regression methods to diverse genomic data
    Waldron, Levi
    Pintilie, Melania
    Tsao, Ming-Sound
    Shepherd, Frances A.
    Huttenhower, Curtis
    Jurisica, Igor
    BIOINFORMATICS, 2011, 27 (24) : 3399 - 3406
  • [50] Matrix variate logistic regression model with application to EEG data
    Hung, Hung
    Wang, Chen-Chien
    BIOSTATISTICS, 2013, 14 (01) : 189 - 202