IRDA: Implicit data augmentation for deep imbalanced regression

被引:1
|
作者
Zhu, Weiyao [1 ]
Wu, Ou [1 ]
Yang, Nan [1 ]
机构
[1] Tianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
关键词
Deep imbalanced regression; Implicit data augmentation; Regularization; Regression loss;
D O I
10.1016/j.ins.2024.120873
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imbalanced data distributions are prevalent in real -world classification and regression tasks. Data augmentation is a commonly employed technique to mitigate this issue, with implicit methods gaining attention for their effectiveness and efficiency. However, implicit data augmentation methods have not been extensively explored in the context of regression tasks. To address this gap, we introduce IRDA, a novel learning method for regression that incorporates implicit data augmentation. Our approach includes developing a new augmentation strategy specifically tailored for deep imbalanced regression tasks, and a regression loss function that is suitable for real -world data with imbalanced label distributions and non -uniformly distributed features. We derive an easily computable surrogate loss and propose two implicit data augmentation algorithms, one incorporating meta -learning and one without. Additionally, we provide regularization perspective to offer a deeper understanding of IRDA. We evaluate IRDA on five datasets, including a large-scale dataset, demonstrating its effectiveness in mitigating the adverse effects of imbalanced data distribution and its adaptability to various regression tasks.
引用
收藏
页数:20
相关论文
共 50 条
  • [11] INVERSE REGRESSION FOR LONGITUDINAL DATA
    Jiang, Ci-Ren
    Yu, Wei
    Wang, Jane-Ling
    ANNALS OF STATISTICS, 2014, 42 (02): : 563 - 591
  • [12] Nonlinear system identification via data augmentation
    Formentin, Simone
    Mazzoleni, Mirko
    Scandella, Matteo
    Previdi, Fabio
    SYSTEMS & CONTROL LETTERS, 2019, 128 : 56 - 63
  • [13] Implicit Seismic Full Waveform Inversion With Deep Neural Representation
    Sun, Jian
    Innanen, Kristopher
    Zhang, Tianze
    Trad, Daniel
    JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH, 2023, 128 (03)
  • [14] Linear regression models for functional data
    Cardot, Herve
    Sarda, Pascal
    ART OF SEMIPARAMETRICS, 2006, : 49 - +
  • [15] On Data-Enriched Logistic Regression
    Zheng, Cheng
    Dasgupta, Sayan
    Xie, Yuxiang
    Haris, Asad
    Chen, Ying-Qing
    MATHEMATICS, 2025, 13 (03)
  • [16] Further Advantages of Data Augmentation on Convolutional Neural Networks
    Hernandez-Garcia, Alex
    Koenig, Peter
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 95 - 103
  • [17] Regression Models for Multivariate Count Data
    Zhang, Yiwen
    Zhou, Hua
    Zhou, Jin
    Sun, Wei
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2017, 26 (01) : 1 - 13
  • [18] REGRESSION ANALYSIS FOR MICROBIOME COMPOSITIONAL DATA
    Shi, Pixu
    Zhang, Anru
    Li, Hongzhe
    ANNALS OF APPLIED STATISTICS, 2016, 10 (02): : 1019 - 1040
  • [19] An efficient weighted Lagrangian twin support vector machine for imbalanced data classification
    Shao, Yuan-Hai
    Chen, Wei-Jie
    Zhang, Jing-Jing
    Wang, Zhen
    Deng, Nai-Yang
    PATTERN RECOGNITION, 2014, 47 (09) : 3158 - 3167
  • [20] Optimizing Weighted ELM Based on Gray Wolf Optimizer for Imbalanced Data Classification
    Thammasakorn, Chudapa
    Chiewchanwattana, Sirapat
    Sunat, Khamron
    PROCEEDINGS OF 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING (ICITEE), 2018, : 512 - 517