IRDA: Implicit data augmentation for deep imbalanced regression

被引：1

作者：

Zhu, Weiyao ^{[1
]}

Wu, Ou ^{[1
]}

Yang, Nan ^{[1
]}

机构：

[1] Tianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China

来源：

INFORMATION SCIENCES | 2024年 / 677卷

关键词：

Deep imbalanced regression; Implicit data augmentation; Regularization; Regression loss;

D O I：

10.1016/j.ins.2024.120873

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Imbalanced data distributions are prevalent in real -world classification and regression tasks. Data augmentation is a commonly employed technique to mitigate this issue, with implicit methods gaining attention for their effectiveness and efficiency. However, implicit data augmentation methods have not been extensively explored in the context of regression tasks. To address this gap, we introduce IRDA, a novel learning method for regression that incorporates implicit data augmentation. Our approach includes developing a new augmentation strategy specifically tailored for deep imbalanced regression tasks, and a regression loss function that is suitable for real -world data with imbalanced label distributions and non -uniformly distributed features. We derive an easily computable surrogate loss and propose two implicit data augmentation algorithms, one incorporating meta -learning and one without. Additionally, we provide regularization perspective to offer a deeper understanding of IRDA. We evaluate IRDA on five datasets, including a large-scale dataset, demonstrating its effectiveness in mitigating the adverse effects of imbalanced data distribution and its adaptability to various regression tasks.

引用

页数：20

共 50 条

[41] A Survey of Mix-based Data Augmentation: Taxonomy, Methods, Applications, and Explainability
Cao, Chengtai
Zhou, Fan
Dai, Yurou
Wang, Jianping
Zhang, Kunpeng
ACM COMPUTING SURVEYS, 2025, 57 (02)
[42] DL-Reg: A deep learning regularization technique using linear regression
Dialameh, Maryam
Hamzeh, Ali
Rahmani, Hossein
Dialameh, Safoura
Kwon, Hyock Ju
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 247
[43] Semi-supervised regression with manifold: A Bayesian deep kernel learning approach
Xu, Lu
Hu, Chen
Mei, Kuizhi
NEUROCOMPUTING, 2022, 497 : 76 - 85
[44] Prediction-based regularization using data augmented regression
Giles Hooker
Saharon Rosset
Statistics and Computing, 2012, 22 : 237 - 249
[45] Incorporating Predictor Network in Penalized Regression with Application to Microarray Data
Pan, Wei
Xie, Benhuai
Shen, Xiaotong
BIOMETRICS, 2010, 66 (02) : 474 - 484
[46] Bootstrap Enhanced Penalized Regression for Variable Selection with Neuroimaging Data
Abram, Samantha V.
Helwig, Nathaniel E.
Moodie, Craig A.
DeYoung, Colin G.
MacDonald, Angus W., III
Waller, Niels G.
FRONTIERS IN NEUROSCIENCE, 2016, 10
[47] Multi-task ordinal regression with labeled and unlabeled data
Xiao, Yanshan
Zhang, Liangwang
Liu, Bo
Cai, Ruichu
Hao, Zhifeng
INFORMATION SCIENCES, 2023, 649
[48] Prediction-based regularization using data augmented regression
Hooker, Giles
Rosset, Saharon
STATISTICS AND COMPUTING, 2012, 22 (01) : 237 - 249
[49] Optimized application of penalized regression methods to diverse genomic data
Waldron, Levi
Pintilie, Melania
Tsao, Ming-Sound
Shepherd, Frances A.
Huttenhower, Curtis
Jurisica, Igor
BIOINFORMATICS, 2011, 27 (24) : 3399 - 3406
[50] Matrix variate logistic regression model with application to EEG data
Hung, Hung
Wang, Chen-Chien
BIOSTATISTICS, 2013, 14 (01) : 189 - 202

← 1 2 3 4 5 →