Class-Adaptive Data Augmentation for Image Classification

被引:20
作者
Yoo, Jisu [1 ]
Kang, Seokho [1 ]
机构
[1] Sungkyunkwan Univ, Dept Ind Engn, Suwon, South Korea
基金
新加坡国家研究基金会;
关键词
Training data; Optimization; Image classification; Task analysis; Convolutional neural networks; Law enforcement; Image color analysis; data augmentation; class-adaptive data augmentation; hyperparameter optimization;
D O I
10.1109/ACCESS.2023.3258179
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data augmentation is a widely used regularization technique for improving the performance of convolutional neural networks (CNNs) in image classification tasks. To improve the effectiveness of data augmentation, it is important to find label-preserving transformations that fit the domain knowledge for a given dataset. In several real-world datasets, appropriate augmentation policies differ between classes, owing to their different characteristics. In this paper, we propose a class-adaptive data augmentation method that utilizes class-specific augmentation policies. First, we train the CNN without data augmentation. Subsequently, we derive a suitable augmentation policy for each class through an optimization procedure to maximize the degree of transformation while maintaining the label-preserving property of CNNs. Finally, we re-train the model using data augmentation based on derived class-specific augmentation policies. Through experiments using benchmark datasets with class-specific transformation constraints, we demonstrate that the proposed method achieves comparable or higher classification accuracy than the baseline methods using the same augmentation policy for all classes. Additionally, we confirm that the derived class-specific augmentation policies are consistent with the domain knowledge of each dataset.
引用
收藏
页码:26393 / 26402
页数:10
相关论文
共 31 条
[1]  
Cheung D.-Y., 2022, INT C LEARNING REPRE
[2]   AutoAugment: Learning Augmentation Strategies from Data [J].
Cubuk, Ekin D. ;
Zoph, Barret ;
Mane, Dandelion ;
Vasudevan, Vijay ;
Le, Quoc V. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :113-123
[3]   Fundamental Technologies in Modern Speech Recognition [J].
Furui, Sadaoki ;
Deng, Li ;
Gales, Mark ;
Ney, Hermann ;
Tokuda, Keiichi .
IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) :16-17
[4]   Faster AutoAugment: Learning Augmentation Strategies Using Backpropagation [J].
Hataya, Ryuichiro ;
Zdenek, Jan ;
Yoshizoe, Kazuki ;
Nakayama, Hideki .
COMPUTER VISION - ECCV 2020, PT XXV, 2020, 12370 :1-16
[5]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[6]  
He Z., 2019, arXiv
[7]  
Ho D, 2019, PR MACH LEARN RES, V97
[8]   Semi-supervised Wafer Map Pattern Recognition using Domain-Specific Data Augmentation and Contrastive Learning [J].
Hu, Hanbin ;
He, Chen ;
Li, Peng .
2021 IEEE INTERNATIONAL TEST CONFERENCE (ITC 2021), 2021, :113-122
[9]  
Inoue H, 2018, Arxiv, DOI [arXiv:1801.02929, 10.48550/arXiv.1801.02929]
[10]  
Ioffe Sergey, 2015, Proceedings of Machine Learning Research, V37, P448