Augmentation, Mixing, and Consistency Regularization for Domain Generalization

被引：1

作者：

Mehmood, Noaman ^{[1
]}

Barner, Kenneth ^{[1
]}

机构：

[1] Univ Delaware, Elect & Comp Engn Dept, Newark, DE 19711 USA

来源：

2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024 | 2024年

关键词：

generalization; augmentation; discriminative; regularization; consistency;

D O I：

10.1109/ICMI60790.2024.10585938

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This work addresses the prevalent challenge in contemporary deep neural networks (DNNs), in which performance diminishes when confronted with testing data that differ in distribution from the training data. We approach this problem using Domain Generalization (DG), training the model on multiple related source domains to bolster its performance on an unseen target domain. Our approach is a tripartite solution involving data augmentation, mixing of style information, and consistency regularization. Data augmentation is achieved by mixing the amplitude spectrum of two distinct images in the frequency domain. Given that image style is intrinsically tied to the visual domain, the style information is mixed into the lower layers of the neural network. This step familiarizes the model with a range of features, enhancing its ability to generalize on unseen target data. Consistency regularization is then introduced to reduce the prediction error between the original and augmented samples, further elevating the performance. Using three distinct benchmarks, extensive experiments were performed that confirm the Augmentation, Mixing, and Consistency Regularization (AMCR) framework on unseen target domains in comparison with existing state-of-the-art (SOTA) methods. The findings underscore the value of DNNs that can effectively generalize across diverse environments, particularly in real-world applications such as autonomous driving.

引用

页数：6

共 32 条

[1] Balaji Y, 2018, ADV NEUR IN, V31
[2] Blanchard G., 2011, ADV NEURAL INFORM PR, V24, P2178
[3] Domain Generalization by Solving Jigsaw Puzzles
Carlucci, Fabio M.
D'Innocente, Antonio
Bucci, Silvia
Caputo, Barbara
Tommasi, Tatiana
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2224 - 2233
[4] Finn C, 2017, PR MACH LEARN RES, V70
[5] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[6] Hinton G, 2015, Arxiv, DOI [arXiv:1503.02531, DOI 10.48550/ARXIV.1503.02531]
[7] Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization
Huang, Xun
Belongie, Serge
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1510 - 1519
[8] Kaiyang Zhou, 2020, Computer Vision - ECCV 2020 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12361), P561, DOI 10.1007/978-3-030-58517-4_33
[9] Laine S, 2017, Arxiv, DOI [arXiv:1610.02242, DOI 10.48550/ARXIV.1610.02242]
[10] Episodic Training for Domain Generalization
Li, Da
Zhang, Jianshu
Yang, Yongxin
Liu, Cong
Song, Yi-Zhe
Hospedales, Timothy M.
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1446 - 1455

← 1 2 3 4 →