Augmentation, Mixing, and Consistency Regularization for Domain Generalization

被引:1
作者
Mehmood, Noaman [1 ]
Barner, Kenneth [1 ]
机构
[1] Univ Delaware, Elect & Comp Engn Dept, Newark, DE 19711 USA
来源
2024 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2024 | 2024年
关键词
generalization; augmentation; discriminative; regularization; consistency;
D O I
10.1109/ICMI60790.2024.10585938
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work addresses the prevalent challenge in contemporary deep neural networks (DNNs), in which performance diminishes when confronted with testing data that differ in distribution from the training data. We approach this problem using Domain Generalization (DG), training the model on multiple related source domains to bolster its performance on an unseen target domain. Our approach is a tripartite solution involving data augmentation, mixing of style information, and consistency regularization. Data augmentation is achieved by mixing the amplitude spectrum of two distinct images in the frequency domain. Given that image style is intrinsically tied to the visual domain, the style information is mixed into the lower layers of the neural network. This step familiarizes the model with a range of features, enhancing its ability to generalize on unseen target data. Consistency regularization is then introduced to reduce the prediction error between the original and augmented samples, further elevating the performance. Using three distinct benchmarks, extensive experiments were performed that confirm the Augmentation, Mixing, and Consistency Regularization (AMCR) framework on unseen target domains in comparison with existing state-of-the-art (SOTA) methods. The findings underscore the value of DNNs that can effectively generalize across diverse environments, particularly in real-world applications such as autonomous driving.
引用
收藏
页数:6
相关论文
共 32 条
  • [1] Balaji Y, 2018, ADV NEUR IN, V31
  • [2] Blanchard G., 2011, ADV NEURAL INFORM PR, V24, P2178
  • [3] Domain Generalization by Solving Jigsaw Puzzles
    Carlucci, Fabio M.
    D'Innocente, Antonio
    Bucci, Silvia
    Caputo, Barbara
    Tommasi, Tatiana
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2224 - 2233
  • [4] Finn C, 2017, PR MACH LEARN RES, V70
  • [5] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [6] Hinton G, 2015, Arxiv, DOI [arXiv:1503.02531, DOI 10.48550/ARXIV.1503.02531]
  • [7] Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization
    Huang, Xun
    Belongie, Serge
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1510 - 1519
  • [8] Kaiyang Zhou, 2020, Computer Vision - ECCV 2020 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12361), P561, DOI 10.1007/978-3-030-58517-4_33
  • [9] Laine S, 2017, Arxiv, DOI [arXiv:1610.02242, DOI 10.48550/ARXIV.1610.02242]
  • [10] Episodic Training for Domain Generalization
    Li, Da
    Zhang, Jianshu
    Yang, Yongxin
    Liu, Cong
    Song, Yi-Zhe
    Hospedales, Timothy M.
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1446 - 1455