On the Importance of Attention and Augmentations for Hypothesis Transfer in Domain Adaptation and Generalization

Cited by: 3
Authors
Thomas, Georgi [1]
Sahay, Rajat [1]
Jahan, Chowdhury Sadman [1]
Manjrekar, Mihir [1]
Popp, Dan [1]
Savakis, Andreas [1]
Affiliations
[1] Rochester Institute of Technology, Rochester, NY 14623, USA
Keywords
domain adaptation; domain generalization; vision transformers; convolutional neural networks
DOI
10.3390/s23208409
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Subject classification codes
070302; 081704
Abstract
Unsupervised domain adaptation (UDA) aims to mitigate the performance drop caused by the distribution shift between the training and testing datasets. UDA methods have achieved performance gains when models trained on a source domain with labeled data are transferred to a target domain with only unlabeled data. Convolutional neural networks (CNNs) have been the standard feature extractors in domain adaptation. Recently, attention-based transformer models have emerged as effective alternatives for computer vision tasks. In this paper, we benchmark three attention-based architectures, specifically vision transformer (ViT), shifted window transformer (SWIN), and dual attention vision transformer (DAViT), against the convolutional architectures ResNet, HRNet, and attention-based ConvNext, to assess the performance of different backbones for domain generalization and adaptation. We incorporate these backbone architectures as feature extractors in the source hypothesis transfer (SHOT) framework for UDA. SHOT leverages the knowledge learned in the source domain to align the image features of unlabeled target data in the absence of source domain data, using self-supervised deep feature clustering and self-training. We analyze the generalization and adaptation performance of these models on standard UDA datasets and aerial UDA datasets. In addition, we modernize the training procedure commonly seen in UDA tasks by adding image augmentation techniques to help models generate richer features. Our results show that ConvNext and SWIN offer the best performance, indicating that the attention mechanism is very beneficial for domain generalization and adaptation with both transformer and convolutional architectures. Our ablation study shows that our modernized training recipe, within the SHOT framework, significantly boosts performance on aerial datasets.
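The "self-supervised deep feature clustering and self-training" step mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names, the use of cosine distance, and the `rounds` parameter are assumptions made for illustration, and the toy features stand in for backbone outputs. The idea shown is that class centroids are first estimated from the target features weighted by the source model's softmax outputs, and each unlabeled target sample is then relabeled by its nearest centroid.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v + 1e-12)


def weighted_centroids(features, weights):
    """Compute one centroid per class as a weighted mean of the features.

    weights[i][k] is how strongly sample i counts toward class k.
    """
    num_classes = len(weights[0])
    dim = len(features[0])
    centroids = []
    for k in range(num_classes):
        total = sum(w[k] for w in weights) + 1e-12
        centroids.append([
            sum(w[k] * f[d] for f, w in zip(features, weights)) / total
            for d in range(dim)
        ])
    return centroids


def assign_pseudo_labels(features, centroids):
    """Label each sample with the index of its nearest centroid."""
    return [
        min(range(len(centroids)),
            key=lambda k: cosine_distance(f, centroids[k]))
        for f in features
    ]


def cluster_pseudo_labels(features, probs, rounds=1):
    """Soft-weighted centroids first, then hard-label refinement rounds."""
    num_classes = len(probs[0])
    centroids = weighted_centroids(features, probs)
    labels = assign_pseudo_labels(features, centroids)
    for _ in range(rounds):
        one_hot = [[1.0 if k == y else 0.0 for k in range(num_classes)]
                   for y in labels]
        centroids = weighted_centroids(features, one_hot)
        labels = assign_pseudo_labels(features, centroids)
    return labels


# Toy target-domain data (hypothetical): 2-D features and softmax outputs.
features = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.2, 0.8]]
probs = [[0.9, 0.1], [0.4, 0.6], [0.2, 0.8], [0.1, 0.9]]
print(cluster_pseudo_labels(features, probs))
```

In this toy input, the second sample would be assigned class 1 by a plain argmax over `probs`, but its feature lies close to the class 0 centroid, so the clustering step relabels it as class 0. Pseudo-labels of this kind would then serve as targets for self-training on the unlabeled target data.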
Pages: 22
Related papers
50 in total
  • [1] AdvST: Revisiting Data Augmentations for Single Domain Generalization
    Zheng, Guangtao
    Huai, Mengdi
    Zhang, Aidong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21832 - 21840
  • [3] Source Hypothesis Transfer for Zero-Shot Domain Adaptation
    Sakai, Tomoya
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 570 - 586
  • [4] Attention Diversification for Domain Generalization
    Meng, Rang
    Li, Xianfeng
    Chen, Weijie
    Yang, Shicai
    Song, Jie
    Wang, Xinchao
    Zhang, Lei
    Song, Mingli
    Xie, Di
    Pu, Shiliang
    COMPUTER VISION, ECCV 2022, PT XXXIV, 2022, 13694 : 322 - 340
  • [5] Attention modulates generalization of visuomotor adaptation
    Bedard, Patrick
    Song, Joo-Hyun
    JOURNAL OF VISION, 2013, 13 (12):
  • [6] Respecting Domain Relations: Hypothesis Invariance for Domain Generalization
    Wang, Ziqi
    Loog, Marco
    van Gemert, Jan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9756 - 9763
  • [7] DLOW: Domain Flow for Adaptation and Generalization
    Gong, Rui
    Li, Wen
    Chen, Yuhua
    Van Gool, Luc
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2472 - 2481
  • [8] Domain Attention Model for Domain Generalization in Object Detection
    He, Weixiong
    Zheng, Huicheng
    Lai, Jianhuang
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV, 2018, 11259 : 27 - 39
  • [9] GENERALIZATION BOUNDS FOR DOMAIN ADAPTATION VIA DOMAIN TRANSFORMATIONS
    Vural, Elif
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [10] Source Data-Absent Unsupervised Domain Adaptation Through Hypothesis Transfer and Labeling Transfer
    Liang, Jian
    Hu, Dapeng
    Wang, Yunbo
    He, Ran
    Feng, Jiashi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) : 8602 - 8617