On the Importance of Attention and Augmentations for Hypothesis Transfer in Domain Adaptation and Generalization

被引：3

作者：

Thomas, Georgi ^{[1
]}

Sahay, Rajat ^{[1
]}

Jahan, Chowdhury Sadman ^{[1
]}

Manjrekar, Mihir ^{[1
]}

Popp, Dan ^{[1
]}

Savakis, Andreas ^{[1
]}

机构：

[1] Rochester Inst Technol, Rochester, NY 14623 USA

来源：

SENSORS | 2023年 / 23卷 / 20期

关键词：

domain adaptation; domain generalization; vision transformers; convolutional neural networks;

D O I：

10.3390/s23208409

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Unsupervised domain adaptation (UDA) aims to mitigate the performance drop due to the distribution shift between the training and testing datasets. UDA methods have achieved performance gains for models trained on a source domain with labeled data to a target domain with only unlabeled data. The standard feature extraction method in domain adaptation has been convolutional neural networks (CNNs). Recently, attention-based transformer models have emerged as effective alternatives for computer vision tasks. In this paper, we benchmark three attention-based architectures, specifically vision transformer (ViT), shifted window transformer (SWIN), and dual attention vision transformer (DAViT), against convolutional architectures ResNet, HRNet and attention-based ConvNext, to assess the performance of different backbones for domain generalization and adaptation. We incorporate these backbone architectures as feature extractors in the source hypothesis transfer (SHOT) framework for UDA. SHOT leverages the knowledge learned in the source domain to align the image features of unlabeled target data in the absence of source domain data, using self-supervised deep feature clustering and self-training. We analyze the generalization and adaptation performance of these models on standard UDA datasets and aerial UDA datasets. In addition, we modernize the training procedure commonly seen in UDA tasks by adding image augmentation techniques to help models generate richer features. Our results show that ConvNext and SWIN offer the best performance, indicating that the attention mechanism is very beneficial for domain generalization and adaptation with both transformer and convolutional architectures. Our ablation study shows that our modernized training recipe, within the SHOT framework, significantly boosts performance on aerial datasets.

引用

页数：22

共 50 条

[21] On generalization in moment-based domain adaptation
Zellinger, Werner
Moser, Bernhard A.
Saminger-Platz, Susanne
ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2021, 89 (3-4) : 333 - 369
[22] Style Normalization and Restitution for Domain Generalization and Adaptation
Jin, Xin
Lan, Cuiling
Zeng, Wenjun
Chen, Zhibo
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 : 3636 - 3651
[23] On generalization in moment-based domain adaptation
Werner Zellinger
Bernhard A. Moser
Susanne Saminger-Platz
Annals of Mathematics and Artificial Intelligence, 2021, 89 : 333 - 369
[24] Domain Generalization by Marginal Transfer Learning
Blanchard, Gilles
Deshmukh, Aniket Anand
Dogan, Urun
Lee, Gyemin
Scott, Clayton
JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
[25] On the Importance of Domain Adaptation in Texture Classification
Caputo, Barbara
Cusano, Claudio
Lanzi, Martina
Napoletano, Paolo
Schettini, Raimondo
IMAGE ANALYSIS AND PROCESSING,(ICIAP 2017), PT I, 2017, 10484 : 380 - 390
[26] Collaborative Contrastive Learning for Hypothesis Domain Adaptation
Chien, Jen-Tzung
Yeh, I-Ping
Mak, Man-Wai
INTERSPEECH 2024, 2024, : 3225 - 3229
[27] Improving domain generalization by hybrid domain attention and localized maximum sensitivity
Ng, Wing W. Y.
Zhang, Qin
Zhong, Cankun
Zhang, Jianjun
NEURAL NETWORKS, 2024, 171 : 320 - 331
[28] CNNs with Multi-Level Attention for Domain Generalization
Ballas, Aristotelis
Diou, Cristos
PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 592 - 596
[29] Domain Adaptation and Generalization: A Low-Complexity Approach
Niemeijer, Joshua
Schaefer, Joerg P.
CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1081 - 1091
[30] Alleviating the generalization issue in adversarial domain adaptation networks
Zhe, Xiao
Du, Zhekai
Lou, Chunwei
Li, Jingjing
IMAGE AND VISION COMPUTING, 2023, 135

← 1 2 3 4 5 →