On the Importance of Attention and Augmentations for Hypothesis Transfer in Domain Adaptation and Generalization

Cited by: 3

Authors:

Thomas, Georgi [1]

Sahay, Rajat [1]

Jahan, Chowdhury Sadman [1]

Manjrekar, Mihir [1]

Popp, Dan [1]

Savakis, Andreas [1]

Affiliations:

[1] Rochester Inst Technol, Rochester, NY 14623 USA

Source:

SENSORS | 2023 / Vol. 23 / Issue 20

Keywords:

domain adaptation; domain generalization; vision transformers; convolutional neural networks

DOI:

10.3390/s23208409

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry]

Discipline classification codes:

070302; 081704

Abstract:

Unsupervised domain adaptation (UDA) aims to mitigate the performance drop due to the distribution shift between the training and testing datasets. UDA methods have achieved performance gains for models trained on a source domain with labeled data to a target domain with only unlabeled data. The standard feature extraction method in domain adaptation has been convolutional neural networks (CNNs). Recently, attention-based transformer models have emerged as effective alternatives for computer vision tasks. In this paper, we benchmark three attention-based architectures, specifically vision transformer (ViT), shifted window transformer (SWIN), and dual attention vision transformer (DAViT), against convolutional architectures ResNet, HRNet and attention-based ConvNext, to assess the performance of different backbones for domain generalization and adaptation. We incorporate these backbone architectures as feature extractors in the source hypothesis transfer (SHOT) framework for UDA. SHOT leverages the knowledge learned in the source domain to align the image features of unlabeled target data in the absence of source domain data, using self-supervised deep feature clustering and self-training. We analyze the generalization and adaptation performance of these models on standard UDA datasets and aerial UDA datasets. In addition, we modernize the training procedure commonly seen in UDA tasks by adding image augmentation techniques to help models generate richer features. Our results show that ConvNext and SWIN offer the best performance, indicating that the attention mechanism is very beneficial for domain generalization and adaptation with both transformer and convolutional architectures. Our ablation study shows that our modernized training recipe, within the SHOT framework, significantly boosts performance on aerial datasets.
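The abstract mentions that SHOT adapts to unlabeled target data through self-supervised deep feature clustering and self-training. The following is a minimal pure-Python sketch of one common way such clustering-based pseudo-labeling can be done: class centroids are built from target features weighted by the source model's softmax outputs, and each sample is then relabeled by its nearest centroid under cosine similarity. It is an illustration under stated assumptions, not the authors' implementation; the function names (`softmax`, `cosine`, `weighted_centroids`, `pseudo_label`), the `rounds` parameter, and the toy inputs are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over one list of class scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cosine(u, v):
    """Cosine similarity between two vectors (small epsilon avoids 0/0)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv + 1e-12)


def weighted_centroids(features, weights, num_classes):
    """One centroid per class: the weight-averaged feature vector."""
    dim = len(features[0])
    centroids = []
    for k in range(num_classes):
        total = sum(w[k] for w in weights) + 1e-12
        centroids.append([
            sum(w[k] * f[d] for f, w in zip(features, weights)) / total
            for d in range(dim)
        ])
    return centroids


def pseudo_label(features, logits, rounds=2):
    """Assign pseudo-labels to unlabeled target samples.

    Assumption: `features` come from a backbone (e.g. a CNN or transformer)
    and `logits` from the source-trained classifier head on the same samples.
    """
    num_classes = len(logits[0])
    # Round 1 uses soft predictions as weights; later rounds use hard labels.
    weights = [softmax(z) for z in logits]
    labels = []
    for _ in range(rounds):
        centroids = weighted_centroids(features, weights, num_classes)
        labels = [
            max(range(num_classes), key=lambda k: cosine(f, centroids[k]))
            for f in features
        ]
        weights = [
            [1.0 if k == y else 0.0 for k in range(num_classes)]
            for y in labels
        ]
    return labels


# Toy 2-D features forming two clusters, with two low-confidence predictions.
toy_features = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.2, 0.9]]
toy_logits = [[2.0, 0.0], [0.1, 0.0], [0.0, 2.0], [0.0, 0.1]]
toy_labels = pseudo_label(toy_features, toy_logits)
```

In a self-training loop, labels produced this way would typically serve as targets for a cross-entropy term while the feature extractor is fine-tuned on the target domain; how the paper weights or combines such terms is not stated in the abstract.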

Pages: 22

