SETA: Semantic-Aware Edge-Guided Token Augmentation for Domain Generalization

被引:0
|
作者
Guo, Jintao [1 ,2 ]
Qi, Lei [3 ,4 ]
Shi, Yinghuan [1 ,2 ]
Gao, Yang [1 ,2 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[2] Nanjing Univ, Natl Inst Healthcare Data Sci, Nanjing 210023, Peoples R China
[3] Southeast Univ, Sch Comp Sci & Engn, Nanjing 211189, Peoples R China
[4] Southeast Univ, Key Lab New Generat Artificial Intelligence Techno, Minist Educ, Nanjing 211189, Peoples R China
基金
中国博士后科学基金;
关键词
Shape; Perturbation methods; Image edge detection; Robustness; Training; Noise; Feature extraction; Data models; Data augmentation; Transformers; Domain generalization; shape bias; spurious edge augmentation; vision transformer;
D O I
10.1109/TIP.2024.3470517
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Domain generalization (DG) aims to enhance the model robustness against domain shifts without accessing target domains. A prevalent category of methods for DG is data augmentation, which focuses on generating virtual samples to simulate domain shifts. However, existing augmentation techniques in DG are mainly tailored for convolutional neural networks (CNNs), with limited exploration in token-based architectures, i.e., vision transformer (ViT) and multi-layer perceptrons (MLP) models. In this paper, we study the impact of prior CNN-based augmentation methods on token-based models, revealing their performance is suboptimal due to the lack of incentivizing the model to learn holistic shape information. To tackle the issue, we propose the Semantic-aware Edge-guided Token Augmentation (SETA) method. SETA transforms token features by perturbing local edge cues while preserving global shape features, thereby enhancing the model learning of shape information. To further enhance the generalization ability of the model, we introduce two stylized variants of our method combined with two state-of-the-art (SOTA) style augmentation methods in DG. We provide a theoretical insight into our method, demonstrating its effectiveness in reducing the generalization risk bound. Comprehensive experiments on five benchmarks prove that our method achieves SOTA performances across various ViT and MLP architectures. Our code is available at https://github.com/lingeringlight/SETA.
引用
收藏
页码:5622 / 5636
页数:15
相关论文
共 15 条
  • [1] Cross-Domain Detection Transformer Based on Spatial-Aware and Semantic-Aware Token Alignment
    Deng, Jinhong
    Zhang, Xiaoyue
    Li, Wen
    Duan, Lixin
    Xu, Dong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5234 - 5245
  • [2] NormAUG: Normalization-Guided Augmentation for Domain Generalization
    Qi, Lei
    Yang, Hongpeng
    Shi, Yinghuan
    Geng, Xin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1419 - 1431
  • [3] Inter-Class and Inter-Domain Semantic Augmentation for Domain Generalization
    Wang, Mengzhu
    Liu, Yuehua
    Yuan, Jianlong
    Wang, Shanshan
    Wang, Zhibin
    Wang, Wei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1338 - 1347
  • [4] Semantic-Aware Message Broadcasting for Efficient Unsupervised Domain Adaptation
    Li, Xin
    Lan, Cuiling
    Wei, Guoqiang
    Chen, Zhibo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5340 - 5353
  • [5] Bash comment generation via data augmentation and semantic-aware CodeBERT
    Shen, Yiheng
    Ju, Xiaolin
    Chen, Xiang
    Yang, Guang
    AUTOMATED SOFTWARE ENGINEERING, 2024, 31 (01)
  • [6] Style Augmentation and Domain-Aware Parametric Contrastive Learning for Domain Generalization
    Li, Mingkang
    Zhang, Jiali
    Zhang, Wen
    Gong, Lu
    Zhang, Zili
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2023, 2023, 14120 : 211 - 224
  • [7] Bash comment generation via data augmentation and semantic-aware CodeBERT
    Yiheng Shen
    Xiaolin Ju
    Xiang Chen
    Guang Yang
    Automated Software Engineering, 2024, 31
  • [8] Source-Guided Adversarial Learning and Data Augmentation for Domain Generalization
    Chen Y.
    Lin T.-W.
    Hsu C.-T.
    SN Computer Science, 2020, 1 (6)
  • [9] Concept-guided domain generalization for semantic segmentation
    Liao, Muxin
    Li, Wei
    Yin, Chengle
    Jin, Yuling
    Peng, Yingqiong
    PATTERN RECOGNITION, 2025, 164
  • [10] Semantic Data Augmentation based Distance Metric Learning for Domain Generalization
    Wang, Mengzhu
    Yuan, Jianlong
    Qian, Qi
    Wang, Zhibin
    Li, Hao
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3214 - 3223