DATR: Unsupervised Domain Adaptive Detection Transformer With Dataset-Level Adaptation and Prototypical Alignment

被引:0
作者
Chen, Liang [1 ,2 ,3 ]
Han, Jianhong [1 ,2 ,3 ]
Wang, Yupei [1 ,2 ,3 ]
机构
[1] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
[2] Chongqing Innovat Ctr, Beijing Inst Technol, Chongqing 401135, Peoples R China
[3] Natl Key Lab Space Born Intelligent Informat Proc, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Unsupervised domain adaptation; object detection;
D O I
10.1109/TIP.2025.3527370
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the success of the DEtection TRansformer (DETR), numerous researchers have explored its effectiveness in addressing unsupervised domain adaptation tasks. Existing methods leverage carefully designed feature alignment techniques to align the backbone or encoder, yielding promising results. However, effectively aligning instance-level features within the unique decoder structure of the detector has largely been neglected. Related techniques primarily align instance-level features in a class-agnostic manner, overlooking distinctions between features from different categories, which results in only limited improvements. Furthermore, the scope of current alignment modules in the decoder is often restricted to a limited batch of images, failing to capture the dataset-level cues, thereby severely constraining the detector's generalization ability to the target domain. To this end, we introduce a strong DETR-based detector named Domain Adaptive detection TRansformer (DATR) for unsupervised domain adaptation of object detection. First, we propose the Class-wise Prototypes Alignment (CPA) module, which effectively aligns cross-domain features in a class-aware manner by bridging the gap between the object detection task and the domain adaptation task. Then, the designed Dataset-level Alignment Scheme (DAS) explicitly guides the detector to achieve global representation and enhance inter-class distinguishability of instance-level features across the entire dataset, which spans both domains, by leveraging contrastive learning. Moreover, DATR incorporates a mean-teacher-based self-training framework, utilizing pseudo-labels generated by the teacher model to further mitigate domain bias. Extensive experimental results demonstrate superior performance and generalization capabilities of our proposed DATR in multiple domain adaptation scenarios. Code is released at https://github.com/h751410234/DATR.
引用
收藏
页码:982 / 994
页数:13
相关论文
共 58 条
  • [11] Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
  • [12] Cascading Alignment for Unsupervised Domain-Adaptive DETR with Improved DeNoising Anchor Boxes
    Geng, Huantong
    Jiang, Jun
    Shen, Junye
    Hou, Mengmeng
    [J]. SENSORS, 2022, 22 (24)
  • [13] Improving Transferability for Domain Adaptive Detection Transformers
    Gong, Kaixiong
    Li, Shuang
    Li, Shugang
    Zhang, Rui
    Liu, Chi Harold
    Chen, Qiang
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1543 - 1551
  • [14] ALOFT: A Lightweight MLP-like Architecture with Dynamic Low-frequency Transform for Domain Generalization
    Guo, Jintao
    Wang, Na
    Qi, Lei
    Shi, Yinghuan
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 24132 - 24141
  • [15] Remote Sensing Teacher: Cross-Domain Detection Transformer With Learnable Frequency-Enhanced Feature Alignment in Remote Sensing Imagery
    Han, Jianhong
    Yang, Wenjie
    Wang, Yupei
    Chen, Liang
    Luo, Zhaoyi
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 14
  • [16] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [17] Bidirectional Alignment for Domain Adaptive Detection with Transformers
    He, Liqiang
    Wang, Wei
    Chen, Albert
    Sun, Min
    Kuo, Cheng-Hao
    Todorovic, Sinisa
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18729 - 18739
  • [18] Integrated Multiscale Domain Adaptive YOLO
    Hnewa, Mazin
    Radha, Hayder
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1857 - 1867
  • [19] Huang WJ, 2022, PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, P972
  • [20] Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation
    Jiang, Zhengkai
    Li, Yuxi
    Yang, Ceyuan
    Gao, Peng
    Wang, Yabiao
    Tai, Ying
    Wang, Chengjie
    [J]. COMPUTER VISION, ECCV 2022, PT XXXIV, 2022, 13694 : 36 - 54