DATR: Unsupervised Domain Adaptive Detection Transformer With Dataset-Level Adaptation and Prototypical Alignment

被引：0

作者：

Chen, Liang ^{[1
,2
,3
]}

Han, Jianhong ^{[1
,2
,3
]}

Wang, Yupei ^{[1
,2
,3
]}

机构：

[1] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China

[2] Chongqing Innovat Ctr, Beijing Inst Technol, Chongqing 401135, Peoples R China

[3] Natl Key Lab Space Born Intelligent Informat Proc, Beijing 100081, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2025年 / 34卷

基金：

中国国家自然科学基金;

关键词：

Unsupervised domain adaptation; object detection;

D O I：

10.1109/TIP.2025.3527370

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the success of the DEtection TRansformer (DETR), numerous researchers have explored its effectiveness in addressing unsupervised domain adaptation tasks. Existing methods leverage carefully designed feature alignment techniques to align the backbone or encoder, yielding promising results. However, effectively aligning instance-level features within the unique decoder structure of the detector has largely been neglected. Related techniques primarily align instance-level features in a class-agnostic manner, overlooking distinctions between features from different categories, which results in only limited improvements. Furthermore, the scope of current alignment modules in the decoder is often restricted to a limited batch of images, failing to capture the dataset-level cues, thereby severely constraining the detector's generalization ability to the target domain. To this end, we introduce a strong DETR-based detector named Domain Adaptive detection TRansformer (DATR) for unsupervised domain adaptation of object detection. First, we propose the Class-wise Prototypes Alignment (CPA) module, which effectively aligns cross-domain features in a class-aware manner by bridging the gap between the object detection task and the domain adaptation task. Then, the designed Dataset-level Alignment Scheme (DAS) explicitly guides the detector to achieve global representation and enhance inter-class distinguishability of instance-level features across the entire dataset, which spans both domains, by leveraging contrastive learning. Moreover, DATR incorporates a mean-teacher-based self-training framework, utilizing pseudo-labels generated by the teacher model to further mitigate domain bias. Extensive experimental results demonstrate superior performance and generalization capabilities of our proposed DATR in multiple domain adaptation scenarios. Code is released at https://github.com/h751410234/DATR.

引用

页码：982 / 994

页数：13

共 58 条

[11] Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[12] Cascading Alignment for Unsupervised Domain-Adaptive DETR with Improved DeNoising Anchor Boxes
Geng, Huantong
Jiang, Jun
Shen, Junye
Hou, Mengmeng
[J]. SENSORS, 2022, 22 (24)
[13] Improving Transferability for Domain Adaptive Detection Transformers
Gong, Kaixiong
Li, Shuang
Li, Shugang
Zhang, Rui
Liu, Chi Harold
Chen, Qiang
[J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 1543 - 1551
[14] ALOFT: A Lightweight MLP-like Architecture with Dynamic Low-frequency Transform for Domain Generalization
Guo, Jintao
Wang, Na
Qi, Lei
Shi, Yinghuan
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 24132 - 24141
[15] Remote Sensing Teacher: Cross-Domain Detection Transformer With Learnable Frequency-Enhanced Feature Alignment in Remote Sensing Imagery
Han, Jianhong
Yang, Wenjie
Wang, Yupei
Chen, Liang
Luo, Zhaoyi
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 14
[16] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[17] Bidirectional Alignment for Domain Adaptive Detection with Transformers
He, Liqiang
Wang, Wei
Chen, Albert
Sun, Min
Kuo, Cheng-Hao
Todorovic, Sinisa
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18729 - 18739
[18] Integrated Multiscale Domain Adaptive YOLO
Hnewa, Mazin
Radha, Hayder
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1857 - 1867
[19] Huang WJ, 2022, PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, P972
[20] Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation
Jiang, Zhengkai
Li, Yuxi
Yang, Ceyuan
Gao, Peng
Wang, Yabiao
Tai, Ying
Wang, Chengjie
[J]. COMPUTER VISION, ECCV 2022, PT XXXIV, 2022, 13694 : 36 - 54

← 1 2 3 4 5 6 →