Conditional Context-Aware Feature Alignment for Domain Adaptive Detection Transformer

Cited by: 1

Authors:

Chen, Siyuan [1]

Affiliations:

[1] Univ Sci & Technol China, Hefei, Peoples R China

Source:

MULTIMEDIA MODELING (MMM 2022), PT I | 2022 / Vol. 13141

Keywords:

Unsupervised domain adaptation; Object detection; Detection transformer

DOI:

10.1007/978-3-030-98358-1_22

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Detection transformers have recently gained increasing attention due to their competitive performance and end-to-end pipeline. However, they suffer a significant performance drop when the test and training data are drawn from different distributions. Existing domain adaptive detection transformer methods adopt feature distribution alignment to alleviate the domain gap. While effective, they ignore the class semantics and the rich context preserved in the attention mechanism during adaptation, which leads to inferior performance. To tackle these problems, we propose Conditional Context-aware Feature Alignment (CCFA) for domain adaptive detection transformers. Specifically, a context-aware feature alignment module is proposed to map the high-dimensional context into a low-dimensional space, so that the rich context can be utilized for distribution alignment without optimization difficulty. Moreover, a conditional distribution alignment module is adopted to align features of the same object class from different domains, which better preserves discriminability during adaptation. Experiments on three common benchmarks demonstrate CCFA's superiority over state-of-the-art methods.


Pages: 272-283

Page count: 12
