Multi-Prototype Guided Source-Free Domain Adaptive Object Detection for Autonomous Driving

被引：2

作者：

Zhang, Siqi ^{[1
,2
]}

Zhang, Lu ^{[3
]}

Li, Guangsen ^{[3
]}

Li, Pengcheng ^{[3
]}

Liu, Zhiyong ^{[2
,3
,4
]}

机构：

[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial telligence Sy, Beijing 100045, Peoples R China

[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 101408, Peoples R China

[3] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing 100045, Peoples R China

[4] Nanjing Artificial Intelligence Res IA, Nanjing 211134, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024年 / 9卷 / 01期

关键词：

Prototypes; Labeling; Object detection; Adaptation models; Detectors; Noise measurement; Task analysis; self training; source-free domain adaptation; transfer learning;

D O I：

10.1109/TIV.2023.3337795

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Source-free domain adaptive object detection (source-free DAOD) seeks to adapt a detector pre-trained on a source domain to an unlabeled target domain without requiring access to annotated source domain data. To address challenges posed by domain shifts, current source-free DAOD approaches mainly rely on the self-training paradigm, where pseudo labels are predicted and employed to fine-tune the detector on unlabeled target domain. However, these methods often encounter issues related to intra-class variation, resulting in category-specific biases and noisy pseudo labels. In response, we present an effective Multi-Prototype Guided source-free DAOD method, dubbed MPG, consisting of two key components: multi-prototype guided pseudo labeling (MPPL) and multi-prototype guided consistency regularization (MPCR) modules. In the MPPL module, we construct category-specific multiple prototypes to better represent the category with intra-class variations. Specifically, multiple prototypes with adaptive cluster centroids are introduced for each category to effectively capture the intra-class variations. Through the implementation of the proposed MPPL module, we derive more accurate pseudo labels by assessing the proximity of instance features to multiple category prototypes. In the MPCR module, we introduce multi-level consistency regularization, including prototype-based consistency and prediction consistency, which encourages the model to overlook style perturbations and learn domain-invariant representations. Extensive experiments on five public driving datasets demonstrate that MPG outperforms existing state-of-the-art methods, showcasing its effectiveness in adapting object detectors to target domains.

引用

页码：1589 / 1601

页数：13

共 71 条

[1] Large-Scale Machine Learning with Stochastic Gradient Descent
Bottou, Leon
[J]. COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 177 - 186
[2] Exploring Object Relation in Mean Teacher for Cross-Domain Detection
Cai, Qi
Pan, Yingwei
Ngo, Chong-Wah
Tian, Xinmei
Duan, Lingyu
Yao, Ting
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11449 - 11458
[3] Chen T, 2020, PR MACH LEARN RES, V119
[4] Domain Adaptive Faster R-CNN for Object Detection in the Wild
Chen, Yuhua
Li, Wen
Sakaridis, Christos
Dai, Dengxin
Van Gool, Luc
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3339 - 3348
[5] Chu Q., 2023, arXiv, DOI [DOI 10.48550/ARXIV.2301.04265,CS, 10.48550/arXiv.2301.04265, DOI 10.48550/ARXIV.2301.04265]
[6] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
[7] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8] Unbiased Mean Teacher for Cross-domain Object Detection
Deng, Jinhong
Li, Wen
Chen, Yuhua
Duan, Lixin
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4089 - 4099
[9] A Review and Comparative Study on Probabilistic Object Detection in Autonomous Driving
Feng, Di
Harakeh, Ali
Waslander, Steven L.
Dietmayer, Klaus
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 9961 - 9980
[10] French Geoffrey, 2018, ICLR

← 1 2 3 4 5 6 7 8 →