Multi-Prototype Guided Source-Free Domain Adaptive Object Detection for Autonomous Driving

被引:2
作者
Zhang, Siqi [1 ,2 ]
Zhang, Lu [3 ]
Li, Guangsen [3 ]
Li, Pengcheng [3 ]
Liu, Zhiyong [2 ,3 ,4 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial telligence Sy, Beijing 100045, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 101408, Peoples R China
[3] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing 100045, Peoples R China
[4] Nanjing Artificial Intelligence Res IA, Nanjing 211134, Peoples R China
来源
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024年 / 9卷 / 01期
关键词
Prototypes; Labeling; Object detection; Adaptation models; Detectors; Noise measurement; Task analysis; self training; source-free domain adaptation; transfer learning;
D O I
10.1109/TIV.2023.3337795
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Source-free domain adaptive object detection (source-free DAOD) seeks to adapt a detector pre-trained on a source domain to an unlabeled target domain without requiring access to annotated source domain data. To address challenges posed by domain shifts, current source-free DAOD approaches mainly rely on the self-training paradigm, where pseudo labels are predicted and employed to fine-tune the detector on unlabeled target domain. However, these methods often encounter issues related to intra-class variation, resulting in category-specific biases and noisy pseudo labels. In response, we present an effective Multi-Prototype Guided source-free DAOD method, dubbed MPG, consisting of two key components: multi-prototype guided pseudo labeling (MPPL) and multi-prototype guided consistency regularization (MPCR) modules. In the MPPL module, we construct category-specific multiple prototypes to better represent the category with intra-class variations. Specifically, multiple prototypes with adaptive cluster centroids are introduced for each category to effectively capture the intra-class variations. Through the implementation of the proposed MPPL module, we derive more accurate pseudo labels by assessing the proximity of instance features to multiple category prototypes. In the MPCR module, we introduce multi-level consistency regularization, including prototype-based consistency and prediction consistency, which encourages the model to overlook style perturbations and learn domain-invariant representations. Extensive experiments on five public driving datasets demonstrate that MPG outperforms existing state-of-the-art methods, showcasing its effectiveness in adapting object detectors to target domains.
引用
收藏
页码:1589 / 1601
页数:13
相关论文
共 71 条
  • [1] Large-Scale Machine Learning with Stochastic Gradient Descent
    Bottou, Leon
    [J]. COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 177 - 186
  • [2] Exploring Object Relation in Mean Teacher for Cross-Domain Detection
    Cai, Qi
    Pan, Yingwei
    Ngo, Chong-Wah
    Tian, Xinmei
    Duan, Lingyu
    Yao, Ting
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11449 - 11458
  • [3] Chen T, 2020, PR MACH LEARN RES, V119
  • [4] Domain Adaptive Faster R-CNN for Object Detection in the Wild
    Chen, Yuhua
    Li, Wen
    Sakaridis, Christos
    Dai, Dengxin
    Van Gool, Luc
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3339 - 3348
  • [5] Chu Q., 2023, arXiv, DOI [DOI 10.48550/ARXIV.2301.04265,CS, 10.48550/arXiv.2301.04265, DOI 10.48550/ARXIV.2301.04265]
  • [6] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [7] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [8] Unbiased Mean Teacher for Cross-domain Object Detection
    Deng, Jinhong
    Li, Wen
    Chen, Yuhua
    Duan, Lixin
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4089 - 4099
  • [9] A Review and Comparative Study on Probabilistic Object Detection in Autonomous Driving
    Feng, Di
    Harakeh, Ali
    Waslander, Steven L.
    Dietmayer, Klaus
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 9961 - 9980
  • [10] French Geoffrey, 2018, ICLR