Cross-Domain Soft Adaptive Teacher for Syn2Real Object Detection

被引:0
|
作者
Guo, Weijie [1 ]
He, Boyong [1 ]
Wu, Yaoyuan [1 ]
Li, Xianjiang [2 ]
Wu, Liaoni [1 ,2 ]
机构
[1] Xiamen Univ, Inst Artificial Intelligence, Xiamen 361005, Peoples R China
[2] Xiamen Univ, Sch Aerosp Engn, Xiamen 361102, Peoples R China
关键词
Unsupervised domain adaption; Synthetic to real; Object detection;
D O I
10.1007/978-981-99-8537-1_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current state-of-the-art object detectors are constructed using supervised deep-learning approaches. These approaches require a large amount of annotated training data. Although synthetic image-generation methods can provide a large amount of annotated data, unsupervised transfer of object-recognition models from synthetic to real domains is a complicated problem given the large gap between the domains. To mitigate this problem, in this paper, we propose a general synthetic-to-real cross-domain object-detection framework. In this framework, we establish a simple mean teacher model for most detectors and propose a teacher-student framework named soft adaptive teacher (SAT). This leverages domain adversarial learning and domain-adaption augmentation to address the domain gap. Specifically, we alleviate bias by augmenting training samples with image-level adaptations for the student model. Moreover, we employ feature-level adversarial training in the student model, allowing features derived from the source and target domains to share similar distributions. Finally, we introduce the soft teacher mechanism to select reliable pseudo-labels for the teacher model. By tackling the model-bias issue using these strategies, our SAT model was found to achieve average precision values of 57.2% (55.7%) on the Sim10k to Cityscape (Sim10k to BDD100k) benchmarks, 3.1 (10.4) percentage points higher than the previous state-of-the-art methods. Furthermore, we achieved an average precision of 66.2% on the dataset for object detection in aerial images (DOTA), and this is 31.2% points higher than the results from the Faster RCNN model without domain adaptation trained only with labeled source domain images.
引用
收藏
页码:460 / 472
页数:13
相关论文
共 50 条
  • [1] Cross-Domain Adaptive Teacher for Object Detection
    Li, Yu-Jhe
    Dai, Xiaoliang
    Ma, Chih-Yao
    Liu, Yen-Cheng
    Chen, Kan
    Wu, Bichen
    He, Zijian
    Kitani, Kris
    Vajda, Peter
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7571 - 7580
  • [2] Harmonious Teacher for Cross-domain Object Detection
    Deng, Jinhong
    Xu, Dongli
    Li, Wen
    Duan, Lixin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23829 - 23838
  • [3] Unbiased Mean Teacher for Cross-domain Object Detection
    Deng, Jinhong
    Li, Wen
    Chen, Yuhua
    Duan, Lixin
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4089 - 4099
  • [4] MULTISCALE DOMAIN ADAPTIVE YOLO FOR CROSS-DOMAIN OBJECT DETECTION
    Hnewa, Mazin
    Radha, Hayder
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3323 - 3327
  • [5] Cross-Domain Object Detection by Dual Adaptive Branch
    Liu, Xinyi
    Zhang, Baofeng
    Liu, Na
    SENSORS, 2023, 23 (03)
  • [6] Exploring Object Relation in Mean Teacher for Cross-Domain Detection
    Cai, Qi
    Pan, Yingwei
    Ngo, Chong-Wah
    Tian, Xinmei
    Duan, Lingyu
    Yao, Ting
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11449 - 11458
  • [7] MTTrans: Cross-domain Object Detection with Mean Teacher Transformer
    Yu, Jinze
    Liu, Jiaming
    Wei, Xiaobao
    Zhou, Haoyi
    Nakata, Yohei
    Gudovskiy, Denis
    Okuno, Tomoyuki
    Li, Jianxin
    Keutzer, Kurt
    Zhang, Shanghang
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 629 - 645
  • [8] Syn2Real Domain Generalization for Underwater Mine-Like Object Detection Using Side-Scan Sonar
    Agrawal, Aayush
    Sikdar, Aniruddh
    Makam, Rajini
    Sundaram, Suresh
    Besai, Suresh Kumar
    Gopi, Mahesh
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [9] Style-Guided Adversarial Teacher for Cross-Domain Object Detection
    Jia, Longfei
    Tian, Xianlong
    Hu, Yuguo
    Jing, Mengmeng
    Zuo, Lin
    Li, Wen
    ELECTRONICS, 2024, 13 (05)
  • [10] Syn2Real: Forgery Classification via Unsupervised Domain Adaptation
    Kumar, Akash
    Bhavsar, Arnav
    Verma, Rajesh
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW), 2020, : 63 - 70