Integrated Multiscale Domain Adaptive YOLO

被引:26
作者
Hnewa, Mazin [1 ]
Radha, Hayder [1 ]
机构
[1] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48824 USA
关键词
Detectors; Feature extraction; Object detection; Computer architecture; Training; Adaptive systems; Proposals; domain adaptation; adversarial training; domain shift; multiscale;
D O I
10.1109/TIP.2023.3255106
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The area of domain adaptation has been instrumental in addressing the domain shift problem encountered by many deep learning applications. This problem arises due to the difference between the distributions of source data used for training in comparison with target data used during realistic testing scenarios. In this paper, we introduce a novel MultiScale Domain Adaptive YOLO (MS-DAYOLO) framework that employs multiple domain adaptation paths and corresponding domain classifiers at different scales of the YOLOv4 object detector. Building on our baseline multiscale DAYOLO framework, we introduce three novel deep learning architectures for a Domain Adaptation Network (DAN) that generates domain-invariant features. In particular, we propose a Progressive Feature Reduction (PFR), a Unified Classifier (UC), and an Integrated architecture. We train and test our proposed DAN architectures in conjunction with YOLOv4 using popular datasets. Our experiments show significant improvements in object detection performance when training YOLOv4 using the proposed MS-DAYOLO architectures and when tested on target data for autonomous driving applications. Moreover, MS-DAYOLO framework achieves an order of magnitude real-time speed improvement relative to Faster R-CNN solutions while providing comparable object detection performance.
引用
收藏
页码:1857 / 1867
页数:11
相关论文
共 48 条
  • [1] Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night
    Arruda, Vinicius F.
    Paixao, Thiago M.
    Berriel, Rodrigo F.
    De Souza, Alberto F.
    Badue, Claudine
    Sebe, Nicu
    Oliveira-Santos, Thiago
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [2] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]
  • [3] Exploring Object Relation in Mean Teacher for Cross-Domain Detection
    Cai, Qi
    Pan, Yingwei
    Ngo, Chong-Wah
    Tian, Xinmei
    Duan, Lingyu
    Yao, Ting
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11449 - 11458
  • [4] Domain Adaptive Faster R-CNN for Object Detection in the Wild
    Chen, Yuhua
    Li, Wen
    Sakaridis, Christos
    Dai, Dengxin
    Van Gool, Luc
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3339 - 3348
  • [5] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [6] Dai JF, 2016, ADV NEUR IN, V29
  • [7] Borrow from Anywhere: Pseudo Multi-modal Object Detection in Thermal Imagery
    Devaguptapu, Chaitanya
    Akolekar, Ninad
    Sharma, Manuj M.
    Balasubramanian, Vineeth N.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1029 - 1038
  • [8] Domain Transfer Multiple Kernel Learning
    Duan, Lixin
    Tsang, Ivor W.
    Xu, Dong
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (03) : 465 - 479
  • [9] Everingham M, 2011, PATTERN ANAL STAT MO, P8
  • [10] Ganin Y., 2015, Domain-Adversarial Training of Neural Networks, V17, P2030, DOI DOI 10.1007/978-3-319-58347-110