Integrated Multiscale Domain Adaptive YOLO

Cited by: 26

Authors:

Hnewa, Mazin [1]

Radha, Hayder [1]

Affiliations:

[1] Michigan State Univ, Dept Elect & Comp Engn, E Lansing, MI 48824 USA

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2023 / Vol. 32

Keywords:

Detectors; Feature extraction; Object detection; Computer architecture; Training; Adaptive systems; Proposals; domain adaptation; adversarial training; domain shift; multiscale;

DOI:

10.1109/TIP.2023.3255106

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The area of domain adaptation has been instrumental in addressing the domain shift problem encountered by many deep learning applications. This problem arises from the difference between the distribution of the source data used for training and that of the target data encountered in realistic testing scenarios. In this paper, we introduce a novel MultiScale Domain Adaptive YOLO (MS-DAYOLO) framework that employs multiple domain adaptation paths and corresponding domain classifiers at different scales of the YOLOv4 object detector. Building on our baseline multiscale DAYOLO framework, we introduce three novel deep learning architectures for a Domain Adaptation Network (DAN) that generates domain-invariant features. In particular, we propose a Progressive Feature Reduction (PFR), a Unified Classifier (UC), and an Integrated architecture. We train and test our proposed DAN architectures in conjunction with YOLOv4 using popular datasets. Our experiments show significant improvements in object detection performance when YOLOv4 is trained using the proposed MS-DAYOLO architectures and tested on target data for autonomous driving applications. Moreover, the MS-DAYOLO framework achieves an order of magnitude real-time speed improvement relative to Faster R-CNN solutions while providing comparable object detection performance.
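
The abstract does not give the loss formulation, so the following is only a minimal sketch of how domain classifiers at several detector scales could feed an adversarial objective. It assumes a binary cross-entropy domain loss per scale and a weighted min-max combination with the detection loss, which is the common pattern in adversarial domain adaptation and not necessarily the authors' exact method. The function names, the weight `lam`, and the use of three scales are illustrative assumptions.

```python
import math

def binary_cross_entropy(p, label, eps=1e-7):
    """BCE for one predicted probability p that a feature comes from the target domain."""
    p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(label * math.log(p) + (1 - label) * math.log(1.0 - p))

def multiscale_domain_loss(probs_per_scale, domain_label):
    """Sum, over scales, of the mean BCE of that scale's domain-classifier outputs.

    probs_per_scale: one list of probabilities per scale (assumed here: three
    scales, one per detection head). domain_label: 0 = source, 1 = target.
    """
    total = 0.0
    for probs in probs_per_scale:
        total += sum(binary_cross_entropy(p, domain_label) for p in probs) / len(probs)
    return total

def backbone_objective(detection_loss, domain_loss, lam=0.1):
    """Assumed adversarial objective for the feature extractor.

    The domain classifiers minimize domain_loss, while the feature extractor
    is pushed to increase it (hence the minus sign), encouraging features the
    classifiers cannot tell apart. lam is a hypothetical trade-off weight.
    """
    return detection_loss - lam * domain_loss

# Hypothetical classifier outputs at three scales for one target-domain image.
probs = [[0.5, 0.5], [0.5], [0.5, 0.5, 0.5]]
loss = multiscale_domain_loss(probs, domain_label=1)
objective = backbone_objective(detection_loss=1.0, domain_loss=loss, lam=0.1)
```

When every classifier outputs 0.5, it cannot distinguish the domains, and each scale contributes ln 2 to the domain loss; this is the state the feature extractor is driven toward in this kind of setup.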

Citation

Pages: 1857-1867

Page count: 11

