Domain Adaptation of Anchor-Free object detection for urban traffic

被引:14
作者
Yu, Xiaoyong [1 ,2 ]
Lu, Xiaoqiang [3 ]
机构
[1] Chinese Acad Sci, Key Lab Spectral Imaging Technol CAS, Xian Inst Opt & Precis Mech, Xian 710119, Shaanxi, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350108, Peoples R China
基金
中国国家自然科学基金;
关键词
Urban traffic; Domain adaptation; Object detection;
D O I
10.1016/j.neucom.2024.127477
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern detectors are mostly trained under single and limited conditions. However, object detection faces various complex and open situations in autonomous driving, especially in urban street scenes with dense objects and complex backgrounds. Due to the shift in data distribution, modern detectors cannot perform well in actual urban environments. Using domain adaptation to improve detection performance is one of the key methods to extend object detection from limited situations to open situations. To this end, this article proposes a Domain Adaptation of Anchor -Free object detection (DAAF) for urban traffic. DAAF is a crossdomain object detection method that performs feature alignment including two aspects. On the one hand, we designed a fully convolutional adversarial training method for global feature alignment at the image level. Meanwhile, images can generally be decomposed into structural information and texture information. In urban street scenes, the structural information of images is generally similar. The main difference between the source domain and the target domain is texture information. Therefore, during global feature alignment, this paper proposes a method called texture information limitation (TIL). On the other hand, in order to solve the problem of variable aspect ratios of objects in urban street scenes, this article uses an anchor -free detector as the baseline detector. Since the anchor -free object detector can obtain neither explicit nor implicit instance -level features, we adopt Pixel -Level Adaptation (PLA) to align local features instead of instance -level alignment for local features. The size of the object has the greatest impact on the final detection effect, and the object scale in urban scenes is relatively rich. Guided by the differentiation of attention mechanisms, a multi -level adversarial network is designed to perform feature alignment of the output space at different feature levels called Scale Information Limitation (SIL). We conducted cross -domain detection experiments by using various urban streetscape autonomous driving object detection datasets, including adverse weather conditions, synthetic data to real data, and cross -camera adaptation. The experimental results indicate that the method proposed in this article is effective.
引用
收藏
页数:15
相关论文
共 56 条
[1]   Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night [J].
Arruda, Vinicius F. ;
Paixao, Thiago M. ;
Berriel, Rodrigo F. ;
De Souza, Alberto F. ;
Badue, Claudine ;
Sebe, Nicu ;
Oliveira-Santos, Thiago .
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[2]   A theory of learning from different domains [J].
Ben-David, Shai ;
Blitzer, John ;
Crammer, Koby ;
Kulesza, Alex ;
Pereira, Fernando ;
Vaughan, Jennifer Wortman .
MACHINE LEARNING, 2010, 79 (1-2) :151-175
[3]   Simultaneous structure and texture image inpainting [J].
Bertalmio, M ;
Vese, L ;
Sapiro, G ;
Osher, S .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2003, 12 (08) :882-889
[4]   Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks [J].
Bousmalis, Konstantinos ;
Silberman, Nathan ;
Dohan, David ;
Erhan, Dumitru ;
Krishnan, Dilip .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :95-104
[5]   Exploring Object Relation in Mean Teacher for Cross-Domain Detection [J].
Cai, Qi ;
Pan, Yingwei ;
Ngo, Chong-Wah ;
Tian, Xinmei ;
Duan, Lingyu ;
Yao, Ting .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11449-11458
[6]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[7]   Relation Matters: Foreground-Aware Graph-Based Relational Reasoning for Domain Adaptive Object Detection [J].
Chen, Chaoqi ;
Li, Jiongcheng ;
Zhou, Hong-Yu ;
Han, Xiaoguang ;
Huang, Yue ;
Ding, Xinghao ;
Yu, Yizhou .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) :3677-3694
[8]   I3Net: Implicit Instance-Invariant Network for Adapting One-Stage Object Detectors [J].
Chen, Chaoqi ;
Zheng, Zebiao ;
Huang, Yue ;
Ding, Xinghao ;
Yu, Yizhou .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12571-12580
[9]   Harmonizing Transferability and Discriminability for Adapting Object Detectors [J].
Chen, Chaoqi ;
Zheng, Zebiao ;
Ding, Xinghao ;
Huang, Yue ;
Dou, Qi .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8866-8875
[10]   Image-denoising algorithm based on improved K-singular value decomposition and atom optimization [J].
Chen, Rui ;
Pu, Dong ;
Tong, Ying ;
Wu, Minghu .
CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2022, 7 (01) :117-127