Constructing 3D Object Detectors Based on Deformable Convolutional Guided Depths

Cited by: 0

Authors:

Zheng, Xinwang ^{[1]}

Yang, Guangsong ^{[2]}

Yang, Lu ^{[1]}

Lu, Chengyu ^{[1,3]}

Affiliations:

[1] Jimei Univ, Chengyi Coll, Xiamen 361021, Peoples R China

[2] Jimei Univ, Sch Ocean Informat Engn, Xiamen 361021, Peoples R China

[3] Southwest Forestry Univ, Sch Mech & Transportat, Kunming 650000, Peoples R China

Source:

IEEE ACCESS | 2024 / Volume 12

Keywords:

Three-dimensional displays; Object detection; Feature extraction; Visualization; Solid modeling; Detectors; Transformers; Accuracy; Training; Kernel; Deep learning; Autonomous vehicles; 3D object detection; deep learning; depth guidance; autonomous automobiles; model refinement;

DOI:

10.1109/ACCESS.2024.3488748

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

This paper introduces a depth-guided 3D object detection method that enhances the feature extraction capability of the backbone network through weak supervision. It combines large kernel convolution, global response normalization, and layer normalization techniques to significantly improve feature robustness under weakly supervised conditions. Additionally, the depth estimation module's feature extraction ability is bolstered by optimizing the depth-guided encoder and incorporating large-kernel depthwise separable convolutions alongside a spatial attention mechanism. On the decoder side, deformable convolutions are employed to modulate deep feature maps, reducing inference and training time while minimizing model complexity. This approach avoids the complexity associated with transformer architectures. Experiments on the KITTI 3D dataset demonstrate that the method diminishes reliance on manual labeling and can notably enhance detection accuracy while simultaneously improving processing speed.
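
The abstract names deformable convolution as the decoder-side operation but gives no implementation detail. The following is a minimal, generic sketch of how a deformable convolution samples its input, not the authors' decoder: each kernel tap reads the feature map at its regular grid position plus a fractional offset, using bilinear interpolation. The function names `bilinear_sample` and `deformable_conv_at`, the single-channel 3x3 setting, and the hand-written offsets are assumptions made for illustration; in a trained network the offsets would be predicted by a separate convolution branch.

```python
import math


def bilinear_sample(fmap, y, x):
    """Sample a 2D feature map at fractional (y, x); positions outside the map count as zero."""
    h, w = len(fmap), len(fmap[0])
    y0, x0 = math.floor(y), math.floor(x)
    value = 0.0
    # Blend the four integer neighbours, weighted by their distance to (y, x).
    for yy in (y0, y0 + 1):
        for xx in (x0, x0 + 1):
            if 0 <= yy < h and 0 <= xx < w:
                weight = (1.0 - abs(y - yy)) * (1.0 - abs(x - xx))
                value += weight * fmap[yy][xx]
    return value


def deformable_conv_at(fmap, kernel, offsets, cy, cx):
    """Evaluate a 3x3 deformable convolution at output location (cy, cx).

    offsets[i][j] is a hypothetical (dy, dx) pair added to the regular
    grid position of kernel tap (i, j).
    """
    out = 0.0
    for i in range(3):
        for j in range(3):
            dy, dx = offsets[i][j]
            sample_y = cy + (i - 1) + dy
            sample_x = cx + (j - 1) + dx
            out += kernel[i][j] * bilinear_sample(fmap, sample_y, sample_x)
    return out


# Illustrative data: a 4x4 ramp and a 3x3 averaging kernel.
fmap = [[float(r * 4 + c) for c in range(4)] for r in range(4)]
kernel = [[1.0 / 9.0] * 3 for _ in range(3)]
zero_offsets = [[(0.0, 0.0)] * 3 for _ in range(3)]
shifted_offsets = [[(0.0, 0.5)] * 3 for _ in range(3)]

print(deformable_conv_at(fmap, kernel, zero_offsets, 1, 1))
print(deformable_conv_at(fmap, kernel, shifted_offsets, 1, 1))
```

With all offsets set to zero the operation reduces to an ordinary 3x3 convolution; shifting every tap half a pixel to the right moves the sampled window accordingly, which is the mechanism that lets the kernel adapt its receptive field to the content.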

Pages: 162990-163000

Page count: 11

References: 44 in total
