Constructing 3D Object Detectors Based on Deformable Convolutional Guided Depths

被引:0
作者
Zheng, Xinwang [1 ]
Yang, Guangsong [2 ]
Yang, Lu [1 ]
Lu, Chengyu [1 ,3 ]
机构
[1] Jimei Univ, Chengyi Coll, Xiamen 361021, Peoples R China
[2] Jimei Univ, Sch Ocean Informat Engn, Xiamen 361021, Peoples R China
[3] Southwest Forestry Univ, Sch Mech & Transportat, Kunming 650000, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Three-dimensional displays; Object detection; Feature extraction; Visualization; Solid modeling; Detectors; Transformers; Accuracy; Training; Kernel; Deep learning; Autonomous vehicles; 3D object detection; deep learning; depth guidance; autonomous automobiles; model refinement;
D O I
10.1109/ACCESS.2024.3488748
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces a depth-guided 3D object detection method that enhances the feature extraction capability of the backbone network through weak supervision. It combines large kernel convolution, global response normalization, and layer normalization techniques to significantly improve feature robustness under weakly supervised conditions. Additionally, the depth estimation module's feature extraction ability is bolstered by optimizing the depth-guided encoder and incorporating large-kernel depthwise separable convolutions alongside a spatial attention mechanism. On the decoder side, deformable convolutions are employed to modulate deep feature maps, reducing inference and training time while minimizing model complexity. This approach avoids the complexity associated with transformer architectures. Experiments on the KITTI 3D dataset demonstrate that the method diminishes reliance on manual labeling and can notably enhance detection accuracy while simultaneously improving processing speed.
引用
收藏
页码:162990 / 163000
页数:11
相关论文
共 44 条
  • [1] A survey on adversarial attacks and defenses for object detection and their applications in autonomous vehicles
    Amirkhani, Abdollah
    Karimi, Mohammad Parsa
    Banitalebi-Dehkordi, Amin
    [J]. VISUAL COMPUTER, 2023, 39 (11) : 5293 - 5307
  • [2] Enhancing the Robustness of Visual Object Tracking via Style Transfer
    Amirkhani, Abdollah
    Barshooi, Amir Hossein
    Ebrahimi, Amir
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (01): : 981 - 997
  • [3] Robust Semantic Segmentation With Multi-Teacher Knowledge Distillation
    Amirkhani, Abdollah
    Khosravian, Amir
    Masih-Tehrani, Masoud
    Kashiani, Hossein
    [J]. IEEE ACCESS, 2021, 9 : 119049 - 119066
  • [4] EBCDet: Energy-Based Curriculum for Robust Domain Adaptive Object Detection
    Banitalebi-Dehkordi, Amin
    Amirkhani, Abdollah
    Mohammadinasab, Alireza
    [J]. IEEE ACCESS, 2023, 11 : 77810 - 77825
  • [5] M3D-RPN: Monocular 3D Region Proposal Network for Object Detection
    Brazil, Garrick
    Liu, Xiaoming
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9286 - 9295
  • [6] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [7] Chen XZ, 2015, ADV NEUR IN, V28
  • [8] MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships
    Chen, Yongjian
    Tai, Lei
    Sun, Kai
    Li, Mingyang
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 12090 - 12099
  • [9] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
  • [10] Fast Convergence of DETR with Spatially Modulated Co-Attention
    Gao, Peng
    Zheng, Minghang
    Wang, Xiaogang
    Dai, Jifeng
    Li, Hongsheng
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3601 - 3610