FCNet: Stereo 3D Object Detection with Feature Correlation Networks

被引:3
|
作者
Wu, Yingyu [1 ]
Liu, Ziyan [1 ,2 ,3 ]
Chen, Yunlei [1 ]
Zheng, Xuhui [1 ]
Zhang, Qian [1 ]
Yang, Mo [1 ]
Tang, Guangming [3 ]
机构
[1] Guizhou Univ, Coll Big Data & Informat Engn, Guiyang 550025, Peoples R China
[2] Guizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
关键词
3D object detection; deep learning; stereo matching; multi-scale cost-volume; channel similarity; parallel convolutional attention;
D O I
10.3390/e24081121
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Deep-learning techniques have significantly improved object detection performance, especially with binocular images in 3D scenarios. To supervise the depth information in stereo 3D object detection, reconstructing the 3D dense depth of LiDAR point clouds causes higher computational costs and lower inference speed. After exploring the intrinsic relationship between the implicit depth information and semantic texture features of the binocular images, we propose an efficient and accurate 3D object detection algorithm, FCNet, in stereo images. First, we construct a multi-scale cost-volume containing implicit depth information using the normalized dot-product by generating multi-scale feature maps from the input stereo images. Secondly, the variant attention model enhances its global and local description, and the sparse region monitors the depth loss deep regression. Thirdly, for balancing the channel information preservation of the re-fused left-right feature maps and computational burden, a reweighting strategy is employed to enhance the feature correlation in merging the last-layer features of binocular images. Extensive experiment results on the challenging KITTI benchmark demonstrate that the proposed algorithm achieves better performance, including a lower computational cost and higher inference speed in 3D object detection.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] 3D ResNets for 3D Object Classification
    Ioannidou, Anastasia
    Chatzilari, Elisavet
    Nikolopoulos, Spiros
    Kompatsiaris, Ioannis
    MULTIMEDIA MODELING (MMM 2019), PT I, 2019, 11295 : 495 - 506
  • [32] CorDet: Corner-Aware 3D Object Detection Networks for Automated Scan-to-BIM
    Xu, Yongzhi
    Shen, Xuesong
    Lim, Samsung
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2021, 35 (03)
  • [33] 3D Object Detection for Autonomous Driving: A Comprehensive Survey
    Mao, Jiageng
    Shi, Shaoshuai
    Wang, Xiaogang
    Li, Hongsheng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (08) : 1909 - 1963
  • [34] 3D Object Detection for Autonomous Driving: A Comprehensive Survey
    Jiageng Mao
    Shaoshuai Shi
    Xiaogang Wang
    Hongsheng Li
    International Journal of Computer Vision, 2023, 131 : 1909 - 1963
  • [35] 3D Object Detection for Autonomous Driving: A Practical Survey
    Ramajo-Ballester, Alvaro
    de la Escalera Hueso, Arturo
    Armingol Moreno, Jose Maria
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON VEHICLE TECHNOLOGY AND INTELLIGENT TRANSPORT SYSTEMS, VEHITS 2023, 2023, : 64 - 73
  • [36] 3D object detection algorithms in autonomous driving: A review
    Ren K.-Y.
    Gu M.-Y.
    Yuan Z.-Q.
    Yuan S.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (04): : 865 - 889
  • [37] Robust stereo for 3D acquisition
    Menard, C
    THREE-DIMENSIONAL IMAGE CAPTURE, 1997, 3023 : 180 - 190
  • [38] Pyramid Frequency Feature Fusion Object Detection Networks
    Mao L.
    Li X.
    Yang D.
    Zhang R.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (02): : 207 - 214
  • [39] Improved 3D Object Detection Method Based on PointPillars
    Han, Zhenguo
    Li, Xu
    Xu, Hengxin
    Song, Hongzheng
    2024 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND INTELLIGENT SYSTEMS ENGINEERING, MLISE 2024, 2024, : 163 - 166
  • [40] SMS3D: 3D Synthetic Mushroom Scenes Dataset for 3D Object Detection and Pose Estimation
    Zakeri, Abdollah
    Koirala, Bikram
    Kang, Jiming
    Balan, Venkatesh
    Zhu, Weihang
    Benhaddou, Driss
    Merchant, Fatima A.
    COMPUTERS, 2025, 14 (04)