Multi-Scale Spatial Transformer Network for LiDAR-Camera 3D Object Detection

被引:0
|
作者
Wang, Zhifan [1 ]
Zhang, Xiaohong [1 ]
Wang, Shidong [2 ]
Xin, Tong [2 ]
Zhang, Haofeng [1 ]
Lu, Jianfeng [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Newcastle Univ, Sch Engn, Newcastle Upon Tyne, Tyne & Wear, England
来源
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2021年
基金
中国国家自然科学基金;
关键词
3D object Detection; LiDAR-Camera Fusion; Projective Transformer Network; Atrous Spatial Pyramid Pooling;
D O I
10.1109/IJCNN52387.2021.9533588
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate 3D object detection has recently aroused interest in the context of emerging autonomous driving technologies. Existing approaches predominantly use LiDAR-Camera fusion method to fulfill this challenging task, while neglecting the fact that LiDAR and camera data are spatially correlated, and cannot well retain the edge information. To solve these problems, in this paper, we propose a novel LiDAR-Camera 3D object detection method, namely the Multi-scale Spatial Transformer Network (MST-Net). The proposed method exploits an innovative spatial alignment scheme based on the projection transformer network (PTN) to mitigate the effects of the perspective view caused by sensors. In the process of generating 3D bounding boxes, the Atrous Spatial Pyramid Pool (ASPP) is applied to spatially aligned fusion features in order to preserve edge information to the greatest extent. Extensive experiments are conducted on the popular dataset KITTI, and the results can demonstrate the superiority of the proposed method. In addition, the effectiveness of these two strategies has been illustrated in ablation studies.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] LiDAR-camera fusion: Dual transformer enhancement for 3D object detection
    Chen, Mu
    Liu, Pengfei
    Zhao, Huaici
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
  • [2] URFormer: Unified Representation LiDAR-Camera 3D Object Detection with Transformer
    Zhang, Guoxin
    Xie, Jun
    Liu, Lin
    Wang, Zhepeng
    Yang, Kuihe
    Song, Ziying
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 401 - 413
  • [3] A LiDAR-Camera Fusion 3D Object Detection Algorithm
    Liu, Leyuan
    He, Jian
    Ren, Keyan
    Xiao, Zhonghua
    Hou, Yibin
    INFORMATION, 2022, 13 (04)
  • [4] SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection
    Qin, Yiran
    Wang, Chaoqun
    Kang, Zijian
    Ma, Ningning
    Li, Zhen
    Zhang, Ruimao
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21957 - 21967
  • [5] DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection
    Li, Yingwei
    Yu, Adams Wei
    Meng, Tianjian
    Caine, Ben
    Ngiam, Jiquan
    Peng, Daiyi
    Shen, Junyang
    Lu, Yifeng
    Zhou, Denny
    Le, Quoc, V
    Yuille, Alan
    Tan, Mingxing
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17161 - 17170
  • [6] Rethinking the Late Fusion of LiDAR-Camera Based 3D Object Detection
    Yu, Lehang
    Zhang, Jing
    Liu, Zhong
    Yue, Haosong
    Chen, Weihai
    2024 IEEE 19TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ICIEA 2024, 2024,
  • [7] MSGFusion: Muti-scale Semantic Guided LiDAR-Camera Fusion for 3D Object Detection
    Zhu, Huming
    Xue, Yiyu
    Cheng, Xinyue
    Hou, Biao
    2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,
  • [8] TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers
    Bai, Xuyang
    Hu, Zeyu
    Zhu, Xinge
    Huang, Qingqiu
    Chen, Yilun
    Fu, Hangbo
    Tai, Chiew-Lan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1080 - 1089
  • [9] Detecting as Labeling: Rethinking LiDAR-Camera Fusion in 3D Object Detection
    Huang, Junjie
    Ye, Yun
    Liang, Zhujin
    Shan, Yi
    Du, Dalong
    COMPUTER VISION-ECCV 2024, PT XXII, 2025, 15080 : 439 - 455
  • [10] Cross-Supervised LiDAR-Camera Fusion for 3D Object Detection
    Zuo, Chao Jie
    Gu, Cao Yu
    Guo, Yi Kun
    Miao, Xiao Dong
    IEEE ACCESS, 2025, 13 : 10447 - 10458