CenterTransFuser: radar point cloud and visual information fusion for 3D object detection

被引:2
|
作者
Li, Yan [1 ]
Zeng, Kai [1 ]
Shen, Tao [1 ]
机构
[1] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-transformer; Depth threshold filtering; 3D detection; Cross-modal fusion; Contextual interaction;
D O I
10.1186/s13634-022-00944-6
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Sensor fusion is an important component of the perception system in autonomous driving, and the fusion of radar point cloud information and camera visual informa-tion can improve the perception capability of autonomous vehicles. However, most of the existing studies ignore the extraction of local neighborhood information and only consider shallow fusion between the two modalities based on the extracted global information, which cannot perform a deep fusion of cross-modal contextual informa-tion interaction. Meanwhile, in data preprocessing, the noise in radar data is usually only filtered by the depth information derived from image feature prediction, and such methods affect the accuracy of radar branching to generate regions of interest and cannot effectively filter out irrelevant information of radar points. This paper proposes the CenterTransFuser model that makes full use of millimeter-wave radar point cloud information and visual information to enable cross-modal fusion of the two heteroge-neous information. Specifically, a new interaction called cross-transformer is explored, which cooperatively exploits cross-modal cross-multiple attention and joint cross-mul-tiple attention to mine radar and image complementary information. Meanwhile, an adaptive depth thresholding filtering method is designed to reduce the noise of radar modality-independent information projected onto the image. The CenterTransFuser model is evaluated on the challenging nuScenes dataset, and it achieves excellent per-formance. Particularly, the detection accuracy is significantly improved for pedestrians, motorcycles, and bicycles, showing the superiority and effectiveness of the proposed model.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] CenterTransFuser: radar point cloud and visual information fusion for 3D object detection
    Yan Li
    Kai Zeng
    Tao Shen
    EURASIP Journal on Advances in Signal Processing, 2023
  • [2] Point cloud 3D object detection algorithm based on local information fusion
    Zhang, Linjie
    Chai, Zhilei
    Wang, Ning
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (11): : 2219 - 2229
  • [3] Point Cloud Painting for 3D Object Detection with Camera and Automotive 3+1D RADAR Fusion
    Montiel-Marin, Santiago
    Llamazares, Angel
    Antunes, Miguel
    Revenga, Pedro A.
    Bergasa, Luis M.
    SENSORS, 2024, 24 (04)
  • [4] 3D Object Detection with Fusion Point Attention Mechanism in LiDAR Point Cloud
    Liu Weili
    Zhu Deli
    Luo Huahao
    Li Yi
    ACTA PHOTONICA SINICA, 2023, 52 (09)
  • [5] 3D Object Detection Based on Deep Semantics and Position Information Fusion of Laser Point Cloud
    Hu Jie
    An Yongpeng
    Xu Wencai
    Xiong Zongquan
    Liu Han
    CHINESE JOURNAL OF LASERS-ZHONGGUO JIGUANG, 2023, 50 (10):
  • [6] 3D Point Cloud Object Detection Algorithm Based on Temporal Information Fusion and Uncertainty Estimation
    Xie, Guangda
    Li, Yang
    Wang, Yanping
    Li, Ziyi
    Qu, Hongquan
    REMOTE SENSING, 2023, 15 (12)
  • [7] 3D Object Detection Based on Feature Fusion of Point Cloud Sequences
    Zhai, Zhenyu
    Wang, Qiantong
    Pan, Zongxu
    Hu, Wenlong
    Hu, Yuxin
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 1240 - 1245
  • [8] Radar Voxel Fusion for 3D Object Detection
    Nobis, Felix
    Shafiei, Ehsan
    Karle, Phillip
    Betz, Johannes
    Lienkamp, Markus
    APPLIED SCIENCES-BASEL, 2021, 11 (12):
  • [9] Point cloud 3D object detection method based on density information-local feature fusion
    Yanjie Chen
    Feng Xu
    Guodong Chen
    Zhiqiang Liang
    Jin Li
    Multimedia Tools and Applications, 2024, 83 : 2407 - 2425
  • [10] Point cloud 3D object detection method based on density information-local feature fusion
    Chen, Yanjie
    Xu, Feng
    Chen, Guodong
    Liang, Zhiqiang
    Li, Jin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (1) : 2407 - 2425