PCDR-DFF: multi-modal 3D object detection based on point cloud diversity representation and dual feature fusion

被引:0
|
作者
Xia, Chenxing [1 ,2 ,3 ]
Li, Xubing [1 ]
Gao, Xiuju [4 ]
Ge, Bin [1 ]
Li, Kuan-Ching [5 ]
Fang, Xianjin [1 ,6 ]
Zhang, Yan [7 ]
Yang, Ke [2 ]
机构
[1] Anhui Univ Sci & Technol, Coll Comp Sci & Engn, Huainan 232001, Peoples R China
[2] Inst Energy, Hefei Comprehens Natl Sci Ctr, Hefei, Anhui, Peoples R China
[3] Anhui Purvar Bigdata Technol Co Ltd, Huainan 232001, Peoples R China
[4] Anhui Univ Sci & Technol, Coll Elect & Informat Engn, Huainan, Anhui, Peoples R China
[5] Providence Univ, Dept Comp Sci & Informat Engn, Taichung, Taiwan
[6] Inst Artificial Intelligence, Hefei Comprehens Natl Sci Ctr, Hefei, Peoples R China
[7] Anhui Univ, Sch Elect & Informat Engn, Hefei, Anhui, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2024年 / 36卷 / 16期
基金
中国国家自然科学基金;
关键词
3D Object detection; Graph neural networks; Multi-modal; Point cloud;
D O I
10.1007/s00521-024-09561-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, multi-modal 3D object detection techniques based on point clouds and images have received increasing attention. However, existing methods for multi-modal feature fusion are often relatively singular, and single point cloud representation methods also have some limitations. For example, voxelization may result in the loss of fine-grained information, while 2D images lack depth information, which can restrict the accuracy of detection. Therefore, in this work, we propose a novel method for multi-modal 3D object detection based on point cloud diversity representation and dual feature fusion, PCDR-DFF, to improve the prediction accuracy of 3D object detection. Firstly, point clouds are projected to the image coordinate system and extract multi-level features of the point cloud corresponding to the image using a 2D backbone network. Then, the point clouds are jointly characterized using graphs and pillars, and the 3D features of the point clouds are extracted using graph neural networks and residual connectivity. Finally, a dual feature fusion method is designed to improve the accuracy of detection with the help of a well-designed multi-point fusion model and multi-feature fusion mechanism embedded with a spare 3D-U Net. Extensive experiments on the KITTI dataset demonstrate the effectiveness and competitiveness of our proposed models in comparison with other methods.
引用
收藏
页码:9329 / 9346
页数:18
相关论文
共 50 条
  • [1] PCDR-DFF: multi-modal 3D object detection based on point cloud diversity representation and dual feature fusion
    Chenxing Xia
    Xubing Li
    Xiuju Gao
    Bin Ge
    Kuan-Ching Li
    Xianjin Fang
    Yan Zhang
    Ke Yang
    Neural Computing and Applications, 2024, 36 : 9329 - 9346
  • [2] Dual-domain deformable feature fusion for multi-modal 3D object detection
    Wang, Shihao
    Deng, Tao
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (06)
  • [3] Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection
    Li, Xin
    Shi, Botian
    Hou, Yuenan
    Wu, Xingjiao
    Ma, Tianlong
    Li, Yikang
    He, Liang
    COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 691 - 707
  • [4] Multi-modal feature fusion for 3D object detection in the production workshop
    Hou, Rui
    Chen, Guangzhu
    Han, Yinhe
    Tang, Zaizuo
    Ru, Qingjun
    APPLIED SOFT COMPUTING, 2022, 115
  • [5] Deformable Feature Fusion Network for Multi-Modal 3D Object Detection
    Guo, Kun
    Gan, Tong
    Ding, Zhao
    Ling, Qiang
    2024 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, ARTIFICIAL INTELLIGENCE AND INTELLIGENT CONTROL, RAIIC 2024, 2024, : 363 - 367
  • [6] Frustum FusionNet: Amodal 3D Object Detection with Multi-Modal Feature Fusion
    Zuo, Liangyu
    Li, Yaochen
    Han, Mengtao
    Li, Qiao
    Liu, Yuehu
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2746 - 2751
  • [7] Research on 3D Object Detection Method Based on Multi-Modal Fusion
    Tian, Feng
    Zong, Neili
    Liu, Fang
    Lu, Yuanyuan
    Liu, Chao
    Jiang, Wenwen
    Zhao, Ling
    Han, Yuxiang
    Computer Engineering and Applications, 2024, 60 (13) : 113 - 123
  • [8] Generating Adversarial Point Clouds on Multi-modal Fusion Based 3D Object Detection Model
    Wang, Huiying
    Shen, Huixin
    Zhang, Boyang
    Wen, Yu
    Meng, Dan
    INFORMATION AND COMMUNICATIONS SECURITY (ICICS 2021), PT I, 2021, 12918 : 187 - 203
  • [9] 3D Object Detection Based on Feature Fusion of Point Cloud Sequences
    Zhai, Zhenyu
    Wang, Qiantong
    Pan, Zongxu
    Hu, Wenlong
    Hu, Yuxin
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 1240 - 1245
  • [10] Multi-Modal Fusion Based on Depth Adaptive Mechanism for 3D Object Detection
    Liu, Zhanwen
    Cheng, Juanru
    Fan, Jin
    Lin, Shan
    Wang, Yang
    Zhao, Xiangmo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 707 - 717