Deformable Feature Fusion Network for Multi-Modal 3D Object Detection

Cited by: 0
Authors
Guo, Kun [1]
Gan, Tong [2]
Ding, Zhao [3]
Ling, Qiang [1]
Affiliations
[1] Univ Sci & Technol China, Dept Automat, Hefei, Peoples R China
[2] Anhui ShineAuto Autonomous Driving Technol Co Ltd, Res & Dev Dept, Hefei, Peoples R China
[3] Anhui JiangHuai Automobile Grp Co Ltd, Inst Intelligent & Networked Automobile, Hefei, Peoples R China
Source
2024 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, ARTIFICIAL INTELLIGENCE AND INTELLIGENT CONTROL, RAIIC 2024 | 2024
Keywords
3D object detection; multi-modal fusion; feature alignment; VOXELNET
DOI
10.1109/RAIIC61787.2024.10670940
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
LiDAR and cameras are two widely used sensors in 3D object detection. LiDAR point clouds capture the geometry of objects, while RGB images provide semantic information such as color and texture. Fusing their features effectively is key to improving detection performance. This paper proposes a Deformable Feature Fusion Network, which performs LiDAR-camera fusion in a flexible way. We represent multi-modal features in the bird's-eye view (BEV) and build a Deformable-Attention Fusion (DAF) module to fuse them. Besides the fusion method, feature alignment is also important in multi-modal detection: data augmentation of point clouds may change the projection relationship between RGB images and LiDAR point clouds and cause feature misalignment. We introduce a Feature Alignment Transform (FAT) module that alleviates this problem without introducing any trainable parameters. Experiments on the KITTI dataset evaluate the effectiveness of the proposed modules, and the results show that our method outperforms most existing methods.
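The misalignment described in the abstract can be illustrated with a small sketch. This is not the paper's FAT module, whose details are not given here. It only assumes that one parameter-free way to restore the point-to-pixel correspondence is to invert the augmentation before projecting. The function names, the choice of augmentations (flip, rotation about the vertical axis, global scaling), and the toy pinhole camera with LiDAR-style axes (x forward, y left, z up) are all illustrative assumptions.

```python
import math

def augment(point, angle, scale, flip_y):
    """Hypothetical LiDAR augmentation: optional flip (negate y),
    rotation about the z-axis by `angle`, then global scaling."""
    x, y, z = point
    if flip_y:
        y = -y
    c, s = math.cos(angle), math.sin(angle)
    x, y = c * x - s * y, s * x + c * y
    return (scale * x, scale * y, scale * z)

def undo_augment(point, angle, scale, flip_y):
    """Invert augment() by applying the inverse steps in reverse order.
    Uses only the known augmentation parameters, nothing learned."""
    x, y, z = (v / scale for v in point)
    c, s = math.cos(-angle), math.sin(-angle)
    x, y = c * x - s * y, s * x + c * y
    if flip_y:
        y = -y
    return (x, y, z)

def project(point, focal=700.0, cx=600.0, cy=180.0):
    """Toy pinhole projection (assumed intrinsics): x is depth,
    y points left, z points up, so u grows right and v grows down."""
    x, y, z = point
    return (cx - focal * y / x, cy - focal * z / x)

original = (10.0, 2.0, -1.0)
params = dict(angle=0.3, scale=1.05, flip_y=True)
augmented = augment(original, **params)
restored = undo_augment(augmented, **params)

print("pixel of original point: ", project(original))
print("pixel of augmented point:", project(augmented))  # lands elsewhere
print("pixel after undoing aug: ", project(restored))   # matches original
```

Projecting the augmented point directly samples the wrong image location, whereas undoing the augmentation first recovers the original pixel. Uniform scaling alone would not move the pixel in this toy setup; the rotation and flip are what break the correspondence.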
Pages: 363-367
Number of pages: 5