PPF-Net: Efficient Multimodal 3D Object Detection with Pillar-Point Fusion

Authors
Zhang, Lingxiao [1]
Li, Changyong [1]
Affiliations
[1] Xinjiang Univ, Coll Mech Engn, Urumqi 830017, Peoples R China
Source
ELECTRONICS | 2025 / Vol. 14 / Issue 04
Keywords
3D object detection; cross-modal data augmentation; sensor fusion; joint regression loss function
DOI
10.3390/electronics14040685
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Detecting objects in 3D space using LiDAR is crucial for robotics and autonomous vehicles, but the sparsity of LiDAR-generated point clouds limits performance. Camera images, rich in semantic information, can effectively compensate for this limitation. We propose a simple yet effective multimodal fusion framework that enhances 3D object detection without complex network designs. We introduce a cross-modal GT-Paste data augmentation method to address challenges such as 2D object acquisition and occlusions caused by added objects. To better integrate image features with sparse point clouds, we propose Pillar-Point Fusion (PPF), which projects non-empty pillars onto image feature maps and uses an attention mechanism to map semantic features from pillars to their constituent points, fusing them with the points' geometric features. Additionally, we design the BD-IoU loss function, which measures 3D bounding box similarity, and a joint regression loss combining BD-IoU and Smooth L1, which effectively guides model training. Our framework achieves consistent improvements across KITTI benchmarks. On the validation set, PPF (PV-RCNN) improves Cyclist detection AP by at least 1.84% across all difficulty levels compared to other multimodal SOTA methods. On the test set, PPF-Net excels in pedestrian detection at the moderate and hard difficulty levels and achieves the best results in low-beam LiDAR scenarios.
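The joint regression loss mentioned in the abstract combines an IoU-style box similarity term with Smooth L1. The abstract does not define BD-IoU, so the sketch below is only illustrative: it substitutes a plain axis-aligned 3D IoU (no rotation) as a stand-in, and the box layout `(cx, cy, cz, l, w, h)`, the function names, and the weights `iou_weight` and `l1_weight` are all assumptions rather than the authors' formulation.

```python
def smooth_l1(pred, target, beta=1.0):
    """Mean Smooth L1 over box parameters (quadratic below beta, linear above)."""
    total = 0.0
    for p, t in zip(pred, target):
        d = abs(p - t)
        total += 0.5 * d * d / beta if d < beta else d - 0.5 * beta
    return total / len(pred)


def axis_aligned_iou_3d(a, b):
    """IoU of two axis-aligned boxes given as (cx, cy, cz, l, w, h).

    Stand-in for BD-IoU, which the abstract names but does not define.
    """
    inter = 1.0
    for i in range(3):
        lo = max(a[i] - a[i + 3] / 2, b[i] - b[i + 3] / 2)
        hi = min(a[i] + a[i + 3] / 2, b[i] + b[i + 3] / 2)
        inter *= max(0.0, hi - lo)  # zero overlap on any axis gives zero volume
    vol_a = a[3] * a[4] * a[5]
    vol_b = b[3] * b[4] * b[5]
    union = vol_a + vol_b - inter
    return inter / union if union > 0 else 0.0


def joint_regression_loss(pred, target, iou_weight=1.0, l1_weight=1.0):
    """Weighted sum of an IoU-based term (1 - IoU) and Smooth L1 (hypothetical weights)."""
    iou_term = 1.0 - axis_aligned_iou_3d(pred, target)
    return iou_weight * iou_term + l1_weight * smooth_l1(pred, target)


if __name__ == "__main__":
    gt = (0.0, 0.0, 0.0, 2.0, 2.0, 2.0)
    shifted = (1.0, 0.0, 0.0, 2.0, 2.0, 2.0)
    print(joint_regression_loss(gt, gt))
    print(joint_regression_loss(shifted, gt))
```

The IoU term stays informative when the boxes overlap, while Smooth L1 still provides a gradient signal when they do not overlap at all, which is a common motivation for pairing the two.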
Pages: 21