ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates

被引:5
|
作者
Tao, Manli [1 ,2 ]
Zhao, Chaoyang [1 ,3 ]
Wang, Jinqiao [1 ,2 ,3 ]
Tang, Ming [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] ObjectEye Inc, Beijing 100000, Peoples R China
关键词
Three-dimensional displays; Proposals; Object detection; Feature extraction; Point cloud compression; Aggregates; Sun; 3D object detection; image candidates; pseudo 3D proposal; target missing; NETWORK;
D O I
10.1109/LSP.2023.3336569
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multi-modal fusion methods combine the advantages of both point clouds and RGB images to boost the performance of 3D object detection. Despite the significant progress, we find that existing two-stage multi-modal fusion methods suffer from the 3D proposal missing in the first stage and projected-style feature fusion mechanism. To solve these problems, we propose a two-stage multi-modal feature fusion network, which improves the recall rate of hard targets in the first stage of network with pseudo 3D proposals generated from image candidates. Then, considering the complementary information between similar image foreground features across multiple objects, we design a multi-modal cross-target fusion module to pay more attention to the foreground objects. It enables a 3D proposal can aggregate the semantic features of multiple image candidates belonging to the same category. Finally, these enhanced fused proposals are processed in the second stage to further boost the performance of 3D detector. Experimental results on SUN RGB-D and KITTI datasets show the effectiveness of our proposed method.
引用
收藏
页码:241 / 245
页数:5
相关论文
共 50 条
  • [31] Geometry-Guided Point Generation for 3D Object Detection
    Wang, Kai
    Zhou, Mingliang
    Lin, Qing
    Niu, Guanglin
    Zhang, Xiaowei
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 136 - 140
  • [32] Context-Aware 3D Object Detection From a Single Image in Autonomous Driving
    Zhou, Dingfu
    Song, Xibin
    Fang, Jin
    Dai, Yuchao
    Li, Hongdong
    Zhang, Liangjun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 18568 - 18580
  • [33] Fully Sparse Fusion for 3D Object Detection
    Li, Yingyan
    Fan, Lue
    Liu, Yang
    Huang, Zehao
    Chen, Yuntao
    Wang, Naiyan
    Zhang, Zhaoxiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (11) : 7217 - 7231
  • [34] SMURF: Spatial Multi-Representation Fusion for 3D Object Detection With 4D Imaging Radar
    Liu, Jianan
    Zhao, Qiuchi
    Xiong, Weiyi
    Huang, Tao
    Han, Qing-Long
    Zhu, Bing
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 799 - 812
  • [35] VFL3D: A Single-Stage Fine-Grained Lightweight Point Cloud 3D Object Detection Algorithm Based on Voxels
    Li, Bing
    Chen, Jie
    Li, Xinde
    Xu, Rui
    Li, Qian
    Cao, Yice
    Wu, Jun
    Qu, Lei
    Li, Yingsong
    Diniz, Paulo S. R.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (09) : 12034 - 12048
  • [36] DA-Net: Density-Aware 3D Object Detection Network for Point Clouds
    Wang, Shuhua
    Lu, Ke
    Xue, Jian
    Zhao, Yang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 665 - 678
  • [37] ContextNet: Leveraging Comprehensive Contextual Information for Enhanced 3D Object Detection
    Pei, Caiyan
    Zhang, Shuai
    Cao, Lijun
    Zhao, Liqiang
    IEEE ACCESS, 2024, 12 : 106744 - 106756
  • [38] Multimodal 3D Object Detection Based on Sparse Interaction in Internet of Vehicles
    Li, Hui
    Ge, Tongao
    Bai, Keqiang
    Nie, Gaofeng
    Xu, Lingwei
    Ai, Xiaoxue
    Cao, Song
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (02) : 2174 - 2186
  • [39] BEVFusion With Dual Hard Instance Probing for Multimodal 3D Object Detection
    Kim, Taeho
    Kim, Joohee
    IEEE ACCESS, 2025, 13 : 25546 - 25556
  • [40] CMAN: Leaning Global Structure Correlation for Monocular 3D Object Detection
    Cao, Yuanzhouhan
    Zhang, Hui
    Li, Yidong
    Ren, Chao
    Lang, Congyan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 24727 - 24737