ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates

被引：5

作者：

Tao, Manli ^{[1
,2
]}

Zhao, Chaoyang ^{[1
,3
]}

Wang, Jinqiao ^{[1
,2
,3
]}

Tang, Ming ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[3] ObjectEye Inc, Beijing 100000, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2024年 / 31卷

关键词：

Three-dimensional displays; Proposals; Object detection; Feature extraction; Point cloud compression; Aggregates; Sun; 3D object detection; image candidates; pseudo 3D proposal; target missing; NETWORK;

D O I：

10.1109/LSP.2023.3336569

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Multi-modal fusion methods combine the advantages of both point clouds and RGB images to boost the performance of 3D object detection. Despite the significant progress, we find that existing two-stage multi-modal fusion methods suffer from the 3D proposal missing in the first stage and projected-style feature fusion mechanism. To solve these problems, we propose a two-stage multi-modal feature fusion network, which improves the recall rate of hard targets in the first stage of network with pseudo 3D proposals generated from image candidates. Then, considering the complementary information between similar image foreground features across multiple objects, we design a multi-modal cross-target fusion module to pay more attention to the foreground objects. It enables a 3D proposal can aggregate the semantic features of multiple image candidates belonging to the same category. Finally, these enhanced fused proposals are processed in the second stage to further boost the performance of 3D detector. Experimental results on SUN RGB-D and KITTI datasets show the effectiveness of our proposed method.

引用

页码：241 / 245

页数：5

共 50 条

[31] Geometry-Guided Point Generation for 3D Object Detection
Wang, Kai
Zhou, Mingliang
Lin, Qing
Niu, Guanglin
Zhang, Xiaowei
IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 136 - 140
[32] Context-Aware 3D Object Detection From a Single Image in Autonomous Driving
Zhou, Dingfu
Song, Xibin
Fang, Jin
Dai, Yuchao
Li, Hongdong
Zhang, Liangjun
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 18568 - 18580
[33] Fully Sparse Fusion for 3D Object Detection
Li, Yingyan
Fan, Lue
Liu, Yang
Huang, Zehao
Chen, Yuntao
Wang, Naiyan
Zhang, Zhaoxiang
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (11) : 7217 - 7231
[34] SMURF: Spatial Multi-Representation Fusion for 3D Object Detection With 4D Imaging Radar
Liu, Jianan
Zhao, Qiuchi
Xiong, Weiyi
Huang, Tao
Han, Qing-Long
Zhu, Bing
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 799 - 812
[35] VFL3D: A Single-Stage Fine-Grained Lightweight Point Cloud 3D Object Detection Algorithm Based on Voxels
Li, Bing
Chen, Jie
Li, Xinde
Xu, Rui
Li, Qian
Cao, Yice
Wu, Jun
Qu, Lei
Li, Yingsong
Diniz, Paulo S. R.
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (09) : 12034 - 12048
[36] DA-Net: Density-Aware 3D Object Detection Network for Point Clouds
Wang, Shuhua
Lu, Ke
Xue, Jian
Zhao, Yang
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 665 - 678
[37] ContextNet: Leveraging Comprehensive Contextual Information for Enhanced 3D Object Detection
Pei, Caiyan
Zhang, Shuai
Cao, Lijun
Zhao, Liqiang
IEEE ACCESS, 2024, 12 : 106744 - 106756
[38] Multimodal 3D Object Detection Based on Sparse Interaction in Internet of Vehicles
Li, Hui
Ge, Tongao
Bai, Keqiang
Nie, Gaofeng
Xu, Lingwei
Ai, Xiaoxue
Cao, Song
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (02) : 2174 - 2186
[39] BEVFusion With Dual Hard Instance Probing for Multimodal 3D Object Detection
Kim, Taeho
Kim, Joohee
IEEE ACCESS, 2025, 13 : 25546 - 25556
[40] CMAN: Leaning Global Structure Correlation for Monocular 3D Object Detection
Cao, Yuanzhouhan
Zhang, Hui
Li, Yidong
Ren, Chao
Lang, Congyan
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 24727 - 24737

← 1 2 3 4 5 →