ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates

被引：5

作者：

Tao, Manli ^{[1
,2
]}

Zhao, Chaoyang ^{[1
,3
]}

Wang, Jinqiao ^{[1
,2
,3
]}

Tang, Ming ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[3] ObjectEye Inc, Beijing 100000, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2024年 / 31卷

关键词：

Three-dimensional displays; Proposals; Object detection; Feature extraction; Point cloud compression; Aggregates; Sun; 3D object detection; image candidates; pseudo 3D proposal; target missing; NETWORK;

D O I：

10.1109/LSP.2023.3336569

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Multi-modal fusion methods combine the advantages of both point clouds and RGB images to boost the performance of 3D object detection. Despite the significant progress, we find that existing two-stage multi-modal fusion methods suffer from the 3D proposal missing in the first stage and projected-style feature fusion mechanism. To solve these problems, we propose a two-stage multi-modal feature fusion network, which improves the recall rate of hard targets in the first stage of network with pseudo 3D proposals generated from image candidates. Then, considering the complementary information between similar image foreground features across multiple objects, we design a multi-modal cross-target fusion module to pay more attention to the foreground objects. It enables a 3D proposal can aggregate the semantic features of multiple image candidates belonging to the same category. Finally, these enhanced fused proposals are processed in the second stage to further boost the performance of 3D detector. Experimental results on SUN RGB-D and KITTI datasets show the effectiveness of our proposed method.

引用

页码：241 / 245

页数：5

共 50 条

[1] TSF: Two-Stage Sequential Fusion for 3D Object Detection
Qi, Heng
Shi, Peicheng
Liu, Zhiqiang
Yang, Aixi
IEEE SENSORS JOURNAL, 2022, 22 (12) : 12163 - 12172
[2] RangeLVDet: Boosting 3D Object Detection in LIDAR With Range Image and RGB Image
Zhang, Zehan
Liang, Zhidong
Zhang, Ming
Zhao, Xian
Li, Hao
Yang, Ming
Tan, Wenming
Pu, Shiliang
IEEE SENSORS JOURNAL, 2022, 22 (02) : 1391 - 1403
[3] TSFF: a two-stage fusion framework for 3D object detection
Jiang, Guoqing
Li, Saiya
Huang, Ziyu
Cai, Guorong
Su, Jinhe
PeerJ Computer Science, 2024, 10
[4] TSFF: a two-stage fusion framework for 3D object detection
Jiang, Guoqing
Li, Saiya
Huang, Ziyu
Cai, Guorong
Su, Jinhe
PEERJ COMPUTER SCIENCE, 2024, 10
[5] HybridPillars: Hybrid Point-Pillar Network for Real-Time Two-Stage 3-D Object Detection
Huang, Zhicong
Huang, Yuxiao
Zheng, Zhijie
Hu, Haifeng
Chen, Dihu
IEEE SENSORS JOURNAL, 2024, 24 (22) : 38318 - 38328
[6] Super Sparse 3D Object Detection
Fan, Lue
Yang, Yuxue
Wang, Feng
Wang, Naiyan
Zhang, Zhaoxiang
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12490 - 12505
[7] SOFW: A Synergistic Optimization Framework for Indoor 3D Object Detection
Dai, Kun
Jiang, Zhiqiang
Xie, Tao
Wang, Ke
Liu, Dedong
Fan, Zhendong
Li, Ruifeng
Zhao, Lijun
Omar, Mohamed
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 637 - 651
[8] An Efficient Ungrouped Mask Method With two Learnable Parameters for 3D Object Detection
Guo, Shuai
Shi, Lei
Jiang, Xiaoheng
Lv, Pei
Liu, Qidong
Hu, Yazhou
Ji, Rongrong
Xu, Mingliang
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1003 - 1017
[9] BirdNet plus : Two-Stage 3D Object Detection in LiDAR Through a Sparsity-Invariant Bird's Eye View
Barrera, Alejandro
Beltran, Jorge
Guindel, Carlos
Iglesias, Jose Antonio
Garcia, Fernando
IEEE ACCESS, 2021, 9 : 160299 - 160316
[10] Collaborative 3D Object Detection for Autonomous Vehicles via Learnable Communications
Wang, J.
Zeng, Y.
Gong, Y.
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (09) : 9804 - 9816

← 1 2 3 4 5 →