ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates

Cited by: 5
Authors
Tao, Manli [1, 2]
Zhao, Chaoyang [1, 3]
Wang, Jinqiao [1, 2, 3]
Tang, Ming [1, 2]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] ObjectEye Inc, Beijing 100000, Peoples R China
Keywords
Three-dimensional displays; Proposals; Object detection; Feature extraction; Point cloud compression; Aggregates; Sun; 3D object detection; image candidates; pseudo 3D proposal; target missing; NETWORK;
DOI
10.1109/LSP.2023.3336569
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
Multi-modal fusion methods combine the advantages of point clouds and RGB images to boost the performance of 3D object detection. Despite significant progress, we find that existing two-stage multi-modal fusion methods suffer from missing 3D proposals in the first stage and from a projection-style feature fusion mechanism. To solve these problems, we propose a two-stage multi-modal feature fusion network that improves the recall of hard targets in the first stage by adding pseudo 3D proposals generated from image candidates. Then, considering the complementary information between similar image foreground features across multiple objects, we design a multi-modal cross-target fusion module that pays more attention to foreground objects. It enables a 3D proposal to aggregate the semantic features of multiple image candidates belonging to the same category. Finally, these enhanced fused proposals are processed in the second stage to further boost the performance of the 3D detector. Experimental results on the SUN RGB-D and KITTI datasets show the effectiveness of the proposed method.
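
The cross-target aggregation described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the fusion operator, so the scaled dot-product attention, the residual addition, and all names here (`cross_target_fuse`, `prop_feats`, `cand_feats`) are assumptions chosen for illustration. The only property taken from the abstract is that each 3D proposal aggregates features from multiple image candidates of the same category.

```python
import numpy as np

def cross_target_fuse(prop_feats, prop_labels, cand_feats, cand_labels):
    """Hypothetical same-category fusion (illustrative, not the paper's module).

    prop_feats:  (P, D) features of 3D proposals
    prop_labels: (P,)   integer category of each proposal
    cand_feats:  (C, D) features of 2D image candidates, C >= 1
    cand_labels: (C,)   integer category of each candidate
    Returns the fused (P, D) features and the (P, C) attention weights.
    """
    d = prop_feats.shape[1]
    # Similarity between every proposal and every image candidate.
    scores = prop_feats @ cand_feats.T / np.sqrt(d)
    # Assumption: only candidates of the same category may contribute.
    same = prop_labels[:, None] == cand_labels[None, :]
    scores = np.where(same, scores, -np.inf)
    has_match = same.any(axis=1)
    # Numerically stable softmax; rows with no matching candidate get zero weights.
    row_max = np.where(has_match, scores.max(axis=1), 0.0)
    weights = np.exp(scores - row_max[:, None])
    denom = weights.sum(axis=1, keepdims=True)
    weights = np.divide(weights, denom, out=np.zeros_like(weights), where=denom > 0)
    # Assumption: residual addition, so unmatched proposals keep their features.
    return prop_feats + weights @ cand_feats, weights
```

With this design, a proposal whose category has no image candidate passes through unchanged, while a proposal with several same-category candidates receives a weighted mix of their features.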
Pages: 241-245
Page count: 5