Leveraging front and side cues for occlusion handling in monocular 3D object detection

被引:0
|
作者
Yuying Song
Zecheng Li
Jingxuan Wu
Chunyi Song
Zhiwei Xu
机构
[1] Ocean College,The Institute of Marine Electronic and Intelligent System
[2] Zhejiang University,undefined
[3] The Engineering Research Center of Oceanic Sensing Technology and Equipment,undefined
[4] Ministry of Education,undefined
[5] The Donghai Laboratory,undefined
来源
The Visual Computer | 2024年 / 40卷
关键词
Monocular object detection; Occlusion Handling; Compositional model; Uncertainty; Attention mechanism; Autonomous driving;
D O I
暂无
中图分类号
学科分类号
摘要
3D object detection, as an essential part of perception, plays a principal role in the autonomous driving system. The cost-competitive monocular 3D object detection has drawn increasing attention recently. However, it still suffers an inferior accuracy especially for occluded objects due to the limited camera view. Inspired by compositional models, in which an object is represented as a combination of multiple components, this paper proposes a new monocular 3D object detection method that decreases the impact of occlusion by utilizing an object’s front and side cues. To do this, the features are extracted from a decoupled front and side representation and then fused by an attention-based module to obtain a more consistent feature distribution. An uncertainty-guided depth ensemble based on geometry is further applied to refine the depth prediction. Experiment results demonstrate that as compared to the conventional methods, the proposed method significantly improves the detection performance for occluded objects while still satisfying real-time efficiency, with the Average Precision on 40 recall positions (AP40), respectively, increasing by 10.23% for partly occluded objects and 12.22% for mostly occluded objects in the KITTI benchmark. The codes are released at https://github.com/kagurua/Front-Side-Det
引用
收藏
页码:1757 / 1773
页数:16
相关论文
共 50 条
  • [1] Leveraging front and side cues for occlusion handling in monocular 3D object detection
    Song, Yuying
    Li, Zecheng
    Wu, Jingxuan
    Song, Chunyi
    Xu, Zhiwei
    VISUAL COMPUTER, 2024, 40 (03): : 1757 - 1773
  • [2] Visibility of points: Mining occlusion cues for monocular 3D object detection
    Chu, Huazhen
    Mo, Lisha
    Wang, Rongquan
    Hu, Tianyu
    Ma, Huimin
    NEUROCOMPUTING, 2022, 502 : 48 - 56
  • [3] Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction
    Ku, Jason
    Pon, Alex D.
    Waslander, Steven L.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11859 - 11868
  • [4] Gated3D: Monocular 3D Object Detection From Temporal Illumination Cues
    Julca-Aguilar, Frank
    Taylor, Jason
    Bijelic, Mario
    Mannan, Fahim
    Tseng, Ethan
    Heide, Felix
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2918 - 2928
  • [5] Aerial Monocular 3D Object Detection
    Hu, Yue
    Fang, Shaoheng
    Xie, Weidi
    Chen, Siheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
  • [6] Disentangling Monocular 3D Object Detection
    Simonelli, Andrea
    Bulo, Samuel Rota
    Porzi, Lorenzo
    Lopez-Antequera, Manuel
    Kontschieder, Peter
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1991 - 1999
  • [7] Occlusion-Aware Plane-Constraints for Monocular 3D Object Detection
    Yao, Hongdou
    Chen, Jun
    Wang, Zheng
    Wang, Xiao
    Han, Pengfei
    Chai, Xiaoyu
    Qiu, Yansheng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (05) : 4593 - 4605
  • [8] Monocular 3D Object Detection for Autonomous Driving
    Chen, Xiaozhi
    Kundu, Kaustav
    Zhang, Ziyu
    Ma, Huimin
    Fidler, Sanja
    Urtasun, Raquel
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2147 - 2156
  • [9] Dimension Embeddings for Monocular 3D Object Detection
    Zhang, Yunpeng
    Zheng, Wenzhao
    Zhu, Zheng
    Huang, Guan
    Du, Dalong
    Zhou, Jie
    Lu, Jiwen
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1579 - 1588
  • [10] Multivariate Probabilistic Monocular 3D Object Detection
    Shi, Xuepeng
    Chen, Zhixiang
    Kim, Tae-Kyun
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4270 - 4279