Occlusion-Aware Plane-Constraints for Monocular 3D Object Detection

Cited by: 5
Authors
Yao, Hongdou [1]
Chen, Jun [1]
Wang, Zheng [1]
Wang, Xiao [2]
Han, Pengfei [3]
Chai, Xiaoyu [1]
Qiu, Yansheng [1]
Affiliations
[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Wuhan Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430072, Peoples R China
[3] Northwestern Polytech Univ, Sch Cybersecur, Xian 710000, Peoples R China
Keywords
3D object detection; monocular image; occlusion object; keypoints sampling; adaptive plane constraints
DOI
10.1109/TITS.2023.3323036
Chinese Library Classification
TU [Architectural Science]
Discipline classification code
0813
Abstract
The task of 3D object detection poses a significant challenge for 3D scene understanding and is primarily employed in the fields of robot control and autonomous driving. Monocular-based 3D detection methods are more cost-effective and practical than stereo-based or LiDAR-based methods. Monocular image 3D detection methods have garnered considerable attention from researchers. However, the impact of occlusion scenarios of the objects on the keypoints prediction is often overlooked. To address this issue, the present paper proposes a novel 3D monocular object detection method named MonOAPC, which is equipped with occlusion-aware plane-constraints. This method can adaptively utilize partial keypoints to infer the plane location of the object based on the level of occlusion and highlights that the introduction of plane constraints is advantageous for the 3D detection task. First, the plane information of the object in 3D space is beneficial to optimize the keypoints regression, and considering that each plane holds different significance, an adaptive plane location inference module is proposed to enhance the keypoints location regression. Second, a novel co-depth estimation module is proposed to jointly estimate the object's spatial location through various depth inference methods, thereby improving the generalization of the object depth estimation. Furthermore, the paper demonstrates that the accuracy of 3D object detection can be indirectly improved by introducing plane information to promote keypoints regression, and that plane information is effective for monocular 3D detection. The experimental outcomes show that the MonOAPC method can attain competitive results.
Pages: 4593-4605
Page count: 13