Occlusion-Aware Plane-Constraints for Monocular 3D Object Detection

Cited by: 8

Authors:

Yao, Hongdou [1]

Chen, Jun [1]

Wang, Zheng [1]

Wang, Xiao [2]

Han, Pengfei [3]

Chai, Xiaoyu [1]

Qiu, Yansheng [1]

Affiliations:

[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan 430072, Peoples R China

[2] Wuhan Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430072, Peoples R China

[3] Northwestern Polytech Univ, Sch Cybersecur, Xian 710000, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2024 / Vol. 25 / No. 05

Keywords:

3D object detection; monocular image; occlusion object; keypoints sampling; adaptive plane constraints

DOI:

10.1109/TITS.2023.3323036

Chinese Library Classification (CLC) number:

TU [Building Science]

Discipline classification code:

0813

Abstract:

The task of 3D object detection poses a significant challenge for 3D scene understanding and is primarily employed in the fields of robot control and autonomous driving. Monocular-based 3D detection methods are more cost-effective and practical than stereo-based or LiDAR-based methods. Monocular image 3D detection methods have garnered considerable attention from researchers. However, the impact of occlusion scenarios of the objects on the keypoints prediction is often overlooked. To address this issue, the present paper proposes a novel 3D monocular object detection method named MonOAPC, which is equipped with occlusion-aware plane-constraints. This method can adaptively utilize partial keypoints to infer the plane location of the object based on the level of occlusion and highlights that the introduction of plane constraints is advantageous for the 3D detection task. First, the plane information of the object in 3D space is beneficial to optimize the keypoints regression, and considering that each plane holds different significance, an adaptive plane location inference module is proposed to enhance the keypoints location regression. Second, a novel co-depth estimation module is proposed to jointly estimate the object's spatial location through various depth inference methods, thereby improving the generalization of the object depth estimation. Furthermore, the paper demonstrates that the accuracy of 3D object detection can be indirectly improved by introducing plane information to promote keypoints regression, and that plane information is effective for monocular 3D detection. The experimental outcomes show that the MonOAPC method can attain competitive results.


Pages: 4593-4605

Page count: 13

References: 56 in total

