Occlusion-Aware Plane-Constraints for Monocular 3D Object Detection

Cited by: 8

Authors:

Yao, Hongdou [1]

Chen, Jun [1]

Wang, Zheng [1]

Wang, Xiao [2]

Han, Pengfei [3]

Chai, Xiaoyu [1]

Qiu, Yansheng [1]

Affiliations:

[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan 430072, Peoples R China

[2] Wuhan Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430072, Peoples R China

[3] Northwestern Polytech Univ, Sch Cybersecur, Xian 710000, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2024 / Vol. 25 / No. 5

Keywords:

3D object detection; monocular image; occlusion object; keypoints sampling; adaptive plane constraints;

DOI:

10.1109/TITS.2023.3323036

Chinese Library Classification (CLC) number:

TU [Architectural Science];

Discipline classification code:

0813;

Abstract:

The task of 3D object detection poses a significant challenge for 3D scene understanding and is primarily employed in the fields of robot control and autonomous driving. Monocular-based 3D detection methods are more cost-effective and practical than stereo-based or LiDAR-based methods. Monocular image 3D detection methods have garnered considerable attention from researchers. However, the impact of occlusion scenarios of the objects on the keypoints prediction is often overlooked. To address this issue, the present paper proposes a novel 3D monocular object detection method named MonOAPC, which is equipped with occlusion-aware plane-constraints. This method can adaptively utilize partial keypoints to infer the plane location of the object based on the level of occlusion and highlights that the introduction of plane constraints is advantageous for the 3D detection task. First, the plane information of the object in 3D space is beneficial to optimize the keypoints regression, and considering that each plane holds different significance, an adaptive plane location inference module is proposed to enhance the keypoints location regression. Second, a novel co-depth estimation module is proposed to jointly estimate the object's spatial location through various depth inference methods, thereby improving the generalization of the object depth estimation. Furthermore, the paper demonstrates that the accuracy of 3D object detection can be indirectly improved by introducing plane information to promote keypoints regression, and that plane information is effective for monocular 3D detection. The experimental outcomes show that the MonOAPC method can attain competitive results.

Citation

Pages: 4593-4605

Page count: 13

56 references in total

[41] PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud [J].

Shi, Shaoshuai; Wang, Xiaogang; Li, Hongsheng.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 770-779

[42] Geometry-based Distance Decomposition for Monocular 3D Object Detection [J].

Shi, Xuepeng; Ye, Qi; Chen, Xiaozhi; Chen, Chuangrong; Chen, Zhixiang; Kim, Tae-Kyun.

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 15152-15161

[43] Simonelli A., 2020, LECT NOTES COMPUTER, V12367

[44] Disentangling Monocular 3D Object Detection [J].

Simonelli, Andrea; Bulo, Samuel Rota; Porzi, Lorenzo; Lopez-Antequera, Manuel; Kontschieder, Peter.

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 1991-1999

[45] ApolloCar3D: A Large 3D Car Instance Understanding Benchmark for Autonomous Driving [J].

Song, Xibin; Wang, Peng; Zhou, Dingfu; Zhu, Rui; Guan, Chenye; Dai, Yuchao; Su, Hao; Li, Hongdong; Yang, Ruigang.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 5447-5457

[46] FCOS: Fully Convolutional One-Stage Object Detection [J].

Tian, Zhi; Shen, Chunhua; Chen, Hao; He, Tong.

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 9626-9635

[47] Focal Loss for Dense Object Detection [J].

Lin, Tsung-Yi; Goyal, Priya; Girshick, Ross; He, Kaiming; Dollar, Piotr.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 2999-3007

[48] Vianney J. M. U., 2019, ARXIV

[49] Wang XL, 2020, AAAI CONF ARTIF INTE, V34, P12257

[50] UnOS: Unified Unsupervised Optical-flow and Stereo-depth Estimation by Watching Videos [J].

Wang, Yang; Wang, Peng; Yang, Zhenheng; Luo, Chenxu; Yang, Yi; Xu, Wei.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 8063-8073
