M3D-RPN: Monocular 3D Region Proposal Network for Object Detection

被引:392
作者
Brazil, Garrick [1 ]
Liu, Xiaoming [1 ]
机构
[1] Michigan State Univ, E Lansing, MI 48824 USA
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
关键词
D O I
10.1109/ICCV.2019.00938
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Understanding the world in 3D is a critical component of urban autonomous driving. Generally, the combination of expensive LiDAR sensors and stereo RGB imaging has been paramount for successful 3D object detection algorithms, whereas monocular image-only methods experience drastically reduced performance. We propose to reduce the gap by reformulating the monocular 3D detection problem as a standalone 3D region proposal network. We leverage the geometric relationship of 2D and 3D perspectives, allowing 3D boxes to utilize well-known and powerful convolutional features generated in the image-space. To help address the strenuous 3D parameter estimations, we further design depth-aware convolutional layers which enable location specific feature development and in consequence improved 3D scene understanding. Compared to prior work in monocular 3D detection, our method consists of only the proposed 3D region proposal network rather than relying on external networks, data, or multiple stages. M3D-RPN is able to significantly improve the performance of both monocular 3D Object Detection and Bird's Eye View tasks within the KITTI urban autonomous driving dataset, while efficiently using a shared multi-class model.
引用
收藏
页码:9286 / 9295
页数:10
相关论文
共 50 条
[21]   Multivariate Probabilistic Monocular 3D Object Detection [J].
Shi, Xuepeng ;
Chen, Zhixiang ;
Kim, Tae-Kyun .
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, :4270-4279
[22]   Homography Loss for Monocular 3D Object Detection [J].
Gu, Jiaqi ;
Wu, Bojian ;
Fan, Lubin ;
Huang, Jianqiang ;
Cao, Shen ;
Xiang, Zhiyu ;
Hua, Xian-Sheng .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :1070-1079
[23]   Monocular 3D object detection for distant objects [J].
Li, Jiahao ;
Han, Xiaohong .
JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03) :33021
[24]   Dp-M3D: Monocular 3D object detection algorithm with depth perception capability [J].
Shi, Peicheng ;
Dong, Xinlong ;
Ge, Runshuai ;
Liu, Zhiqiang ;
Yang, Aixi .
KNOWLEDGE-BASED SYSTEMS, 2025, 318
[25]   DEPTH-ASSISTED JOINT DETECTION NETWORK FOR MONOCULAR 3D OBJECT DETECTION [J].
Lei, Jianjun ;
Guo, Tingyi ;
Peng, Bo ;
Yu, Chuanbo .
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, :2204-2208
[26]   Triangulation Learning Network: from Monocular to Stereo 3D Object Detection [J].
Qin, Zengyi ;
Wang, Jinglu ;
Lu, Yan .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7607-7615
[27]   Deep Fitting Degree Scoring Network for Monocular 3D Object Detection [J].
Liu, Lijie ;
Lu, Jiwen ;
Xu, Chunjing ;
Tian, Qi ;
Zhou, Jie .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1057-1066
[28]   LAM3D: Leveraging Attention for Monocular 3D Object Detection [J].
Sas, Diana-Alexandra ;
Di Bella, Leandro ;
Lyu, Yangxintong ;
Oniga, Florin ;
Munteanu, Adrian .
2024 IEEE 26TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, MMSP, 2024,
[29]   SGM3D: Stereo Guided Monocular 3D Object Detection [J].
Zhou, Zheyuan ;
Du, Liang ;
Ye, Xiaoqing ;
Zou, Zhikang ;
Tan, Xiao ;
Zhang, Li ;
Xue, Xiangyang ;
Feng, Jianfeng .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) :10478-10485
[30]   RoadSense3D: A Framework for Roadside Monocular 3D Object Detection [J].
Carta, Salvatore ;
Castrillon-Santana, Modesto ;
Marras, Mirko ;
Mohamed, Sondos ;
Podda, Alessandro Sebastian ;
Saia, Roberto ;
Sau, Marco ;
Zimmer, Walter .
ADJUNCT PROCEEDINGS OF THE 32ND ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, UMAP 2024, 2024, :452-459