Deep Learning-Based Monocular 3D Object Detection with Refinement of Depth Information

被引:7
|
作者
Hu, Henan [1 ,2 ,3 ]
Zhu, Ming [1 ]
Li, Muyu [4 ]
Chan, Kwok-Leung [3 ]
机构
[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[4] Ctr Intelligent Multidimens Data Anal Ltd, Hong Kong, Peoples R China
关键词
3D object detection; monocular image; point cloud; deep learning; depth estimation; autonomous driving; NETWORK;
D O I
10.3390/s22072576
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Recently, the research on monocular 3D target detection based on pseudo-LiDAR data has made some progress. In contrast to LiDAR-based algorithms, the robustness of pseudo-LiDAR methods is still inferior. After conducting in-depth experiments, we realized that the main limitations are due to the inaccuracy of the target position and the uncertainty in the depth distribution of the foreground target. These two problems arise from the inaccurate depth estimation. To deal with the aforementioned problems, we propose two innovative solutions. The first is a novel method based on joint image segmentation and geometric constraints, used to predict the target depth and provide the depth prediction confidence measure. The predicted target depth is fused with the overall depth of the scene and results in the optimal target position. For the second, we utilize the target scale, normalized with the Gaussian function, as a priori information. The uncertainty of depth distribution, which can be visualized as long-tail noise, is reduced. With the refined depth information, we convert the optimized depth map into the point cloud representation, called a pseudo-LiDAR point cloud. Finally, we input the pseudo-LiDAR point cloud to the LiDAR-based algorithm to detect the 3D target. We conducted extensive experiments on the challenging KITTI dataset. The results demonstrate that our proposed framework outperforms various state-of-the-art methods by more than 12.37% and 5.34% on the easy and hard settings of the KITTI validation subset, respectively. On the KITTI test set, our framework also outperformed state-of-the-art methods by 5.1% and 1.76% on the easy and hard settings, respectively.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Aerial Monocular 3D Object Detection
    Hu, Yue
    Fang, Shaoheng
    Xie, Weidi
    Chen, Siheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
  • [32] Disentangling Monocular 3D Object Detection
    Simonelli, Andrea
    Bulo, Samuel Rota
    Porzi, Lorenzo
    Lopez-Antequera, Manuel
    Kontschieder, Peter
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1991 - 1999
  • [33] Monocular 3D Object Detection Based on Pseudo Multimodal Information Extraction and Keypoint Estimation
    Zhao, Dan
    Ji, Chaofeng
    Liu, Guizhong
    APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [34] Monocular 3D object detection for occluded targets based on spatial relationships and decoupled depth predictions
    Gao, Yanfei
    Miao, Xiongwei
    Zhang, Guoye
    FRONTIERS IN COMPUTER SCIENCE, 2025, 6
  • [35] DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection
    Peng, Liang
    Wu, Xiaopei
    Yang, Zheng
    Liu, Haifeng
    Cai, Deng
    COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 71 - 88
  • [36] A Mobile 3-D Object Recognition Processor With Deep-Learning-Based Monocular Depth Estimation
    Im, Dongseok
    Park, Gwangtae
    Li, Zhiyong
    Ryu, Junha
    Kang, Sanghoon
    Han, Donghyeon
    Lee, Jinsu
    Park, Wonhoon
    Kwon, Hankyul
    Yoo, Hoi-Jun
    IEEE MICRO, 2023, 43 (03) : 74 - 82
  • [37] Deep Fitting Degree Scoring Network for Monocular 3D Object Detection
    Liu, Lijie
    Lu, Jiwen
    Xu, Chunjing
    Tian, Qi
    Zhou, Jie
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1057 - 1066
  • [38] Analysis of object detection accuracy based on the density of 3D point clouds for deep learning-based shipyard datasets
    Jung, Ki-Seok
    Lee, Dong-Kun
    INTERNATIONAL JOURNAL OF NAVAL ARCHITECTURE AND OCEAN ENGINEERING, 2025, 17
  • [39] MoGDE: Boosting Mobile Monocular 3D Object Detection with Ground Depth Estimation
    Zhou, Yunsong
    Liu, Quan
    Zhu, Hongzi
    Li, Yunzhe
    Chang, Shan
    Guo, Minyi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [40] Monocular 3D Object Detection With Sequential Feature Association and Depth Hint Augmentation
    Gao, Tianze
    Pan, Huihui
    Gao, Huijun
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2022, 7 (02): : 240 - 250