Lidar Point Cloud Guided Monocular 3D Object Detection

被引:23
作者
Peng, Liang [1 ,2 ]
Liu, Fei
Yu, Zhengxu [1 ]
Yan, Senbo [1 ,2 ]
Deng, Dan [2 ]
Yang, Zheng [2 ]
Liu, Haifeng [1 ]
Cai, Deng [1 ,2 ]
机构
[1] Zhejiang Univ, State Key Lab CAD & CG, Hangzhou, Peoples R China
[2] Fabu Inc, Hangzhou, Peoples R China
来源
COMPUTER VISION - ECCV 2022, PT I | 2022年 / 13661卷
关键词
Monocular 3D detection; LiDAR point cloud; Self-driving;
D O I
10.1007/978-3-031-19769-7_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monocular 3D object detection is a challenging task in the self-driving and computer vision community. As a common practice, most previous works use manually annotated 3D box labels, where the annotating process is expensive. In this paper, we find that the precisely and carefully annotated labels may be unnecessary in monocular 3D detection, which is an interesting and counterintuitive finding. Using rough labels that are randomly disturbed, the detector can achieve very close accuracy compared to the one using the ground-truth labels. We delve into this underlying mechanism and then empirically find that: concerning the label accuracy, the 3D location part in the label is preferred compared to other parts of labels. Motivated by the conclusions above and considering the precise LiDAR 3D measurement, we propose a simple and effective framework, dubbed LiDAR point cloud guided monocular 3D object detection (LPCG). This framework is capable of either reducing the annotation costs or considerably boosting the detection accuracy without introducing extra annotation costs. Specifically, It generates pseudo labels from unlabeled LiDAR point clouds. Thanks to accurate LiDAR 3D measurements in 3D space, such pseudo labels can replace manually annotated labels in the training of monocular 3D detectors, since their 3D location information is precise. LPCG can be applied into any monocular 3D detector to fully use massive unlabeled data in a selfdriving system. As a result, in KITTI benchmark, we take the first place on both monocular 3D and BEV (bird's-eye-view) detection with a significant margin. In Waymo benchmark, our method using 10% labeled data achieves comparable accuracy to the baseline detector using 100% labeled data. The codes are released at https://github.com/SPengLiang/LPCG.
引用
收藏
页码:123 / 139
页数:17
相关论文
共 50 条
  • [31] 3D Vehicle Detection Based on LiDAR and Camera Fusion
    Yingfeng Cai
    Tiantian Zhang
    Hai Wang
    Yicheng Li
    Qingchao Liu
    Xiaobo Chen
    Automotive Innovation, 2019, 2 : 276 - 283
  • [32] 3D Vehicle Detection Based on LiDAR and Camera Fusion
    Cai, Yingfeng
    Zhang, Tiantian
    Wang, Hai
    Li, Yicheng
    Liu, Qingchao
    Chen, Xiaobo
    AUTOMOTIVE INNOVATION, 2019, 2 (04) : 276 - 283
  • [33] Depth Representation of LiDAR Point Cloud with Adaptive Surface Patching for Object Classification
    Lertniphonphan, Kanokphan
    Komorita, Satoshi
    Tasaka, Kazuyuki
    Yanagihara, Hiromasa
    MULTIMEDIA MODELING, MMM 2018, PT II, 2018, 10705 : 367 - 371
  • [34] Robust Normal Estimation for 3D LiDAR Point Clouds in Urban Environments
    Zhao, Ruibin
    Pang, Mingyong
    Liu, Caixia
    Zhang, Yanling
    SENSORS, 2019, 19 (05)
  • [35] Text2LiDAR: Text-Guided LiDAR Point Cloud Generation via Equirectangular Transformer
    Wu, Yang
    Zhang, Kaihua
    Qian, Jianjun
    Xie, Jin
    Yang, Jian
    COMPUTER VISION - ECCV 2024, PT LVI, 2025, 15114 : 291 - 310
  • [36] SAT3D: Slot Attention Transformer for 3D Point Cloud Semantic Segmentation
    Ibrahim, Muhammad
    Akhtar, Naveed
    Anwar, Saeed
    Mian, Ajmal
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (05) : 5456 - 5466
  • [37] 3D-SiamRPN: An End-to-End Learning Method for Real-Time 3D Single Object Tracking Using Raw Point Cloud
    Fang, Zheng
    Zhou, Sifan
    Cui, Yubo
    Scherer, Sebastian
    IEEE SENSORS JOURNAL, 2021, 21 (04) : 4995 - 5011
  • [38] 3D object detection network based on symmetric shape generation
    Tu X.
    Zheng S.
    Yu S.
    Li W.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2023, 44 (06): : 252 - 263
  • [39] CenterFormer: Center-Based Transformer for 3D Object Detection
    Zhou, Zixiang
    Zhao, Xiangchen
    Wang, Yu
    Wang, Panqu
    Foroosh, Hassan
    COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 496 - 513
  • [40] DENSITY-BASED METHOD FOR BUILDING DETECTION FROM LiDAR POINT CLOUD
    Mahphood, A.
    Arefi, H.
    ISPRS GEOSPATIAL CONFERENCE 2022, JOINT 6TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING, SMPR/4TH GEOSPATIAL INFORMATION RESEARCH, GIRESEARCH CONFERENCES, VOL. 10-4, 2023, : 423 - 428