3D Cascade RCNN: High Quality Object Detection in Point Clouds

被引:11
|
作者
Cai, Qi [1 ]
Pan, Yingwei [2 ]
Yao, Ting [2 ]
Mei, Tao [2 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230026, Peoples R China
[2] JD AI Res, Beijing 100105, Peoples R China
关键词
Three-dimensional displays; Proposals; Object detection; Point cloud compression; Detectors; Training; Task analysis; Point cloud; 3D object detection; cascade detection; sample re-weighting; R-CNN;
D O I
10.1109/TIP.2022.3201469
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent progress on 2D object detection has featured Cascade RCNN, which capitalizes on a sequence of cascade detectors to progressively improve proposal quality, towards high-quality object detection. However, there has not been evidence in support of building such cascade structures for 3D object detection, a challenging detection scenario with highly sparse LiDAR point clouds. In this work, we present a simple yet effective cascade architecture, named 3D Cascade RCNN, that allocates multiple detectors based on the voxelized point clouds in a cascade paradigm, pursuing higher quality 3D object detector progressively. Furthermore, we quantitatively define the sparsity level of the points within 3D bounding box of each object as the point completeness score, which is exploited as the task weight for each proposal to guide the learning of each stage detector. The spirit behind is to assign higher weights for high-quality proposals with relatively complete point distribution, while down-weight the proposals with extremely sparse points that often incur noise during training. This design of completeness-aware re-weighting elegantly upgrades the cascade paradigm to be better applicable for the sparse input data, without increasing any FLOP budgets. Through extensive experiments on both the KITTI dataset and Waymo Open Dataset, we validate the superiority of our proposed 3D Cascade RCNN, when comparing to state-of-the-art 3D object detection techniques. The source code is publicly available at https://github.com/caiqi/Cascasde-3D.
引用
收藏
页码:5706 / 5719
页数:14
相关论文
共 50 条
  • [1] Frustum PointVoxel-RCNN: A High-Performance Framework for Accurate 3D Object Detection in Point Clouds and Images
    Shao, Shilin
    Zhou, Yang
    Li, Zhenglin
    Xu, Wentai
    Chen, Guangtao
    Yuan, Tianxin
    2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS, ICCCR 2024, 2024, : 56 - 60
  • [2] Mask-SL RCNN: Feature-Enhanced 3D Object Detection Network for Point Clouds
    Zhong, Yuanhong
    Yang, Guangxia
    Deng, Dihang
    Tang, Panliang
    Ren, Fan
    IEEE PHOTONICS JOURNAL, 2023, 15 (05):
  • [3] P2V-RCNN: Point to Voxel Feature Learning for 3D Object Detection From Point Clouds
    Li, Jiale
    Sun, Yu
    Luo, Shujie
    Zhu, Ziqi
    Dai, Hang
    Krylov, Andrey S.
    Ding, Yong
    Shao, Ling
    IEEE ACCESS, 2021, 9 : 98249 - 98260
  • [4] SPHERERPN: LEARNING SPHERES FOR HIGH-QUALITY REGION PROPOSALS ON 3D POINT CLOUDS OBJECT DETECTION
    Vu, Thang
    Kim, Kookhoi
    Kang, Haeyong
    Xuan Thanh Nguyen
    Luu, Tung M.
    Yoo, Chang D.
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3173 - 3177
  • [5] SPS-RCNN: Semantic-Guided Proposal Sampling for 3D Object Detection from LiDAR Point Clouds
    Xu, Hengxin
    Yang, Lei
    Zhao, Shengya
    Tao, Shan
    Tian, Xinran
    Liu, Kun
    SENSORS, 2025, 25 (04)
  • [6] Knowledge guided object detection and identification in 3D Point Clouds
    Karmacharya, A.
    Boochs, F.
    Tietz, B.
    VIDEOMETRICS, RANGE IMAGING, AND APPLICATIONS XIII, 2015, 9528
  • [7] Deep Hough Voting for 3D Object Detection in Point Clouds
    Qi, Charles R.
    Litany, Or
    He, Kaiming
    Guibas, Leonidas J.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9276 - 9285
  • [8] 3D Object Detection with Normal-map on Point Clouds
    Miao, Jishu
    Hirakawa, Tsubasa
    Yamashita, Takayoshi
    Fujiyoshi, Hironobu
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 569 - 576
  • [9] PG-RCNN: Semantic Surface Point Generation for 3D Object Detection
    Koo, Inyong
    Lee, Inyoung
    Kim, Se-Ho
    Kim, Hee-Seon
    Jeon, Woo-Jin
    Kim, Changick
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18096 - 18105
  • [10] Weakly Supervised Point Clouds Transformer for 3D Object Detection
    Tang, Zuojin
    Sun, Bo
    Ma, Tongwei
    Li, Daosheng
    Xu, Zhenhui
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 3948 - 3955