DDet3D: embracing 3D object detector with diffusion

Authors
Gopi Krishna Erabati [1]
Helder Araujo [1]
Affiliations
[1] University of Coimbra, Institute of Systems and Robotics
Keywords
3D object detection; Diffusion; LiDAR; Autonomous driving; Computer vision
DOI
10.1007/s10489-024-06045-1
Abstract
Existing approaches to 3D object detection rely on heuristic or learnable object proposals, which must be optimised during training. We replace these hand-crafted or learnable proposals with randomly generated ones, formulating a new paradigm in which a diffusion model detects 3D objects from a set of randomly generated and supervised learning-based object proposals in an autonomous driving application. We propose DDet3D, a diffusion-based 3D object detection framework that formulates 3D object detection as a generative task over 3D bounding box coordinates. To our knowledge, this work is the first to formulate 3D object detection with a denoising diffusion model and to establish that 3D randomly generated and supervised learning-based proposals (as opposed to empirical anchors or learnt queries) are also viable object candidates for 3D object detection. During training, noisy 3D boxes are obtained from the 3D ground truth boxes by progressively adding Gaussian noise, and the DDet3D network is trained to reverse this diffusion process. During inference, the DDet3D network iteratively refines the 3D randomly generated and supervised learning-based noisy boxes to predict 3D bounding boxes, conditioned on the LiDAR Bird’s Eye View (BEV) features. An advantage of DDet3D is that it decouples the training and inference stages, so a larger number of proposal boxes or sampling steps can be used during inference to improve accuracy. We conduct extensive experiments and analysis on the nuScenes and KITTI datasets, where DDet3D achieves competitive performance compared to well-designed 3D object detectors. Our work serves as a strong baseline for exploring and employing more efficient diffusion models for 3D perception tasks.
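The training-time noising described in the abstract can be illustrated with a minimal sketch of the standard forward diffusion step, x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps. This is not the authors' implementation: the cosine noise schedule, the 7-value box layout (x, y, z, w, l, h, yaw), the normalisation range, and all function names are assumptions made here for illustration, since the abstract does not specify them.

```python
import math
import random


def cosine_alpha_bar(t: int, num_steps: int, s: float = 0.008) -> float:
    """Cumulative signal level alpha_bar_t of a cosine schedule (assumed, not from the paper)."""
    def f(u: float) -> float:
        return math.cos((u / num_steps + s) / (1 + s) * math.pi / 2) ** 2
    return f(t) / f(0)


def noise_boxes(gt_boxes, t, num_steps=1000, rng=None):
    """Sample x_t ~ q(x_t | x_0) for every ground-truth box x_0.

    Each box is a hypothetical list (x, y, z, w, l, h, yaw), assumed to be
    normalised to roughly [-1, 1] before noise is added.
    """
    rng = rng or random.Random()
    a_bar = cosine_alpha_bar(t, num_steps)
    signal, sigma = math.sqrt(a_bar), math.sqrt(1.0 - a_bar)
    return [
        [signal * v + sigma * rng.gauss(0.0, 1.0) for v in box]
        for box in gt_boxes
    ]


def random_proposals(num_boxes, dim=7, rng=None):
    """Pure Gaussian boxes, the assumed starting point for inference."""
    rng = rng or random.Random()
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_boxes)]
```

Because `random_proposals` takes the number of boxes as an argument, the same trained denoiser could in principle be run with more proposals or more refinement steps at inference than were used in training, which is the decoupling the abstract refers to.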