MonoSample: Synthetic 3D Data Augmentation Method in Monocular 3D Object Detection

被引:0
作者
Qiao, Junchao [1 ]
Liu, Biao [1 ]
Yang, Jiaqi [1 ]
Wang, Baohua [1 ]
Xiu, Sanmu [1 ]
Du, Xin [1 ]
Nie, Xiaobo [1 ]
机构
[1] Beijing Jiaotong Univ, Dept Elect Engn & Automat, Beijing 100082, Peoples R China
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 08期
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Training; Object detection; Data augmentation; Solid modeling; Uncertainty; Laser radar; Computer vision for transportation; deep learning for visual perception; object detection; VISION;
D O I
10.1109/LRA.2024.3414272
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In the context of autonomous driving, it is both critical and challenging to locate 3D objects by using a calibrated RGB image. Current methods typically utilize heteroscedastic aleatoric uncertainty loss to regress the depth of objects, thereby reducing the impact of noisy input while also ensuring the reliability of depth predictions. However, experimentation reveals that uncertainty loss can also lead to serious overfitting issue and performance degradation. To address this issue, we propose MonoSample, an augmentation method that collects samples from the dataset and places them randomly during training. MonoSample takes into account the occlusion relationships and applies strict restrictions to ensure the verisimilitude of the enhanced scenes. Furthermore, MonoSample avoids the complex conversion process between 2D and 3D, thereby enabling the extraction of a large number of samples and efficient operation. Experiments on different models have verified its effectiveness. Leveraging MonoSample in DID-M3D, our model achieves state-of-the-art (SOTA) performance on the KITTI 3D object detection benchmark.
引用
收藏
页码:7326 / 7332
页数:7
相关论文
共 50 条
  • [31] MonoPoly: A practical monocular 3D object detector
    Guan, He
    Song, Chunfeng
    Zhang, Zhaoxiang
    Tan, Tieniu
    PATTERN RECOGNITION, 2022, 132
  • [32] Pattern-Aware Data Augmentation for LiDAR 3D Object Detection
    Hu, Jordan S. K.
    Was, Steven L.
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2703 - 2710
  • [33] Semantic Shape and Trajectory Reconstruction for Monocular Cooperative 3D Object Detection
    Cserni, Marton
    Rovid, Andras
    IEEE ACCESS, 2024, 12 : 167153 - 167167
  • [34] Investigating the Effectiveness of 3D Monocular Object Detection Methods for Roadside Scenarios
    Barra, Silvio
    Marras, Mirko
    Mohamed, Sondos
    Podda, Alessandro Sebastian
    Saia, Roberto
    39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024, : 221 - 223
  • [35] Transformer3D-Det: Improving 3D Object Detection by Vote Refinement
    Zhao, Lichen
    Guo, Jinyang
    Xu, Dong
    Sheng, Lu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4735 - 4746
  • [36] Depth-Guided Vision Transformer With Normalizing Flows for Monocular 3D Object Detection
    Pan, Cong
    Peng, Junran
    Zhang, Zhaoxiang
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2024, 11 (03) : 673 - 689
  • [37] Disentangling Monocular 3D Object Detection: From Single to Multi-Class Recognition
    Simonelli, Andrea
    Bulo, Samuel Rota
    Porzi, Lorenzo
    Antequera, Manuel Lopez
    Kontschieder, Peter
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1219 - 1231
  • [38] MonoAMNet: Three-Stage Real-Time Monocular 3D Object Detection With Adaptive Methods
    Pan, Huihui
    Jia, Yisong
    Wang, Jue
    Sun, Weichao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (03) : 3574 - 3587
  • [39] Shape Prior Guided Instance Disparity Estimation for 3D Object Detection
    Chen, Linghao
    Sun, Jiaming
    Xie, Yiming
    Zhang, Siyu
    Shuai, Qing
    Jiang, Qinhong
    Zhang, Guofeng
    Bao, Hujun
    Zhou, Xiaowei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5529 - 5540
  • [40] MonoDFM: Density Field Modeling-Based End-to-End Monocular 3D Object Detection
    Liu, Gang
    Huang, Xinrui
    Xie, Xiaoxiao
    IEEE ACCESS, 2025, 13 : 74015 - 74031