MonoSample: Synthetic 3D Data Augmentation Method in Monocular 3D Object Detection

Citations: 0
Authors
Qiao, Junchao [1]
Liu, Biao [1]
Yang, Jiaqi [1]
Wang, Baohua [1]
Xiu, Sanmu [1]
Du, Xin [1]
Nie, Xiaobo [1]
Affiliation
[1] Beijing Jiaotong Univ, Dept Elect Engn & Automat, Beijing 100082, Peoples R China
Source
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024, Vol. 9, No. 8
Funding
National Natural Science Foundation of China;
Keywords
Three-dimensional displays; Training; Object detection; Data augmentation; Solid modeling; Uncertainty; Laser radar; Computer vision for transportation; deep learning for visual perception; object detection; VISION;
DOI
10.1109/LRA.2024.3414272
Chinese Library Classification
TP24 [Robotics];
Discipline classification codes
080202 ; 1405 ;
Abstract
In autonomous driving, locating 3D objects from a calibrated RGB image is both critical and challenging. Current methods typically use a heteroscedastic aleatoric uncertainty loss to regress object depth, which reduces the impact of noisy input while also ensuring the reliability of depth predictions. However, experiments reveal that the uncertainty loss can also lead to serious overfitting and performance degradation. To address this issue, we propose MonoSample, an augmentation method that collects samples from the dataset and places them randomly during training. MonoSample takes occlusion relationships into account and applies strict restrictions to keep the augmented scenes realistic. Furthermore, MonoSample avoids the complex conversion between 2D and 3D, which allows a large number of samples to be extracted and keeps the method efficient. Experiments on different models verify its effectiveness. With MonoSample applied to DID-M3D, our model achieves state-of-the-art (SOTA) performance on the KITTI 3D object detection benchmark.
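For readers unfamiliar with the loss the abstract refers to, the following is a minimal sketch of a heteroscedastic aleatoric uncertainty loss for depth regression, in the Laplacian form commonly used in monocular 3D detection. It is an illustration under assumptions, not the paper's implementation: the function name, the choice of the Laplacian variant, and the log-scale parameterization are all assumed here, since the abstract does not specify them.

```python
import math


def laplacian_uncertainty_depth_loss(pred_depth, log_sigma, gt_depth):
    """Laplacian aleatoric uncertainty loss for a single depth prediction.

    Assumption: the network outputs a depth and a log-scale (log_sigma);
    predicting the logarithm keeps sigma = exp(log_sigma) positive.
    """
    sigma = math.exp(log_sigma)
    # The error term is down-weighted by sigma; the log term penalizes large sigma.
    return math.sqrt(2.0) / sigma * abs(pred_depth - gt_depth) + log_sigma


# Hypothetical values: the same 10 m depth error, with low and high predicted uncertainty.
confident = laplacian_uncertainty_depth_loss(20.0, 0.0, 10.0)
uncertain = laplacian_uncertainty_depth_loss(20.0, 2.0, 10.0)
print(confident > uncertain)
```

Predicting a larger uncertainty lowers the loss on a badly regressed sample, which is how this family of losses reduces the influence of noisy inputs.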
Pages: 7326-7332
Number of pages: 7