MonoSample: Synthetic 3D Data Augmentation Method in Monocular 3D Object Detection

Cited by: 0
Authors
Qiao, Junchao [1 ]
Liu, Biao [1 ]
Yang, Jiaqi [1 ]
Wang, Baohua [1 ]
Xiu, Sanmu [1 ]
Du, Xin [1 ]
Nie, Xiaobo [1 ]
Affiliation
[1] Beijing Jiaotong Univ, Dept Elect Engn & Automat, Beijing 100082, Peoples R China
Source
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024 / Vol. 9 / No. 08
Funding
National Natural Science Foundation of China;
Keywords
Three-dimensional displays; Training; Object detection; Data augmentation; Solid modeling; Uncertainty; Laser radar; Computer vision for transportation; deep learning for visual perception; object detection; VISION;
DOI
10.1109/LRA.2024.3414272
Chinese Library Classification number
TP24 [Robotics];
Subject classification code
080202 ; 1405 ;
Abstract
In the context of autonomous driving, it is both critical and challenging to locate 3D objects by using a calibrated RGB image. Current methods typically utilize heteroscedastic aleatoric uncertainty loss to regress the depth of objects, thereby reducing the impact of noisy input while also ensuring the reliability of depth predictions. However, experimentation reveals that uncertainty loss can also lead to serious overfitting issues and performance degradation. To address this issue, we propose MonoSample, an augmentation method that collects samples from the dataset and places them randomly during training. MonoSample takes into account the occlusion relationships and applies strict restrictions to ensure the verisimilitude of the enhanced scenes. Furthermore, MonoSample avoids the complex conversion process between 2D and 3D, thereby enabling the extraction of a large number of samples and efficient operation. Experiments on different models have verified its effectiveness. Leveraging MonoSample in DID-M3D, our model achieves state-of-the-art (SOTA) performance on the KITTI 3D object detection benchmark.
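For readers unfamiliar with the term, the following is a minimal sketch of a heteroscedastic aleatoric uncertainty loss for depth regression. It assumes the Laplacian form that is common in monocular 3D detectors; this record does not state which formulation the paper uses, and the function and parameter names are hypothetical.

```python
import math


def laplacian_uncertainty_loss(pred_depth, log_variance, target_depth):
    """Illustrative per-object loss: sqrt(2)/sigma * |error| + log(sigma).

    Assumption: the network predicts log(sigma^2) rather than sigma,
    a common choice for numerical stability. Not taken from the paper.
    """
    sigma = math.exp(0.5 * log_variance)        # sigma = sqrt(exp(log sigma^2))
    error = abs(pred_depth - target_depth)      # L1 depth error in metres
    # 0.5 * log_variance equals log(sigma); it penalizes large uncertainty.
    return math.sqrt(2.0) / sigma * error + 0.5 * log_variance


# A large depth error is penalized less when the predicted uncertainty is high,
# while a perfect prediction is penalized more for claiming high uncertainty.
confident_wrong = laplacian_uncertainty_loss(10.0, 0.0, 12.0)
uncertain_wrong = laplacian_uncertainty_loss(10.0, 2.0, 12.0)
uncertain_right = laplacian_uncertainty_loss(12.0, 2.0, 12.0)
```

The trade-off between the error term and the log term is what lets such a loss down-weight noisy samples.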
Pages: 7326 - 7332
Page count: 7
Related papers
50 in total
  • [1] Aerial Monocular 3D Object Detection
    Hu, Yue
    Fang, Shaoheng
    Xie, Weidi
    Chen, Siheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
  • [2] Monocular 3D Object Detection With Motion Feature Distillation
    Hu, Henan
    Li, Muyu
    Zhu, Ming
    Gao, Wen
    Liu, Peiyu
    Chan, Kwok-Leung
    IEEE ACCESS, 2023, 11 : 82933 - 82945
  • [3] Shape-Aware Monocular 3D Object Detection
    Chen, Wei
    Zhao, Jie
    Zhao, Wan-Lei
    Wu, Song-Yuan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 6416 - 6424
  • [4] Monocular 3D Object Detection With Sequential Feature Association and Depth Hint Augmentation
    Gao, Tianze
    Pan, Huihui
    Gao, Huijun
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2022, 7 (02) : 240 - 250
  • [5] Monocular 3D Object Detection Utilizing Auxiliary Learning With Deformable Convolution
    Chen, Jiun-Han
    Shieh, Jeng-Lun
    Haq, Muhamad Amirul
    Ruan, Shanq-Jang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (03) : 2424 - 2436
  • [6] GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection
    Lu, Yan
    Ma, Xinzhu
    Yang, Lei
    Zhang, Tianzhu
    Liu, Yating
    Chu, Qi
    He, Tong
    Li, Yonghui
    Ouyang, Wanli
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (02) : 900 - 915
  • [7] RS-Aug: Improve 3D Object Detection on LiDAR With Realistic Simulator Based Data Augmentation
    An, Pei
    Liang, Junxiong
    Ma, Jie
    Chen, Yanfei
    Wang, Liheng
    Yang, You
    Liu, Qiong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (09) : 10165 - 10176
  • [8] MonoEF: Extrinsic Parameter Free Monocular 3D Object Detection
    Zhou, Yunsong
    He, Yuan
    Zhu, Hongzi
    Wang, Cheng
    Li, Hongyang
    Jiang, Qinhong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 10114 - 10128
  • [9] You Only Look Bottom-Up for Monocular 3D Object Detection
    Xiong, Kaixin
    Zhang, Dingyuan
    Liang, Dingkang
    Liu, Zhe
    Yang, Hongcheng
    Dikubab, Wondimu
    Cheng, Jianwei
    Bai, Xiang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (11) : 7464 - 7471
  • [10] MonoGRNet: A General Framework for Monocular 3D Object Detection
    Qin, Zengyi
    Wang, Jinglu
    Lu, Yan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5170 - 5184