MonoSample: Synthetic 3D Data Augmentation Method in Monocular 3D Object Detection

Cited by: 0

Authors:

Qiao, Junchao [1]

Liu, Biao [1]

Yang, Jiaqi [1]

Wang, Baohua [1]

Xiu, Sanmu [1]

Du, Xin [1]

Nie, Xiaobo [1]

Affiliations:

[1] Beijing Jiaotong Univ, Dept Elect Engn & Automat, Beijing 100082, Peoples R China

Source:

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024 / Vol. 9 / Issue 08

Funding:

National Natural Science Foundation of China;

Keywords:

Three-dimensional displays; Training; Object detection; Data augmentation; Solid modeling; Uncertainty; Laser radar; Computer vision for transportation; deep learning for visual perception; object detection; VISION;

DOI:

10.1109/LRA.2024.3414272

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Discipline classification code:

080202 ; 1405 ;

Abstract:

In the context of autonomous driving, it is both critical and challenging to locate 3D objects using a single calibrated RGB image. Current methods typically use a heteroscedastic aleatoric uncertainty loss to regress object depth, thereby reducing the impact of noisy input while also ensuring the reliability of depth predictions. However, experiments reveal that the uncertainty loss can also lead to serious overfitting and performance degradation. To address this issue, we propose MonoSample, an augmentation method that collects samples from the dataset and places them randomly during training. MonoSample takes occlusion relationships into account and applies strict restrictions to ensure the verisimilitude of the augmented scenes. Furthermore, MonoSample avoids the complex conversion process between 2D and 3D, which allows a large number of samples to be extracted and keeps the method efficient. Experiments on different models have verified its effectiveness. Leveraging MonoSample in DID-M3D, our model achieves state-of-the-art (SOTA) performance on the KITTI 3D object detection benchmark.


Pages: 7326 - 7332

Page count: 7
