MonoSample: Synthetic 3D Data Augmentation Method in Monocular 3D Object Detection

Citations: 0

Authors:

Qiao, Junchao [1]

Liu, Biao [1]

Yang, Jiaqi [1]

Wang, Baohua [1]

Xiu, Sanmu [1]

Du, Xin [1]

Nie, Xiaobo [1]

Affiliations:

[1] Beijing Jiaotong Univ, Dept Elect Engn & Automat, Beijing 100082, Peoples R China

Source:

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024 / Vol. 9 / No. 08

Funding:

National Natural Science Foundation of China

Keywords:

Three-dimensional displays; Training; Object detection; Data augmentation; Solid modeling; Uncertainty; Laser radar; Computer vision for transportation; deep learning for visual perception; object detection; VISION

DOI:

10.1109/LRA.2024.3414272

Chinese Library Classification number:

TP24 [Robotics]

Subject classification codes:

080202; 1405

Abstract:

In the context of autonomous driving, it is both critical and challenging to locate 3D objects using a single calibrated RGB image. Current methods typically use a heteroscedastic aleatoric uncertainty loss to regress object depth, which reduces the impact of noisy input and makes the depth predictions more reliable. However, experiments reveal that the uncertainty loss can also lead to serious overfitting and performance degradation. To address this issue, we propose MonoSample, an augmentation method that collects samples from the dataset and places them randomly during training. MonoSample takes occlusion relationships into account and applies strict restrictions to ensure the verisimilitude of the augmented scenes. Furthermore, MonoSample avoids the complex conversion process between 2D and 3D, which enables the extraction of a large number of samples and efficient operation. Experiments on different models have verified its effectiveness. Leveraging MonoSample in DID-M3D, our model achieves state-of-the-art (SOTA) performance on the KITTI 3D object detection benchmark.
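For readers unfamiliar with the loss the abstract refers to, the following is a minimal sketch of one common form of heteroscedastic aleatoric uncertainty loss for depth regression, the Laplacian variant. It is an illustration only: the abstract does not give the exact formulation used by MonoSample or DID-M3D, and the function name `laplacian_uncertainty_loss` and its parameters are hypothetical.

```python
import math


def laplacian_uncertainty_loss(pred_depth, log_sigma, target_depth):
    """Mean Laplacian aleatoric uncertainty loss over a batch of objects.

    Assumed form (not taken from the paper):
        loss = sqrt(2) * exp(-log_sigma) * |pred - target| + log_sigma
    The network predicts log_sigma per object, so a sample with a large
    residual can lower its penalty by reporting a higher uncertainty.
    """
    total = 0.0
    for d_hat, s, d in zip(pred_depth, log_sigma, target_depth):
        residual = abs(d_hat - d)
        total += math.sqrt(2.0) * math.exp(-s) * residual + s
    return total / len(pred_depth)


# Same 2 m depth error, evaluated with two different predicted uncertainties.
confident = laplacian_uncertainty_loss([12.0], [0.0], [10.0])
uncertain = laplacian_uncertainty_loss([12.0], [1.0], [10.0])
```

In this sketch, raising the predicted uncertainty lowers the loss for the same depth error. That down-weighting of hard samples is what makes the loss robust to noisy inputs.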


Pages: 7326-7332

Page count: 7

50 items in total

[1] Aerial Monocular 3D Object Detection
Hu, Yue
Fang, Shaoheng
Xie, Weidi
Chen, Siheng
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
[2] Monocular 3D Object Detection With Motion Feature Distillation
Hu, Henan
Li, Muyu
Zhu, Ming
Gao, Wen
Liu, Peiyu
Chan, Kwok-Leung
IEEE ACCESS, 2023, 11 : 82933 - 82945
[3] Shape-Aware Monocular 3D Object Detection
Chen, Wei
Zhao, Jie
Zhao, Wan-Lei
Wu, Song-Yuan
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 6416 - 6424
[4] Monocular 3D Object Detection With Sequential Feature Association and Depth Hint Augmentation
Gao, Tianze
Pan, Huihui
Gao, Huijun
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2022, 7 (02) : 240 - 250
[5] Monocular 3D Object Detection Utilizing Auxiliary Learning With Deformable Convolution
Chen, Jiun-Han
Shieh, Jeng-Lun
Haq, Muhamad Amirul
Ruan, Shanq-Jang
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (03) : 2424 - 2436
[6] GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection
Lu, Yan
Ma, Xinzhu
Yang, Lei
Zhang, Tianzhu
Liu, Yating
Chu, Qi
He, Tong
Li, Yonghui
Ouyang, Wanli
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (02) : 900 - 915
[7] RS-Aug: Improve 3D Object Detection on LiDAR With Realistic Simulator Based Data Augmentation
An, Pei
Liang, Junxiong
Ma, Jie
Chen, Yanfei
Wang, Liheng
Yang, You
Liu, Qiong
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (09) : 10165 - 10176
[8] MonoEF: Extrinsic Parameter Free Monocular 3D Object Detection
Zhou, Yunsong
He, Yuan
Zhu, Hongzi
Wang, Cheng
Li, Hongyang
Jiang, Qinhong
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 10114 - 10128
[9] You Only Look Bottom-Up for Monocular 3D Object Detection
Xiong, Kaixin
Zhang, Dingyuan
Liang, Dingkang
Liu, Zhe
Yang, Hongcheng
Dikubab, Wondimu
Cheng, Jianwei
Bai, Xiang
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (11) : 7464 - 7471
[10] MonoGRNet: A General Framework for Monocular 3D Object Detection
Qin, Zengyi
Wang, Jinglu
Lu, Yan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5170 - 5184
