GRAMO: geometric resampling augmentation for monocular 3D object detection

被引:0
作者
He Guan
Chunfeng Song
Zhaoxiang Zhang
机构
[1] University of Chinese Academy of Sciences,School of Artificial Intelligence
[2] Institute of Automation Chinese Academy of Sciences,Center for Research on Intelligent Perception and Computing, State Key Laboratory of Multimodal Artificial Intelligence Systems
来源
Frontiers of Computer Science | 2024年 / 18卷
关键词
3D detection; monocular; augmentation; geometry;
D O I
暂无
中图分类号
学科分类号
摘要
Data augmentation is widely recognized as an effective means of bolstering model robustness. However, when applied to monocular 3D object detection, non-geometric image augmentation neglects the critical link between the image and physical space, resulting in the semantic collapse of the extended scene. To address this issue, we propose two geometric-level data augmentation operators named Geometric-Copy-Paste (Geo-CP) and Geometric-Crop-Shrink (Geo-CS). Both operators introduce geometric consistency based on the principle of perspective projection, complementing the options available for data augmentation in monocular 3D. Specifically, Geo-CP replicates local patches by reordering object depths to mitigate perspective occlusion conflicts, and Geo-CS re-crops local patches for simultaneous scaling of distance and scale to unify appearance and annotation. These operations ameliorate the problem of class imbalance in the monocular paradigm by increasing the quantity and distribution of geometrically consistent samples. Experiments demonstrate that our geometric-level augmentation operators effectively improve robustness and performance in the KITTI and Waymo monocular 3D detection benchmarks.
引用
收藏
相关论文
共 50 条
[41]   Monocular digital image correlation 3D panoramic measurement based on plane mirror imaging [J].
Ge, Pengxiang ;
Zhang, Qian ;
Gao, Haoran .
INTERNATIONAL CONFERENCE ON OPTICAL AND PHOTONIC ENGINEERING, ICOPEN 2024, 2025, 13509
[42]   xCloth: Extracting Template-free Textured 3D Clothes from a Monocular Image [J].
Srivastava, Astitva ;
Pokhariya, Chandradeep ;
Jinka, Sai Sagar ;
Sharma, Avinash .
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, :2504-2512
[43]   Design and Evaluation of an Augmented Reality App for Learning Geometric Shapes in 3D [J].
Thamrongrat, Pornpon ;
Law, Effie Lai-Chong .
HUMAN-COMPUTER INTERACTION - INTERACT 2019, PT IV, 2019, 11749 :364-385
[44]   A Plastic, Dynamic and Reducible 3D Geometric Model for Simulating Gramineous Leaves [J].
Fournier, Christian ;
Pradal, Christophe .
2012 IEEE FOURTH INTERNATIONAL SYMPOSIUM ON PLANT GROWTH MODELING, SIMULATION, VISUALIZATION AND APPLICATIONS (PMA), 2012, :125-132
[45]   3D human pose estimation from a single image via exemplar augmentation [J].
Yang, Jingjing ;
Wan, Lili ;
Xu, Wanru ;
Wang, Shenghui .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 59 :371-379
[46]   ImageAugmenter: A user-friendly 3D Slicer tool for medical image augmentation [J].
Raggio, Ciro Benito ;
Zaffino, Paolo ;
Spadea, Maria Francesca .
SOFTWAREX, 2024, 28
[47]   Determination and evaluation of 3D biplane imaging geometries without a calibration object [J].
Sen, A ;
Esthappan, J ;
Lan, L ;
Chua, KG ;
Doi, K ;
Hoffmann, KR .
MEDICAL IMAGING 1998: IMAGE PROCESSING, PTS 1 AND 2, 1998, 3338 :1396-1402
[48]   GDR-Net: A Geometric Detail Recovering Network for 3D Scanned Objects [J].
Feng, Wanquan ;
Zhang, Juyong ;
Zhou, Yuanfeng ;
Xin, Shiqing .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (12) :3959-3973
[49]   3D GEOLOGICAL OUTCROP CHARACTERIZATION: AUTOMATIC DETECTION OF 3D PLANES (AZIMUTH AND DIP) USING LiDAR POINT CLOUDS [J].
Anders, K. ;
Haemmerle, M. ;
Miernik, G. ;
Drews, T. ;
Escalona, A. ;
Townsend, C. ;
Hoefle, B. .
XXIII ISPRS CONGRESS, COMMISSION V, 2016, 3 (05) :105-112
[50]   3D POSE ESTIMATION FROM MONOCULAR VIDEO WITH CAMERA-BONE ANGLE REGULARIZATION ON THE IMAGE FEATURE [J].
Ishii, Asuka ;
Ikeda, Hiroo .
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, :3740-3744