Synthetic Dataset Generation Method for Object Detection

被引：0

作者：

Zhou, Ningning ^{[1
]}

Li, Tong ^{[2
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Sch Comp, 9 Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China

[2] Jinzhuan Informat Technol Co Ltd, Nanjing 210012, Jiangsu, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS | 2025年 / 18卷 / 01期

关键词：

Object detection; Synthetic data set; Global domain randomization; Automatic label annotation; Anti-vibration damper;

D O I：

10.1007/s44196-025-00817-4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

To address the high construction cost of datasets for object detection, particularly in industrial application scenarios where sufficient sample images cannot be obtained from the Internet due to the specialized nature and diversity of objects and their working environments, this paper proposes a method to automatically generate synthetic datasets and train object detection models on them. First, 3D models of the target devices are created and rendered to ensure that the synthetic images exhibit realistic texture and detail. Next, a simulation environment is constructed and the 3D models are integrated into this environment using global domain randomization techniques. Finally, computer graphics methods are applied to automatically annotate target objects in the synthetic images. This approach effectively reduces the cost of data acquisition while maintaining the detection accuracy of the models. Several mainstream object detection models, including Faster R-CNN, SSD, and YOLO, are trained on synthetic datasets of anti-vibration dampers. Experimental results on real-world images demonstrate that models trained on synthetic data achieve relatively high accuracy. Furthermore, fine-tuning these models with a very small number of real images significantly enhances their performance. In addition, the models exhibit robustness against interference and occlusion.

引用

页数：15

共 18 条

[1]

Angus M, 2018, IEEE INT C INTELL TR, P985, DOI 10.1109/ITSC.2018.8569519

[2] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[3] Relation Extraction in Underexplored Biomedical Domains: A Diversity-optimized Sampling and Synthetic Data Generation Approach [J].

Delmas, Maxime ;

Wysocka, Magdalena ;

Freitas, Andre .

COMPUTATIONAL LINGUISTICS, 2024, 50 (03) :953-1000

[4] FlowNet: Learning Optical Flow with Convolutional Networks [J].

Dosovitskiy, Alexey ;

Fischer, Philipp ;

Ilg, Eddy ;

Haeusser, Philip ;

Hazirbas, Caner ;

Golkov, Vladimir ;

van der Smagt, Patrick ;

Cremers, Daniel ;

Brox, Thomas .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2758-2766

[5] Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection [J].

Dwibedi, Debidatta ;

Misra, Ishan ;

Hebert, Martial .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1310-1319

[6] Virtual Worlds as Proxy for Multi-Object Tracking Analysis [J].

Gaidon, Adrien ;

Wang, Qiao ;

Cabon, Yohann ;

Vig, Eleonora .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4340-4349

[7] Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) :1904-1916

[8] Synthetic data generation using building information models [J].

Hong, Yeji ;

Park, Somin ;

Kim, Hongjo ;

Kim, Hyoungkwan .

AUTOMATION IN CONSTRUCTION, 2021, 130

[9] Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis [J].

Huang, Rui ;

Zhang, Shu ;

Li, Tianyu ;

He, Ran .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2458-2467

[10] Unsupervised Primitive Discovery for Improved 3D Generative Modeling [J].

Khan, Salman H. ;

Guo, Yulan ;

Hayat, Munawar ;

Barnes, Nick .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9731-9740

← 1 2 →