Synthetic Dataset Generation Method for Object Detection

被引:0
作者
Zhou, Ningning [1 ]
Li, Tong [2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Comp, 9 Wenyuan Rd, Nanjing 210023, Jiangsu, Peoples R China
[2] Jinzhuan Informat Technol Co Ltd, Nanjing 210012, Jiangsu, Peoples R China
关键词
Object detection; Synthetic data set; Global domain randomization; Automatic label annotation; Anti-vibration damper;
D O I
10.1007/s44196-025-00817-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To address the high construction cost of datasets for object detection, particularly in industrial application scenarios where sufficient sample images cannot be obtained from the Internet due to the specialized nature and diversity of objects and their working environments, this paper proposes a method to automatically generate synthetic datasets and train object detection models on them. First, 3D models of the target devices are created and rendered to ensure that the synthetic images exhibit realistic texture and detail. Next, a simulation environment is constructed and the 3D models are integrated into this environment using global domain randomization techniques. Finally, computer graphics methods are applied to automatically annotate target objects in the synthetic images. This approach effectively reduces the cost of data acquisition while maintaining the detection accuracy of the models. Several mainstream object detection models, including Faster R-CNN, SSD, and YOLO, are trained on synthetic datasets of anti-vibration dampers. Experimental results on real-world images demonstrate that models trained on synthetic data achieve relatively high accuracy. Furthermore, fine-tuning these models with a very small number of real images significantly enhances their performance. In addition, the models exhibit robustness against interference and occlusion.
引用
收藏
页数:15
相关论文
共 18 条
  • [1] Angus M, 2018, IEEE INT C INTELL TR, P985, DOI 10.1109/ITSC.2018.8569519
  • [2] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [3] Relation Extraction in Underexplored Biomedical Domains: A Diversity-optimized Sampling and Synthetic Data Generation Approach
    Delmas, Maxime
    Wysocka, Magdalena
    Freitas, Andre
    [J]. COMPUTATIONAL LINGUISTICS, 2024, 50 (03) : 953 - 1000
  • [4] FlowNet: Learning Optical Flow with Convolutional Networks
    Dosovitskiy, Alexey
    Fischer, Philipp
    Ilg, Eddy
    Haeusser, Philip
    Hazirbas, Caner
    Golkov, Vladimir
    van der Smagt, Patrick
    Cremers, Daniel
    Brox, Thomas
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2758 - 2766
  • [5] Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection
    Dwibedi, Debidatta
    Misra, Ishan
    Hebert, Martial
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1310 - 1319
  • [6] Virtual Worlds as Proxy for Multi-Object Tracking Analysis
    Gaidon, Adrien
    Wang, Qiao
    Cabon, Yohann
    Vig, Eleonora
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4340 - 4349
  • [7] Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) : 1904 - 1916
  • [8] Synthetic data generation using building information models
    Hong, Yeji
    Park, Somin
    Kim, Hongjo
    Kim, Hyoungkwan
    [J]. AUTOMATION IN CONSTRUCTION, 2021, 130
  • [9] Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis
    Huang, Rui
    Zhang, Shu
    Li, Tianyu
    He, Ran
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2458 - 2467
  • [10] Unsupervised Primitive Discovery for Improved 3D Generative Modeling
    Khan, Salman H.
    Guo, Yulan
    Hayat, Munawar
    Barnes, Nick
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9731 - 9740