Generating Images with Physics-Based Rendering for an Industrial Object Detection Task: Realism versus Domain Randomization

被引：27

作者：

Eversberg, Leon ^{[1
]}

Lambrecht, Jens ^{[1
]}

机构：

[1] Tech Univ Berlin, Chair Ind Grade Networks & Clouds, Str 17 Juni 135, D-10623 Berlin, Germany

来源：

SENSORS | 2021年 / 21卷 / 23期

关键词：

data-centric AI; deep learning; domain randomization; image synthesis; object detection; physics-based rendering; synthetic images; DATASET;

D O I：

10.3390/s21237901

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Limited training data is one of the biggest challenges in the industrial application of deep learning. Generating synthetic training images is a promising solution in computer vision; however, minimizing the domain gap between synthetic and real-world images remains a problem. Therefore, based on a real-world application, we explored the generation of images with physics-based rendering for an industrial object detection task. Setting up the render engine's environment requires a lot of choices and parameters. One fundamental question is whether to apply the concept of domain randomization or use domain knowledge to try and achieve photorealism. To answer this question, we compared different strategies for setting up lighting, background, object texture, additional foreground objects and bounding box computation in a data-centric approach. We compared the resulting average precision from generated images with different levels of realism and variability. In conclusion, we found that domain randomization is a viable strategy for the detection of industrial objects. However, domain knowledge can be used for object-related aspects to improve detection performance. Based on our results, we provide guidelines and an open-source tool for the generation of synthetic images for new industrial applications.

引用

页数：26

共 55 条

[1]

Andulkar M, 2018, IEEE INT CON AUTO SC, P624, DOI 10.1109/COASE.2018.8560470

[2]

[Anonymous], 2010, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4

[3]

[Anonymous], 2009, Graphic Technology and PhotographyViewing Conditions

[4]

Brachmann E, 2014, LECT NOTES COMPUT SC, V8690, P536, DOI 10.1007/978-3-319-10605-2_35

[5]

Calli B, 2015, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), P510, DOI 10.1109/ICAR.2015.7251504

[6]

Charity Mitchell, WHAT COLOR IS BLACKB

[7]

Denninger M., 2020, INT C ROBOTICS SCIEN

[8] Introducing MVTec ITODD - A Dataset for 3D Object Recognition in Industry [J].

Drost, Bertram ;

Ulrich, Markus ;

Bergmann, Paul ;

Haertinger, Philipp ;

Steger, Carsten .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :2200-2208

[9] Modeling Visual Context Is Key to Augmenting Object Detection Datasets [J].

Dvornik, Nikita ;

Mairal, Julien ;

Schmid, Cordelia .

COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 :375-391

[10] Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection [J].

Dwibedi, Debidatta ;

Misra, Ishan ;

Hebert, Martial .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1310-1319

← 1 2 3 4 5 6 →