SynthText3D: synthesizing scene text images from 3D virtual worlds

Cited by: 0

Authors:

Minghui Liao

Boyu Song

Shangbang Long

Minghang He

Cong Yao

Xiang Bai

Affiliations:

[1] Huazhong University of Science and Technology, School of Electronic Information and Communications

[2] Peking University, School of Electronics Engineering and Computer Science

[3] Peking University, School of Economics

[4] Megvii

Source:

Science China Information Sciences | 2020 / Vol. 63

Keywords:

optical character recognition (OCR); synthetic data; scene text detection; 3D; deep learning

DOI:

Not available

Abstract:

With the development of deep neural networks, the demand for large amounts of annotated training data has become a performance bottleneck in many fields of research and application. Image synthesis can generate annotated images automatically and at no labeling cost, and has attracted increasing attention recently. In this paper, we propose to synthesize scene text images from 3D virtual worlds, which provide precise descriptions of scenes, editable illumination and visibility, and realistic physics. Unlike previous methods, which paste rendered text onto static 2D images, our method renders the 3D virtual scene and the text instances as a whole. In this way, real-world variations, including complex perspective transformations, varied illumination, and occlusions, can be realized in the synthesized scene text images. Moreover, the same text instances can be captured from various viewpoints by randomly moving and rotating the virtual camera, which acts as the human eye. Experiments on standard scene text detection benchmarks using the generated synthetic data demonstrate the effectiveness and superiority of the proposed method.
