SynthText3D: synthesizing scene text images from 3D virtual worlds

被引:0
|
作者
Minghui Liao
Boyu Song
Shangbang Long
Minghang He
Cong Yao
Xiang Bai
机构
[1] Huazhong University of Science and Technology,School of Electronic Information and Communications
[2] Peking University,School of Electronics Engineering and Computer Science
[3] Peking University,School of Economics
[4] MEG VII,undefined
来源
Science China Information Sciences | 2020年 / 63卷
关键词
optical character recognition (OCR); synthetic data; scene text detection; 3D; deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
With the development of deep neural networks, the demand for a significant amount of annotated training data becomes the performance bottlenecks in many fields of research and applications. Image synthesis can generate annotated images automatically and freely, which gains increasing attention recently. In this paper, we propose to synthesize scene text images from the 3D virtual worlds, where the precise descriptions of scenes, editable illumination/visibility, and realistic physics are provided. Different from the previous methods which paste the rendered text on static 2D images, our method can render the 3D virtual scene and text instances as an entirety. In this way, real-world variations, including complex perspective transformations, various illuminations, and occlusions, can be realized in our synthesized scene text images. Moreover, the same text instances with various viewpoints can be produced by randomly moving and rotating the virtual camera, which acts as human eyes. The experiments on the standard scene text detection benchmarks using the generated synthetic data demonstrate the effectiveness and superiority of the proposed method.
引用
收藏
相关论文
共 50 条
  • [1] SynthText3D:synthesizing scene text images from 3D virtual worlds
    Minghui LIAO
    Boyu SONG
    Shangbang LONG
    Minghang HE
    Cong YAO
    Xiang BAI
    ScienceChina(InformationSciences), 2020, 63 (02) : 65 - 78
  • [2] SynthText3D: synthesizing scene text images from 3D virtual worlds
    Liao, Minghui
    Song, Boyu
    Long, Shangbang
    He, Minghang
    Yao, Cong
    Bai, Xiang
    SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (02)
  • [3] Creation of 3D Scene from Raw Text
    Dessai, Sneha N.
    Dhanaraj, Rachel
    2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 1466 - 1469
  • [4] Virtual Worlds for 3D Visualizations
    Pirker, Johanna
    Guetla, Christian
    WORKSHOP PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT ENVIRONMENTS, 2015, 19 : 265 - 272
  • [5] 3D SCENE RECONSTRUCTION FROM RGB IMAGES
    Rotaru, Razvan-Paul
    Gradinaru, Alexandru
    Moldoveanu, Florica
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2024, 86 (02): : 101 - 112
  • [6] 3D SCENE RECONSTRUCTION FROM RGB IMAGES
    Rotaru, Răzvan-Paul
    Grădinaru, Alexandru
    Moldoveanu, Florica
    UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2024, 86 (02): : 101 - 112
  • [7] Research of 3D Virtual Scene Generation and Visualization Based on Images
    Feng, Jian-ping
    Wu, Li-hua
    Ma, Sheng-Quan
    ECOSYSTEM ASSESSMENT AND FUZZY SYSTEMS MANAGEMENT, 2014, 254 : 375 - 386
  • [8] 3D Fluid Scene Synthesizing Based on Video
    Quan, Hongyan
    Xue, Hanyu
    Song, Xiao
    ASIASIM 2014, 2014, 474 : 243 - +
  • [9] Synthesizing 3D images based on voxels
    Son, JY
    Javidi, B
    Saveljev, VV
    OPTICAL INFORMATION SYSTEMS, 2003, 5202 : 1 - 11
  • [10] Generation 3D: Living in virtual worlds
    Macedonia, Mike
    COMPUTER, 2007, 40 (10) : 99 - 101