Character Sequence Prediction Method for Training Data Creation in the Task of Text Recognition

被引:4
|
作者
Zlobin, Pavel K. [1 ,2 ]
Chernyshova, Yulia S. [1 ,2 ]
Sheshkus, Alexander, V [1 ,2 ]
Arlazarov, Vladimir V. [1 ,3 ]
机构
[1] Smart Engines Serv LLC, Moscow, Russia
[2] Russian Acad Sci, Fed Res Ctr Comp Sci & Control, Moscow, Russia
[3] RAS, Inst Informat Transmiss Problems, Moscow, Russia
来源
FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021) | 2022年 / 12084卷
基金
俄罗斯基础研究基金会;
关键词
training data; neural network; OCR; synthetic data; text generation;
D O I
10.1117/12.2623773
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
For text line recognition, much attention is paid to augmentation of the training images. Yet the inner structure of the textual information in the images also affects the accuracy of the resulting model. In this paper, we propose an ANN-based method for textual data generation for printing in images with a background of a synthetic training sample. In our method we avoid the usage of completely random sequences as well as the dictionary-based ones. As a result, we gain the data that saves the basic properties of the target language model, such as the balance of vowels and consonants, but avoid the lexicon-based properties, like the prevalence of the specific characters. Moreover, as our method focuses only on high-levels features and does not try to generate the real words, we can use a small training sample and light-weight ANN for text generation. To check our method, we train three ANNs with same architecture, but with different training samples. We choose machine readable zones as a target field because of their structure that does not correspond with the ordinary lexicon. The results of the experiments on three public datasets of identity documents demonstrate the effectiveness of our method and allows to enhance the state-of-the art results for the target field.
引用
收藏
页数:9
相关论文
共 9 条
  • [1] On the Use of Neural Text Generation for the Task of Optical Character Recognition
    Mohammadi, Mahnaz
    Jaf, Sardar
    McGough, Andrew Stephen
    Breckon, Toby P.
    Matthews, Peter
    Theodoropoulos, Georgios
    Obara, Boguslaw
    2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
  • [2] Multi-Task CTC for Joint Handwriting Recognition and Character Bounding Box Prediction
    Wigington, Curtis
    PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2023, 2023,
  • [3] Research on a Web System Data-Filling Method Based on Optical Character Recognition and Multi-Text Similarity
    Su, Hailu
    Kang, Ruiqing
    Fan, Yunli
    APPLIED SCIENCES-BASEL, 2024, 14 (03):
  • [4] An improved scene text extraction method using Conditional Random Field and Optical Character Recognition
    Zhang, Hongwei
    Liu, Changsong
    Yang, Cheng
    Ding, Xiaoqing
    Wang, KongQiao
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 708 - 712
  • [5] A vulnerability severity prediction method based on bimodal data and multi-task learning
    Du, Xiaozhi
    Zhang, Shiming
    Zhou, Yanrong
    Du, Hongyuan
    JOURNAL OF SYSTEMS AND SOFTWARE, 2024, 213
  • [6] Pre-training Techniques for Improving Text-to-Speech Synthesis by Automatic Speech Recognition Based Data Enhancement
    Liu, Yazhu
    Xue, Shaofei
    Tang, Jian
    MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2022, 2023, 1765 : 162 - 172
  • [7] A new density-based method for reducing the amount of training data in k-NN text classification
    Yuan, Fang
    Yang, Liu
    Yu, Ge
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3372 - +
  • [8] Research On Pre-Training Method and Generalization Ability of Big Data Recognition Model of the Internet of Things
    Tan, Junyang
    Xia, Dan
    Dong, Shiyun
    Zhu, Honghao
    Xu, Binshi
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (05)
  • [9] A Sequence and Network Embedding Method for Bus Arrival Time Prediction Using GPS Trajectory Data Only
    Li, Changlin
    Lin, Shuai
    Zhang, Honglei
    Zhao, Hongke
    Liu, Lishan
    Jia, Ning
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (05) : 5024 - 5038