Augmented Scene Text Recognition Using Crosswise Feature Extraction

Cited by: 0
Authors
Cinu C Kiliroor
S. Shrija
R. Ajay
Affiliations
[1] Vellore Institute of Technology
[2] Madras Institute of Technology
[3] Anna University
Source
Wireless Personal Communications | 2022 / Vol. 123
Keywords
CNN; Crosswise feature extraction; Text recognition; Feedforward Neural Network (NN)
DOI
Not available
Abstract
In today's highly computerized society, detecting and recognizing text in natural scene images is a complex task, and such text can be difficult even for human vision to recognize properly. Most existing algorithms and models focus on detecting and recognizing text in still images. Many recent machine translation systems are built on the encoder-decoder framework, which encodes the input sequence and then decodes the output from that encoding. The encoder and the decoder use an attention mechanism as their interface, which makes the model complex. To address this, an alternative method for recognizing text in videos is proposed. The approach is based on a single two-dimensional convolutional neural network (2D CNN). An algorithm for extracting features from an image, called crosswise feature extraction, is also proposed. In testing, crosswise feature extraction gave better recognition accuracy while requiring less training time than the conventional feature extraction technique used by CNNs.
Pages: 421-436
Page count: 15