An Improved System For Real-Time Scene Text Recognition

Cited by: 6
Authors
Yang, Haojin [1]
Wang, Cheng [1]
Che, Xiaoyin [1]
Luo, Sheng [1]
Meinel, Christoph [1]
Affiliation
[1] Univ Potsdam, HPI, POB 900460, D-14440 Potsdam, Germany
Keywords
Video OCR; Scene Text Recognition; Multimedia Indexing; IMAGES; VIDEO;
DOI
10.1145/2671188.2749352
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we showcase a system for real-time text detection and recognition. We apply deep features created by Convolutional Neural Networks (CNNs) to both the text detection and the word recognition tasks. For text detection we follow the common localization-verification scheme, which has already shown its excellent ability in numerous previous works. In the text localization stage, textual regions are roughly detected using an MSER (Maximally Stable Extremal Regions) detector with a high recall rate. False alarms are then eliminated by a CNN classifier, and the remaining text regions are further grouped into words. In the word recognition stage, we developed a skeleton-based text binarization method for segmenting text from its background. A CNN-based recognizer is then applied to recognize characters. Initial experiments show the powerful ability of deep features for text classification compared with commonly used visual features. Our current implementation demonstrates real-time performance for recognizing scene text using a standard PC with a webcam.
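The localization-verification-grouping flow described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the `Box` type, the `toy_score` function (a stand-in for the CNN classifier), the 0.5 threshold, and the gap rule used for grouping are all assumptions chosen for illustration, and the grouping assumes a single line of text.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Box:
    """Axis-aligned candidate region (hypothetical stand-in for an MSER output)."""
    x: int
    y: int
    w: int
    h: int


def verify(candidates: List[Box],
           score_fn: Callable[[Box], float],
           threshold: float = 0.5) -> List[Box]:
    """Verification step: keep candidates whose text score reaches the threshold."""
    return [b for b in candidates if score_fn(b) >= threshold]


def group_into_words(boxes: List[Box], max_gap_ratio: float = 0.5) -> List[List[Box]]:
    """Greedy left-to-right grouping for one text line (assumed rule).

    A box joins the current word when it overlaps the previous box vertically
    and the horizontal gap is small relative to the taller of the two boxes.
    """
    words: List[List[Box]] = []
    for box in sorted(boxes, key=lambda b: b.x):
        if words:
            last = words[-1][-1]
            gap = box.x - (last.x + last.w)
            overlap = min(last.y + last.h, box.y + box.h) - max(last.y, box.y)
            if overlap > 0 and gap <= max_gap_ratio * max(last.h, box.h):
                words[-1].append(box)
                continue
        words.append([box])
    return words


def toy_score(box: Box) -> float:
    """Hypothetical classifier: treats very small regions as false alarms."""
    return 0.9 if box.h >= 10 else 0.1


if __name__ == "__main__":
    candidates = [Box(0, 0, 10, 20), Box(12, 0, 10, 20),
                  Box(60, 0, 10, 20), Box(30, 0, 2, 2)]
    kept = verify(candidates, toy_score)
    for word in group_into_words(kept):
        print(word)
```

In a real system the candidates would come from a high-recall region detector and the score from a trained classifier; the point here is only the order of the stages, where cheap over-detection is followed by a stricter filter before grouping.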
Pages: 657-660
Number of pages: 4