Could scene context be beneficial for scene text detection?

被引:31
|
作者
Zhu, Anna [1 ]
Gao, Renwu [2 ]
Uchida, Seiichi [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Automat, State Key Lab Multispectral Informat Proc Technol, Wuhan 430074, Peoples R China
[2] Kyushu Univ, Human Interface Lab, Informat Sci & Elect Engn, Fukuoka 812, Japan
关键词
Scene text detection; Fully connected CRF; Convolutional neural network; Character feature; Context feature; READING TEXT; SEGMENTATION; IMAGE; RECOGNITION;
D O I
10.1016/j.patcog.2016.04.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text detection and scene segmentation are meaningful tasks in the computer vision field. Could the semantic scene segmentation assist scene text detection in any degree? For example, can we expect the probability of a region being text is low if its surrounding segment, i.e., its context, is labeled as sky? In this paper, we have a positive answer by constructing a scene context-based text detection model. In this model, we use texton features and a fully-connected conditional random field (CRF) to estimate pixel-level scene class's probability to be considered as image's context feature. Meanwhile, maximally stable extremal regions (MSERs) are extracted, integrated and extended as image patches of character candidates. Then, each image patch is fed to a simple two-layer convolutional neural network (CNN) to automatically extract its character feature. The averaged context feature of the corresponding patch is considered as the patch's context feature. The character feature and context feature are fused as the input into a support vector machine for text/non-text determination. Finally, as a post-processing, neighboring text regions are grouped hierarchically. The performance evaluation on ICDAR2013 and SVT databases, as well as a preliminary evaluation on a patch-level database, proves that the scene context can improve the performance of scene text detection. Moreover, the comparative study with state-of-the-art methods shows the top-level performance of our method. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:204 / 215
页数:12
相关论文
共 50 条
  • [1] LEVERAGING SURROUNDING CONTEXT FOR SCENE TEXT DETECTION
    Li, Yao
    Shen, Chunhua
    Jia, Wenjing
    van den Hengel, Anton
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2264 - 2268
  • [2] Robust Text Detection in Natural Scene Images
    Yin, Xu-Cheng
    Yin, Xuwang
    Huang, Kaizhu
    Hao, Hong-Wei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (05) : 970 - 983
  • [3] Scene Text Detection in Natural Images: A Review
    Cao, Dongping
    Zhong, Yong
    Wang, Lishun
    He, Yilong
    Dang, Jiachen
    SYMMETRY-BASEL, 2020, 12 (12): : 1 - 26
  • [4] Arbitrarily Shaped Scene Text Detection With a Mask Tightness Text Detector
    Liu, Yuliang
    Jin, Lianwen
    Fang, Chuanming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 2918 - 2930
  • [5] CRF based text detection for natural scene images using convolutional neural network and context information
    Wang, Yanna
    Shi, Cunzhao
    Xiao, Baihua
    Wang, Chunheng
    Qi, Chengzuo
    NEUROCOMPUTING, 2018, 295 : 46 - 58
  • [6] FTPN: Scene Text Detection With Feature Pyramid Based Text Proposal Network
    Liu, Fagui
    Chen, Cheng
    Gu, Dian
    Zheng, Jingzhong
    IEEE ACCESS, 2019, 7 : 44219 - 44228
  • [7] Scene Text Detection Based On Fusion Network
    Zhao, Xuezhuan
    Zhou, Ziheng
    Li, Lingling
    Pei, Lishen
    Ye, Zhaoyi
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (10)
  • [8] A decade: Review of scene text detection methods
    Rainarli, Ednawati
    Suprapto
    Wahyono
    COMPUTER SCIENCE REVIEW, 2021, 42
  • [9] FDTA: Fully Convolutional Scene Text Detection With Text Attention
    Cao, Yongcun
    Ma, Shuaisen
    Pan, Haichuan
    IEEE ACCESS, 2020, 8 : 155441 - 155449
  • [10] Contextual Text Block Detection Towards Scene Text Understanding
    Xue, Chuhui
    Huang, Jiaxing
    Zhang, Wenqing
    Lu, Shijian
    Wang, Changhu
    Bai, Song
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 374 - 391