Could scene context be beneficial for scene text detection?

被引:31
|
作者
Zhu, Anna [1 ]
Gao, Renwu [2 ]
Uchida, Seiichi [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Automat, State Key Lab Multispectral Informat Proc Technol, Wuhan 430074, Peoples R China
[2] Kyushu Univ, Human Interface Lab, Informat Sci & Elect Engn, Fukuoka 812, Japan
关键词
Scene text detection; Fully connected CRF; Convolutional neural network; Character feature; Context feature; READING TEXT; SEGMENTATION; IMAGE; RECOGNITION;
D O I
10.1016/j.patcog.2016.04.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text detection and scene segmentation are meaningful tasks in the computer vision field. Could the semantic scene segmentation assist scene text detection in any degree? For example, can we expect the probability of a region being text is low if its surrounding segment, i.e., its context, is labeled as sky? In this paper, we have a positive answer by constructing a scene context-based text detection model. In this model, we use texton features and a fully-connected conditional random field (CRF) to estimate pixel-level scene class's probability to be considered as image's context feature. Meanwhile, maximally stable extremal regions (MSERs) are extracted, integrated and extended as image patches of character candidates. Then, each image patch is fed to a simple two-layer convolutional neural network (CNN) to automatically extract its character feature. The averaged context feature of the corresponding patch is considered as the patch's context feature. The character feature and context feature are fused as the input into a support vector machine for text/non-text determination. Finally, as a post-processing, neighboring text regions are grouped hierarchically. The performance evaluation on ICDAR2013 and SVT databases, as well as a preliminary evaluation on a patch-level database, proves that the scene context can improve the performance of scene text detection. Moreover, the comparative study with state-of-the-art methods shows the top-level performance of our method. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:204 / 215
页数:12
相关论文
共 50 条
  • [21] MULTI-ORIENTED TEXT DETECTION IN SCENE IMAGES
    Basavanna, M.
    Shivakumara, P.
    Srivatsa, S. K.
    Kumar, G. Hemantha
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (07)
  • [22] A cascaded method for text detection in natural scene images
    Zheng, Yang
    Li, Qing
    Liu, Jie
    Liu, Heping
    Li, Gen
    Zhang, Shuwu
    NEUROCOMPUTING, 2017, 238 : 307 - 315
  • [23] Detection of artificial and scene text in images and video frames
    Anthimopoulos, Marios
    Gatos, Basilis
    Pratikakis, Ioannis
    PATTERN ANALYSIS AND APPLICATIONS, 2013, 16 (03) : 431 - 446
  • [24] HAM: Hidden Anchor Mechanism for Scene Text Detection
    Hou, Jie-Bo
    Zhu, Xiaobin
    Liu, Chang
    Sheng, Kekai
    Wu, Long-Huang
    Wang, Hongfa
    Yin, Xu-Cheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7904 - 7916
  • [25] SPN: short path network for scene text detection
    Cai, Yuanqiang
    Wang, Weiqiang
    Ren, Haiqing
    Lu, Ke
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (10) : 6075 - 6087
  • [26] Integrated Method for Text Detection in Natural Scene Images
    Zheng, Yang
    Liu, Jie
    Liu, Heping
    Li, Qing
    Li, Gen
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2016, 10 (11): : 5583 - 5604
  • [27] Text kernel expansion for real-time scene text detection
    He, Tao
    Huang, Sheng
    Tang, Wenhao
    Liu, Bo
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (04)
  • [28] A pooling based scene text proposal technique for scene text reading in the wild
    Dinh NguyenVan
    Lu, Shijian
    Tian, Shangxuan
    Ouarti, Nizar
    Mokhtari, Mounir
    PATTERN RECOGNITION, 2019, 87 : 118 - 129
  • [29] TransText: Improving scene text detection via transformer
    Zhu, Jiajun
    Wang, Guodong
    DIGITAL SIGNAL PROCESSING, 2022, 130
  • [30] RECURRENT GLOBAL CONVOLUTIONAL NETWORK FOR SCENE TEXT DETECTION
    Mohanty, Sabyasachi
    Dutta, Tanima
    Gupta, Hari Prabhat
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2750 - 2754