Could scene context be beneficial for scene text detection?

被引:31
|
作者
Zhu, Anna [1 ]
Gao, Renwu [2 ]
Uchida, Seiichi [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Automat, State Key Lab Multispectral Informat Proc Technol, Wuhan 430074, Peoples R China
[2] Kyushu Univ, Human Interface Lab, Informat Sci & Elect Engn, Fukuoka 812, Japan
关键词
Scene text detection; Fully connected CRF; Convolutional neural network; Character feature; Context feature; READING TEXT; SEGMENTATION; IMAGE; RECOGNITION;
D O I
10.1016/j.patcog.2016.04.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text detection and scene segmentation are meaningful tasks in the computer vision field. Could the semantic scene segmentation assist scene text detection in any degree? For example, can we expect the probability of a region being text is low if its surrounding segment, i.e., its context, is labeled as sky? In this paper, we have a positive answer by constructing a scene context-based text detection model. In this model, we use texton features and a fully-connected conditional random field (CRF) to estimate pixel-level scene class's probability to be considered as image's context feature. Meanwhile, maximally stable extremal regions (MSERs) are extracted, integrated and extended as image patches of character candidates. Then, each image patch is fed to a simple two-layer convolutional neural network (CNN) to automatically extract its character feature. The averaged context feature of the corresponding patch is considered as the patch's context feature. The character feature and context feature are fused as the input into a support vector machine for text/non-text determination. Finally, as a post-processing, neighboring text regions are grouped hierarchically. The performance evaluation on ICDAR2013 and SVT databases, as well as a preliminary evaluation on a patch-level database, proves that the scene context can improve the performance of scene text detection. Moreover, the comparative study with state-of-the-art methods shows the top-level performance of our method. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:204 / 215
页数:12
相关论文
共 50 条
  • [41] Label distribution learning for scene text detection
    MA Haoyu
    LU Ningning
    MEI Junjun
    GUAN Tao
    ZHANG Yu
    GENG Xin
    Frontiers of Computer Science, 2023, 17 (06)
  • [42] A hierarchical recursive method for text detection in natural scene images
    Wang, Xiaobing
    Song, Yonghong
    Zhang, Yuanlin
    Xin, Jingmin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (24) : 26201 - 26223
  • [43] Scene text detection via decoupled feature pyramid networks
    Liang, Min
    Hou, Jie-Bo
    Zhu, Xiaobin
    Yang, Chun
    Qin, Jingyan
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2022, 25 (3) : 163 - 175
  • [44] Robust Scene Text Detection for Partially Annotated Training Data
    Keserwani, Prateek
    Saini, Rajkumar
    Liwicki, Marcus
    Roy, Partha Pratim
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8635 - 8645
  • [45] TextMountain: Accurate scene text detection via instance segmentation
    Zhu, Yixing
    Du, Jun
    PATTERN RECOGNITION, 2021, 110
  • [46] Deep learning approaches to scene text detection: a comprehensive review
    Khan, Tauseef
    Sarkar, Ram
    Mollah, Ayatullah Faruk
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (05) : 3239 - 3298
  • [47] Scene Text Detection Using Attention with Depthwise Separable Convolutions
    Hassan, Ehtesham
    Lekshmi, V. L.
    APPLIED SCIENCES-BASEL, 2022, 12 (13):
  • [48] Collaborative Learning Network for Scene Text Detection
    Zhang, Xiaoye
    Yue, Yuanhao
    Yang, Yingyi
    Zhang, Xining
    Wang, Wei
    Zou, Qin
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 6788 - 6793
  • [49] Chinese And English Bilingual Scene Text Detection
    Sha, Yuan
    Shi, Ping
    You, Jian
    Bao, Xiaojie
    Fu, Sizhe
    Zeng, Guoxiang
    2017 IEEE 3RD INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC), 2017, : 499 - 503
  • [50] Deep FCN for Arabic Scene Text Detection
    Beltaief, Ines
    Ben Halima, Mohamed
    2018 IEEE 2ND INTERNATIONAL WORKSHOP ON ARABIC AND DERIVED SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2018, : 129 - 134