Could scene context be beneficial for scene text detection?

被引:31
|
作者
Zhu, Anna [1 ]
Gao, Renwu [2 ]
Uchida, Seiichi [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Automat, State Key Lab Multispectral Informat Proc Technol, Wuhan 430074, Peoples R China
[2] Kyushu Univ, Human Interface Lab, Informat Sci & Elect Engn, Fukuoka 812, Japan
关键词
Scene text detection; Fully connected CRF; Convolutional neural network; Character feature; Context feature; READING TEXT; SEGMENTATION; IMAGE; RECOGNITION;
D O I
10.1016/j.patcog.2016.04.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text detection and scene segmentation are meaningful tasks in the computer vision field. Could the semantic scene segmentation assist scene text detection in any degree? For example, can we expect the probability of a region being text is low if its surrounding segment, i.e., its context, is labeled as sky? In this paper, we have a positive answer by constructing a scene context-based text detection model. In this model, we use texton features and a fully-connected conditional random field (CRF) to estimate pixel-level scene class's probability to be considered as image's context feature. Meanwhile, maximally stable extremal regions (MSERs) are extracted, integrated and extended as image patches of character candidates. Then, each image patch is fed to a simple two-layer convolutional neural network (CNN) to automatically extract its character feature. The averaged context feature of the corresponding patch is considered as the patch's context feature. The character feature and context feature are fused as the input into a support vector machine for text/non-text determination. Finally, as a post-processing, neighboring text regions are grouped hierarchically. The performance evaluation on ICDAR2013 and SVT databases, as well as a preliminary evaluation on a patch-level database, proves that the scene context can improve the performance of scene text detection. Moreover, the comparative study with state-of-the-art methods shows the top-level performance of our method. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:204 / 215
页数:12
相关论文
共 50 条
  • [31] Deep Residual Text Detection Network for Scene Text
    Zhu, Xiangyu
    Jiang, Yingying
    Yang, Shuli
    Wang, Xiaobing
    Li, Wei
    Fu, Pei
    Wang, Hua
    Luo, Zhenbo
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 807 - 812
  • [32] Scene Text Detection Based on Text Stroke Components
    Hou, Xinyue
    Cheng, Pengsen
    Gao, Hongyu
    Li, Xin
    Liu, Jiayong
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2025, 35 (05)
  • [33] TKDN: Scene Text Detection via Keypoints Detection
    Cui, Yuanshun
    Li, Jie
    Han, Hu
    Shan, Shiguang
    Chen, Xilin
    COMPUTER VISION - ACCV 2018, PT V, 2019, 11365 : 231 - 246
  • [34] A Text-Specific Domain Adaptive Network for Scene Text Detection in the Wild
    He, Xuan
    Yuan, Jin
    Li, Mengyao
    Wang, Runmin
    Wang, Haidong
    Li, Zhiyong
    APPLIED INTELLIGENCE, 2023, 53 (22) : 26827 - 26839
  • [35] Scene Text Detection based on Structural Features
    Nguyen, Khanh
    Ngo Duc Thanh
    2016 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS, AND ITS APPLICATIONS (IC3INA) - RECENT PROGRESS IN COMPUTER, CONTROL, AND INFORMATICS FOR DATA SCIENCE, 2016, : 48 - 53
  • [36] Label distribution learning for scene text detection
    Ma, Haoyu
    Lu, Ningning
    Mei, Junjun
    Guan, Tao
    Zhang, Yu
    Geng, Xin
    FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (06)
  • [37] SCENE TEXT DETECTION WITH SUPERPIXELS AND HIERARCHICAL MODEL
    Zhou, Gang.
    Liu, Yuehu.
    Tian, Zhiqiang.
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1001 - 1004
  • [38] FEATURE FUSION NETWORK FOR SCENE TEXT DETECTION
    Cai, Chenqin
    Lv, Pin
    Su, Bing
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2755 - 2759
  • [39] Refinement Correction Network for Scene Text Detection
    Lian, Zhe
    Yin, Yanjun
    Hu, Wei
    Xu, Qiaozhi
    Zhi, Min
    Lu, Jingfang
    Qi, Xuanhao
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VIII, ICIC 2024, 2024, 14869 : 93 - 105
  • [40] Review on text detection methods on scene images
    Brisinello, Matteo
    Grbic, Ratko
    Vranjes, Mario
    Vranjes, Denis
    2019 61ST INTERNATIONAL SYMPOSIUM ELMAR, 2019, : 51 - 56