Could scene context be beneficial for scene text detection?

Cited by: 31

Authors:

Zhu, Anna [1]

Gao, Renwu [2]

Uchida, Seiichi [2]

Affiliations:

[1] Huazhong Univ Sci & Technol, Sch Automat, State Key Lab Multispectral Informat Proc Technol, Wuhan 430074, Peoples R China

[2] Kyushu Univ, Human Interface Lab, Informat Sci & Elect Engn, Fukuoka 812, Japan

Source:

PATTERN RECOGNITION | 2016 / Vol. 58

Keywords:

Scene text detection; Fully connected CRF; Convolutional neural network; Character feature; Context feature; READING TEXT; SEGMENTATION; IMAGE; RECOGNITION;

DOI:

10.1016/j.patcog.2016.04.011

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Scene text detection and scene segmentation are meaningful tasks in the computer vision field. Could the semantic scene segmentation assist scene text detection in any degree? For example, can we expect the probability of a region being text is low if its surrounding segment, i.e., its context, is labeled as sky? In this paper, we have a positive answer by constructing a scene context-based text detection model. In this model, we use texton features and a fully-connected conditional random field (CRF) to estimate pixel-level scene class's probability to be considered as image's context feature. Meanwhile, maximally stable extremal regions (MSERs) are extracted, integrated and extended as image patches of character candidates. Then, each image patch is fed to a simple two-layer convolutional neural network (CNN) to automatically extract its character feature. The averaged context feature of the corresponding patch is considered as the patch's context feature. The character feature and context feature are fused as the input into a support vector machine for text/non-text determination. Finally, as a post-processing, neighboring text regions are grouped hierarchically. The performance evaluation on ICDAR2013 and SVT databases, as well as a preliminary evaluation on a patch-level database, proves that the scene context can improve the performance of scene text detection. Moreover, the comparative study with state-of-the-art methods shows the top-level performance of our method. (C) 2016 Elsevier Ltd. All rights reserved.


Pages: 204-215

Page count: 12
