Could scene context be beneficial for scene text detection?

Cited by: 31

Authors:

Zhu, Anna [1]

Gao, Renwu [2]

Uchida, Seiichi [2]

Affiliations:

[1] Huazhong Univ Sci & Technol, Sch Automat, State Key Lab Multispectral Informat Proc Technol, Wuhan 430074, Peoples R China

[2] Kyushu Univ, Human Interface Lab, Informat Sci & Elect Engn, Fukuoka 812, Japan

Source:

PATTERN RECOGNITION | 2016 / Vol. 58

Keywords:

Scene text detection; Fully connected CRF; Convolutional neural network; Character feature; Context feature; READING TEXT; SEGMENTATION; IMAGE; RECOGNITION;

DOI:

10.1016/j.patcog.2016.04.011

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Scene text detection and scene segmentation are meaningful tasks in the computer vision field. Can semantic scene segmentation assist scene text detection to any degree? For example, can we expect the probability that a region is text to be low if its surrounding segment, i.e., its context, is labeled as sky? In this paper, we answer in the affirmative by constructing a scene context-based text detection model. In this model, we use texton features and a fully connected conditional random field (CRF) to estimate the pixel-level probability of each scene class, which serves as the image's context feature. Meanwhile, maximally stable extremal regions (MSERs) are extracted, integrated, and extended into image patches of character candidates. Each image patch is then fed to a simple two-layer convolutional neural network (CNN) to automatically extract its character feature. The context feature averaged over the corresponding patch is taken as the patch's context feature. The character feature and context feature are fused as the input to a support vector machine for text/non-text determination. Finally, as post-processing, neighboring text regions are grouped hierarchically. Performance evaluation on the ICDAR2013 and SVT databases, as well as a preliminary evaluation on a patch-level database, shows that scene context can improve the performance of scene text detection. Moreover, a comparative study with state-of-the-art methods shows the top-level performance of our method. (C) 2016 Elsevier Ltd. All rights reserved.
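
The patch-level context averaging and feature fusion described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `patch_context_feature` and `fuse_features`, the array shapes, the random toy data, and the choice of plain concatenation as the fusion step are all assumptions made for illustration, since the abstract only says the two features are "fused".

```python
import numpy as np


def patch_context_feature(class_probs, box):
    """Average per-pixel scene-class probabilities inside one patch.

    class_probs: array of shape (H, W, C) with per-pixel probabilities
                 over C scene classes (assumed to come from a CRF stage).
    box: (top, left, bottom, right), with bottom/right exclusive.
    """
    top, left, bottom, right = box
    region = class_probs[top:bottom, left:right, :]
    # Flatten the spatial dimensions and average over all pixels in the patch.
    return region.reshape(-1, region.shape[-1]).mean(axis=0)


def fuse_features(char_feature, context_feature):
    """Fuse features by concatenation (an assumption, not from the source)."""
    return np.concatenate([char_feature, context_feature])


# Toy data: an 8x8 image with 3 hypothetical scene classes.
rng = np.random.default_rng(0)
logits = rng.normal(size=(8, 8, 3))
class_probs = np.exp(logits) / np.exp(logits).sum(axis=-1, keepdims=True)

# Hypothetical 16-dimensional character feature for one candidate patch.
char_feature = rng.normal(size=16)

context = patch_context_feature(class_probs, (2, 2, 6, 6))
fused = fuse_features(char_feature, context)
```

In a full pipeline, a vector like `fused` would be the per-patch input to the text/non-text classifier; because each pixel's class probabilities sum to one, the averaged context feature does as well.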


Pages: 204-215

Page count: 12
