Natural Scene Text Detection Based on Deep Supervised Fully Convolutional Network

被引:0
作者
Zhang, Nan [1 ]
Jin, Xiaoning [1 ]
Li, Xiaowei [1 ]
机构
[1] Beijing Univ Technol, Beijing Adv Innovat Ctr Future Internet Technol, Beijing, Peoples R China
来源
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III | 2018年 / 11166卷
关键词
Scene image; Multi-oriented text; Deep supervision;
D O I
10.1007/978-3-030-00764-5_40
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the past few years, text detection in natural scenes has attracted increasing attention due to many real-world applications. Most existing methods only detect horizontal or nearly horizontal texts and have complicated processes. When using the neural network to detect text in the image, some ambiguity and small words are easy to be ignored because of many pooling operations. Therefore, this paper proposes an end-to-end trainable neural network for detecting multi-oriented text lines or words in natural scene images. The network fuses multi-level features and is guided by deep supervision during training. In this way, richer hierarchical representations can be learned automatically. The network makes two kinds of predictions: text/no text classification and location regression, thus we can directly locate multi-oriented words or text lines without other unnecessary intermediate steps. Experimental results on the ICDAR 2015 datasets and MSRA-TD500 datasets have proven that the proposed method outperforms the state-of-the-art methods by a noticeable margin on F-score.
引用
收藏
页码:439 / 448
页数:10
相关论文
empty
未找到相关数据