Natural Scene Text Detection Based on Deep Supervised Fully Convolutional Network

被引:0
|
作者
Zhang, Nan [1 ]
Jin, Xiaoning [1 ]
Li, Xiaowei [1 ]
机构
[1] Beijing Univ Technol, Beijing Adv Innovat Ctr Future Internet Technol, Beijing, Peoples R China
来源
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III | 2018年 / 11166卷
关键词
Scene image; Multi-oriented text; Deep supervision;
D O I
10.1007/978-3-030-00764-5_40
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the past few years, text detection in natural scenes has attracted increasing attention due to many real-world applications. Most existing methods only detect horizontal or nearly horizontal texts and have complicated processes. When using the neural network to detect text in the image, some ambiguity and small words are easy to be ignored because of many pooling operations. Therefore, this paper proposes an end-to-end trainable neural network for detecting multi-oriented text lines or words in natural scene images. The network fuses multi-level features and is guided by deep supervision during training. In this way, richer hierarchical representations can be learned automatically. The network makes two kinds of predictions: text/no text classification and location regression, thus we can directly locate multi-oriented words or text lines without other unnecessary intermediate steps. Experimental results on the ICDAR 2015 datasets and MSRA-TD500 datasets have proven that the proposed method outperforms the state-of-the-art methods by a noticeable margin on F-score.
引用
收藏
页码:439 / 448
页数:10
相关论文
共 50 条
  • [21] Scene Text-Line Extraction with Fully Convolutional Network and Refined Proposals
    Zeng, Guan-Xin
    Hou, Yu-Hong
    Su, Po-Chyi
    Kang, Li-Wei
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 1247 - 1251
  • [22] Text Detection in Natural Scenes using Fully Convolutional DenseNets
    Behzadi, Mitra
    Safabakhsh, Reza
    2018 4TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2018, : 11 - 14
  • [23] Multi-Oriented Real-time Arabic Scene Text Detection with Deep Fully Convolutional Networks
    Sassi, M. Saifeddine Hadj
    Beltaief, Ines
    Zekri, Manel
    Ben Yahia, Sadok
    2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
  • [24] A Unified Deep Neural Network for Scene Text Detection
    Li, Yixin
    Ma, Jinwen
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT I, 2017, 10361 : 101 - 112
  • [25] Scene Text Detection with Text Statistical Characteristics and Deep Neural Network
    Qu, Yanyun
    Yang, Xiaodong
    Lin, Li
    COMPUTER VISION, PT III, 2017, 773 : 245 - 254
  • [26] PCBSNet: A Pure Convolutional Bilateral Segmentation Network for Real-Time Natural Scene Text Detection
    Lian, Zhe
    Yin, Yanjun
    Zhi, Min
    Xu, Qiaozhi
    ELECTRONICS, 2023, 12 (14)
  • [27] Deep learning for detection of text polarity in natural scene images
    Perepu, Pavan Kumar
    NEUROCOMPUTING, 2021, 431 : 1 - 6
  • [28] Reading scene text with fully convolutional sequence modeling
    Gao, Yunze
    Chen, Yingying
    Wang, Jinqiao
    Tang, Ming
    Lu, Hanqing
    NEUROCOMPUTING, 2019, 339 : 161 - 170
  • [29] Scene Text Detection Based On Fusion Network
    Zhao, Xuezhuan
    Zhou, Ziheng
    Li, Lingling
    Pei, Lishen
    Ye, Zhaoyi
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (10)
  • [30] Natural Scene Text Detection using Deep Neural Networks
    Mayank
    Bhowmick, Swapnamoy
    Kotecha, Disha
    Rege, Priti P.
    2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,