A pooling based scene text proposal technique for scene text reading in the wild

被引:12
|
作者
Dinh NguyenVan [1 ,5 ]
Lu, Shijian [2 ]
Tian, Shangxuan [3 ]
Ouarti, Nizar [1 ,5 ]
Mokhtari, Mounir [4 ,5 ]
机构
[1] Univ Paris 06, Sorbonne Univ, 4 Pl Jussieu, F-75252 Paris 05, France
[2] Nanyang Technol Univ, Nanyang Ave, Singapore 639798, Singapore
[3] Tencent Co LTD, Gaoxinnanyi Ave,Southern Dist Hitech Pk, Shenzhen 518057, Peoples R China
[4] Inst Mines Telecom, 37-39 Rue Dareau, F-75014 Paris, France
[5] CNRS, Image & Pervas Access Lab, UMI 2955, I2R, 1 Fusionopolis Way,21-01 Connexis South Tower, Singapore 138632, Singapore
关键词
Scene text proposal; Pooling based grouping; Scene text detection; Scene text reading; Scene text spotting; NEURAL-NETWORK; RECOGNITION; LOCALIZATION;
D O I
10.1016/j.patcog.2018.10.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic reading texts in scenes has attracted increasing interest in recent years as texts often carry rich semantic information that is useful for scene understanding. In this paper, we propose a novel scene text proposal technique aiming for accurate reading texts in scenes. Inspired by the pooling layer in the deep neural network architecture, a pooling based scene text proposal technique is developed. A novel score function is designed which exploits the histogram of oriented gradients and is capable of ranking the proposals according to their probabilities of being text. An end-to-end scene text reading system has also been developed by incorporating the proposed scene text proposal technique where false alarms elimination and words recognition are performed simultaneously. Extensive experiments over several public datasets show that the proposed technique can handle multi-orientation and multi-language scene texts and obtains outstanding proposal performance. The developed end-to-end systems also achieve very competitive scene text spotting and reading performance. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:118 / 129
页数:12
相关论文
共 50 条
  • [21] Scene text detection and recognition: a survey
    Naiemi, Fatemeh
    Ghods, Vahid
    Khalesi, Hassan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (14) : 20255 - 20290
  • [22] Contextual Text Block Detection Towards Scene Text Understanding
    Xue, Chuhui
    Huang, Jiaxing
    Zhang, Wenqing
    Lu, Shijian
    Wang, Changhu
    Bai, Song
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 374 - 391
  • [23] Reading Scene Text by Fusing Visual Attention with Semantic Representations
    Liu, Zhiguang
    Wang, Liangwei
    Qiao, Jian
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 210 - 218
  • [24] Scene Text Detection based on Structural Features
    Nguyen, Khanh
    Ngo Duc Thanh
    2016 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS, AND ITS APPLICATIONS (IC3INA) - RECENT PROGRESS IN COMPUTER, CONTROL, AND INFORMATICS FOR DATA SCIENCE, 2016, : 48 - 53
  • [25] Scene Text Detection Based On Fusion Network
    Zhao, Xuezhuan
    Zhou, Ziheng
    Li, Lingling
    Pei, Lishen
    Ye, Zhaoyi
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (10)
  • [26] MULTI-ORIENTED TEXT DETECTION IN SCENE IMAGES
    Basavanna, M.
    Shivakumara, P.
    Srivatsa, S. K.
    Kumar, G. Hemantha
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (07)
  • [27] Scene Text Detection and Recognition: The Deep Learning Era
    Long, Shangbang
    He, Xin
    Yao, Cong
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (01) : 161 - 184
  • [28] Arbitrarily Shaped Scene Text Detection With a Mask Tightness Text Detector
    Liu, Yuliang
    Jin, Lianwen
    Fang, Chuanming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 2918 - 2930
  • [29] Scene Text Detection Based on Expanding the Text Center Region for Bilingual Tibetan-Chinese
    Li, Jincheng
    Hao, Yusheng
    Wang, Weilan
    Wang, Tiejun
    Li, Qiaoqiao
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (13)
  • [30] Scene Text Detection with Text Statistical Characteristics and Deep Neural Network
    Qu, Yanyun
    Yang, Xiaodong
    Lin, Li
    COMPUTER VISION, PT III, 2017, 773 : 245 - 254