A pooling based scene text proposal technique for scene text reading in the wild

被引:12
|
作者
Dinh NguyenVan [1 ,5 ]
Lu, Shijian [2 ]
Tian, Shangxuan [3 ]
Ouarti, Nizar [1 ,5 ]
Mokhtari, Mounir [4 ,5 ]
机构
[1] Univ Paris 06, Sorbonne Univ, 4 Pl Jussieu, F-75252 Paris 05, France
[2] Nanyang Technol Univ, Nanyang Ave, Singapore 639798, Singapore
[3] Tencent Co LTD, Gaoxinnanyi Ave,Southern Dist Hitech Pk, Shenzhen 518057, Peoples R China
[4] Inst Mines Telecom, 37-39 Rue Dareau, F-75014 Paris, France
[5] CNRS, Image & Pervas Access Lab, UMI 2955, I2R, 1 Fusionopolis Way,21-01 Connexis South Tower, Singapore 138632, Singapore
关键词
Scene text proposal; Pooling based grouping; Scene text detection; Scene text reading; Scene text spotting; NEURAL-NETWORK; RECOGNITION; LOCALIZATION;
D O I
10.1016/j.patcog.2018.10.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic reading texts in scenes has attracted increasing interest in recent years as texts often carry rich semantic information that is useful for scene understanding. In this paper, we propose a novel scene text proposal technique aiming for accurate reading texts in scenes. Inspired by the pooling layer in the deep neural network architecture, a pooling based scene text proposal technique is developed. A novel score function is designed which exploits the histogram of oriented gradients and is capable of ranking the proposals according to their probabilities of being text. An end-to-end scene text reading system has also been developed by incorporating the proposed scene text proposal technique where false alarms elimination and words recognition are performed simultaneously. Extensive experiments over several public datasets show that the proposed technique can handle multi-orientation and multi-language scene texts and obtains outstanding proposal performance. The developed end-to-end systems also achieve very competitive scene text spotting and reading performance. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:118 / 129
页数:12
相关论文
共 50 条
  • [41] PSENet-based efficient scene text detection
    Guanglong Liao
    Zhongjie Zhu
    Yongqiang Bai
    Tingna Liu
    Zhibo Xie
    EURASIP Journal on Advances in Signal Processing, 2021
  • [42] Deep learning for detection of text polarity in natural scene images
    Perepu, Pavan Kumar
    NEUROCOMPUTING, 2021, 431 : 1 - 6
  • [43] Text detection in scene images based on exhaustive segmentation
    Wei, Yuanwang
    Zhang, Zhijiang
    Shen, Wei
    Zeng, Dan
    Fang, Mei
    Zhou, Shifu
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2017, 50 : 1 - 8
  • [44] Buffer-text: Detecting arbitrary shaped text in natural scene image
    Yang, Ke
    Yi, Jizheng
    Chen, Aibin
    Jin, Ze
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 130
  • [45] AT-Text: Assembling Text Components for Efficient Dense Scene Text Detection
    Li, Haiyan
    Lu, Hongtao
    FUTURE INTERNET, 2020, 12 (11): : 1 - 14
  • [46] Real-time Scene Text Detection Based on Stroke Model
    Liu, Yi
    Zhang, Dongming
    Zhang, Yongdong
    Lin, Shouxun
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3116 - 3120
  • [47] Rotated Box Is Back: An Accurate Box Proposal Network for Scene Text Detection
    Lee, Jusung
    Lee, Jaemyung
    Yang, Cheoljong
    Lee, Younghyun
    Lee, Joonsoo
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 49 - 63
  • [48] Video Scene Text Frames Categorization for Text Detection and Recognition
    Qin, Longfei
    Shivakumara, Palaiahnakote
    Lu, Tong
    Pal, Umapada
    Tan, Chew Lim
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3886 - 3891
  • [49] Scene Text Localization Using Keypoints
    Erdogmus, Nesli
    Ozuysal, Mustafa
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 1917 - 1920
  • [50] Text Proposals Based on Windowed Maximally Stable Extremal Region for Scene Text Detection
    Su, Feng
    Ding, Wenjun
    Wang, Lan
    Shan, Susu
    Xu, Hailiang
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 376 - 381