A pooling based scene text proposal technique for scene text reading in the wild

被引:12
|
作者
Dinh NguyenVan [1 ,5 ]
Lu, Shijian [2 ]
Tian, Shangxuan [3 ]
Ouarti, Nizar [1 ,5 ]
Mokhtari, Mounir [4 ,5 ]
机构
[1] Univ Paris 06, Sorbonne Univ, 4 Pl Jussieu, F-75252 Paris 05, France
[2] Nanyang Technol Univ, Nanyang Ave, Singapore 639798, Singapore
[3] Tencent Co LTD, Gaoxinnanyi Ave,Southern Dist Hitech Pk, Shenzhen 518057, Peoples R China
[4] Inst Mines Telecom, 37-39 Rue Dareau, F-75014 Paris, France
[5] CNRS, Image & Pervas Access Lab, UMI 2955, I2R, 1 Fusionopolis Way,21-01 Connexis South Tower, Singapore 138632, Singapore
关键词
Scene text proposal; Pooling based grouping; Scene text detection; Scene text reading; Scene text spotting; NEURAL-NETWORK; RECOGNITION; LOCALIZATION;
D O I
10.1016/j.patcog.2018.10.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic reading texts in scenes has attracted increasing interest in recent years as texts often carry rich semantic information that is useful for scene understanding. In this paper, we propose a novel scene text proposal technique aiming for accurate reading texts in scenes. Inspired by the pooling layer in the deep neural network architecture, a pooling based scene text proposal technique is developed. A novel score function is designed which exploits the histogram of oriented gradients and is capable of ranking the proposals according to their probabilities of being text. An end-to-end scene text reading system has also been developed by incorporating the proposed scene text proposal technique where false alarms elimination and words recognition are performed simultaneously. Extensive experiments over several public datasets show that the proposed technique can handle multi-orientation and multi-language scene texts and obtains outstanding proposal performance. The developed end-to-end systems also achieve very competitive scene text spotting and reading performance. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:118 / 129
页数:12
相关论文
共 50 条
  • [31] Deep Residual Text Detection Network for Scene Text
    Zhu, Xiangyu
    Jiang, Yingying
    Yang, Shuli
    Wang, Xiaobing
    Li, Wei
    Fu, Pei
    Wang, Hua
    Luo, Zhenbo
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 807 - 812
  • [32] Towards complex scene text reading with selective region proposal and two-stage deep reinforcement learning
    Harizi, Riadh
    Walha, Rim
    Drira, Fadoua
    APPLIED SOFT COMPUTING, 2025, 170
  • [33] SCALE-INVARIANT MULTI-ORIENTED TEXT DETECTION IN WILD SCENE IMAGE
    Dasgupta, Kinjal
    Das, Sudip
    Bhattacharya, Ujjwal
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2041 - 2045
  • [34] Robust Text Detection in Natural Scene Images
    Yin, Xu-Cheng
    Yin, Xuwang
    Huang, Kaizhu
    Hao, Hong-Wei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (05) : 970 - 983
  • [35] Turning a CLIP Model Into a Scene Text Spotter
    Yu, Wenwen
    Liu, Yuliang
    Zhu, Xingkui
    Cao, Haoyu
    Sun, Xing
    Bai, Xiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6040 - 6054
  • [36] Scene Text Detection and Recognition Based on Iterative Correction
    Xiong, Li
    Gui, Ziyan
    Ou, Ying
    Xu, Wenxia
    PROCEEDINGS OF 2022 5TH INTERNATIONAL CONFERENCE ON ROBOT SYSTEMS AND APPLICATIONS, ICRSA2022, 2022, : 7 - 10
  • [37] Scene text spotting based on end-to-end
    Wei G.
    Rong W.
    Liang Y.
    Xiao X.
    Liu X.
    Journal of Intelligent and Fuzzy Systems, 2021, 40 (05) : 8871 - 8881
  • [38] ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene Images
    Shahab, Asif
    Shafait, Faisal
    Dengel, Andreas
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1491 - 1496
  • [39] PSENet-based efficient scene text detection
    Liao, Guanglong
    Zhu, Zhongjie
    Bai, Yongqiang
    Liu, Tingna
    Xie, Zhibo
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2021, 2021 (01)
  • [40] Urdu text in natural scene images: a new dataset and preliminary text detection
    Ali, Hazrat
    Iqbal, Khalid
    Mujtaba, Ghulam
    Fayyaz, Ahmad
    Bulbul, Mohammad Farhad
    Karam, Fazal Wahab
    Zahir, Ali
    PEERJ COMPUTER SCIENCE, 2021, 7