A pooling based scene text proposal technique for scene text reading in the wild

被引：12

作者：

Dinh NguyenVan ^{[1
,5
]}

Lu, Shijian ^{[2
]}

Tian, Shangxuan ^{[3
]}

Ouarti, Nizar ^{[1
,5
]}

Mokhtari, Mounir ^{[4
,5
]}

机构：

[1] Univ Paris 06, Sorbonne Univ, 4 Pl Jussieu, F-75252 Paris 05, France

[2] Nanyang Technol Univ, Nanyang Ave, Singapore 639798, Singapore

[3] Tencent Co LTD, Gaoxinnanyi Ave,Southern Dist Hitech Pk, Shenzhen 518057, Peoples R China

[4] Inst Mines Telecom, 37-39 Rue Dareau, F-75014 Paris, France

[5] CNRS, Image & Pervas Access Lab, UMI 2955, I2R, 1 Fusionopolis Way,21-01 Connexis South Tower, Singapore 138632, Singapore

来源：

PATTERN RECOGNITION | 2019年 / 87卷

关键词：

Scene text proposal; Pooling based grouping; Scene text detection; Scene text reading; Scene text spotting; NEURAL-NETWORK; RECOGNITION; LOCALIZATION;

D O I：

10.1016/j.patcog.2018.10.012

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Automatic reading texts in scenes has attracted increasing interest in recent years as texts often carry rich semantic information that is useful for scene understanding. In this paper, we propose a novel scene text proposal technique aiming for accurate reading texts in scenes. Inspired by the pooling layer in the deep neural network architecture, a pooling based scene text proposal technique is developed. A novel score function is designed which exploits the histogram of oriented gradients and is capable of ranking the proposals according to their probabilities of being text. An end-to-end scene text reading system has also been developed by incorporating the proposed scene text proposal technique where false alarms elimination and words recognition are performed simultaneously. Extensive experiments over several public datasets show that the proposed technique can handle multi-orientation and multi-language scene texts and obtains outstanding proposal performance. The developed end-to-end systems also achieve very competitive scene text spotting and reading performance. (C) 2018 Elsevier Ltd. All rights reserved.

引用

页码：118 / 129

页数：12

共 50 条

[41] PSENet-based efficient scene text detection
Guanglong Liao
Zhongjie Zhu
Yongqiang Bai
Tingna Liu
Zhibo Xie
EURASIP Journal on Advances in Signal Processing, 2021
[42] Deep learning for detection of text polarity in natural scene images
Perepu, Pavan Kumar
NEUROCOMPUTING, 2021, 431 : 1 - 6
[43] Text detection in scene images based on exhaustive segmentation
Wei, Yuanwang
Zhang, Zhijiang
Shen, Wei
Zeng, Dan
Fang, Mei
Zhou, Shifu
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2017, 50 : 1 - 8
[44] Buffer-text: Detecting arbitrary shaped text in natural scene image
Yang, Ke
Yi, Jizheng
Chen, Aibin
Jin, Ze
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 130
[45] AT-Text: Assembling Text Components for Efficient Dense Scene Text Detection
Li, Haiyan
Lu, Hongtao
FUTURE INTERNET, 2020, 12 (11): : 1 - 14
[46] Real-time Scene Text Detection Based on Stroke Model
Liu, Yi
Zhang, Dongming
Zhang, Yongdong
Lin, Shouxun
2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3116 - 3120
[47] Rotated Box Is Back: An Accurate Box Proposal Network for Scene Text Detection
Lee, Jusung
Lee, Jaemyung
Yang, Cheoljong
Lee, Younghyun
Lee, Joonsoo
DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 49 - 63
[48] Video Scene Text Frames Categorization for Text Detection and Recognition
Qin, Longfei
Shivakumara, Palaiahnakote
Lu, Tong
Pal, Umapada
Tan, Chew Lim
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3886 - 3891
[49] Scene Text Localization Using Keypoints
Erdogmus, Nesli
Ozuysal, Mustafa
2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 1917 - 1920
[50] Text Proposals Based on Windowed Maximally Stable Extremal Region for Scene Text Detection
Su, Feng
Ding, Wenjun
Wang, Lan
Shan, Susu
Xu, Hailiang
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 376 - 381

← 1 2 3 4 5 →