Toward Arbitrary-Shaped Text Spotting Based on End-to-End

Cited by: 4

Authors:

Wei, Guangcun [1,2]

Rong, Wansheng [1]

Liang, Yongquan [1]

Xiao, Xinguang [1]

Liu, Xiang [1]

Affiliations:

[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China

[2] Shandong Univ Sci & Technol, Coll Intelligent Equipment, Tai An 271019, Shandong, Peoples R China

Source:

IEEE ACCESS | 2020 / Vol. 8 / Issue 08

Keywords:

Text recognition; Feature extraction; Task analysis; Detectors; Optimization; Convolution; Optical character recognition software; Natural scene text spotting; SA-BiLSTM; end-to-end; joint optimization; SCENE TEXT; RECOGNITION;

DOI:

10.1109/ACCESS.2020.3020387

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Text spotting in natural scenes has become an active research topic, and curved text and long text remain its main difficulties. To address both, we propose a novel end-to-end text spotting model. The model consists of three parts: a shared convolution module, a text detector module, and a text recognizer module. For long text, we adopt a corner attention mechanism to extract long-text features more effectively. For curved text, we feed the rectified feature map into an SA-BiLSTM decoder to recognize the text more effectively. More importantly, a joint optimization strategy lets the text detection task and the text recognition task reinforce each other. Experimental results on the TotalText, ICDAR2015, ICDAR2013, CTW1500, COCO-Text, and MLT datasets show that our method achieves excellent performance and robustness in end-to-end text spotting in natural scenes.

Pages: 159906-159914

Page count: 9

50 items in total

[1] Boundary TextSpotter: Toward Arbitrary-Shaped Scene Text Spotting
Lu, Pu
Wang, Hao
Zhu, Shenggao
Wang, Jing
Bai, Xiang
Liu, Wenyu
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6200 - 6212
[2] Scene text spotting based on end-to-end
Wei G.
Rong W.
Liang Y.
Xiao X.
Liu X.
Journal of Intelligent and Fuzzy Systems, 2021, 40 (05) : 8871 - 8881
[3] Towards End-to-End Text Spotting in Natural Scenes
Wang, Peng
Li, Hui
Shen, Chunhua
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 7266 - 7281
[4] POINTER NETWORKS FOR ARBITRARY-SHAPED TEXT SPOTTING
Zhang, Yi
Yang, Wei
Xu, Zhenbo
Li, Yingjie
Chen, Zhi
Huang, Liusheng
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2375 - 2379
[5] Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes
Liao, Minghui
Lyu, Pengyuan
He, Minghang
Yao, Cong
Wu, Wenhao
Bai, Xiang
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (02) : 532 - 548
[6] PAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text
Wang, Wenhai
Xie, Enze
Li, Xiang
Liu, Xuebo
Liang, Ding
Zhibo, Yang
Lu, Tong
Shen, Chunhua
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5349 - 5367
[7] End-to-End Video Text Spotting with Transformer
Wu, Weijia
Cai, Yuanqiang
Shen, Chunhua
Zhang, Debing
Fu, Ying
Zhou, Hong
Luo, Ping
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (09) : 4019 - 4035
[8] FREE: A Fast and Robust End-to-End Video Text Spotter
Cheng, Zhanzhan
Lu, Jing
Zou, Baorui
Qiao, Liang
Xu, Yunlu
Pu, Shiliang
Niu, Yi
Wu, Fei
Zhou, Shuigeng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 822 - 837
[9] Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes
Lyu, Pengyuan
Liao, Minghui
Yao, Cong
Wu, Wenhao
Bai, Xiang
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 71 - 88
[10] ADATS: Adaptive RoI-Align based Transformer for End-to-End Text Spotting
Huang, Zepeng
Wan, Qi
Chen, Junliang
Zhao, Xiaodong
Ye, Kai
Shen, Linlin
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1403 - 1408
