A Deep Learning-Based Text Detection and Recognition Approach for Natural Scenes

Cited by: 4
Authors
Li, Xuexiang [1]
Affiliations
[1] Zibo Vocat Inst, Coll Automobile Engn, Zibo 255314, Peoples R China
Keywords
Deep learning; natural scenes; text detection; text recognition
DOI
10.1142/S0218126623500731
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This paper designs a deep learning-based model for detecting and recognizing text in natural scenes. Because the high complexity of both text and background makes natural scene text hard to recognize, it proposes a scene text recognition method that combines connectionist temporal classification (CTC) with an attention mechanism. The method recasts text recognition in natural scenes as a sequence recognition problem, which avoids the loss of overall recognition performance caused by difficult character segmentation. The attention mechanism also reduces network complexity and improves recognition accuracy. The improved PSE-based text detection algorithm is evaluated on the curved-text datasets SCUT-CTW1500 and ICDAR2017 in natural scenes. Without a pre-training module, it achieves 88.5% accuracy, 77% recall, and an F1 score of 81.3%, and it detects text in arbitrary orientations well. The improved CRNN-based text recognition algorithm is evaluated on the natural scene dataset ICDAR2017, where it reaches 94.5% accuracy in the unconstrained setting.
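The abstract's point about treating recognition as a sequence problem can be illustrated with a minimal sketch of greedy CTC decoding. This is not the paper's implementation. The blank index, the three-letter alphabet, and the per-frame scores below are assumptions made up for illustration. The sketch only shows how per-frame predictions collapse into a string without any explicit character segmentation.

```python
from itertools import groupby

BLANK = 0  # assumed index of the CTC blank symbol


def ctc_greedy_decode(frame_scores, alphabet):
    """Collapse per-frame class scores into a string (greedy CTC decoding).

    frame_scores: one list of class scores per time step; class 0 is the
    blank and class k (k >= 1) maps to alphabet[k - 1].
    """
    # Pick the best-scoring class for each frame.
    best = [max(range(len(s)), key=s.__getitem__) for s in frame_scores]
    # Merge consecutive repeats, then drop blanks.
    collapsed = [k for k, _ in groupby(best)]
    return "".join(alphabet[k - 1] for k in collapsed if k != BLANK)


# Hypothetical scores for 7 frames over the classes [blank, 'a', 'b', 'c'].
frames = [
    [0.10, 0.80, 0.05, 0.05],  # a
    [0.10, 0.70, 0.10, 0.10],  # a (repeat, merged with the previous frame)
    [0.90, 0.05, 0.03, 0.02],  # blank (separates the two 'a' characters)
    [0.20, 0.60, 0.10, 0.10],  # a
    [0.10, 0.10, 0.70, 0.10],  # b
    [0.10, 0.20, 0.60, 0.10],  # b (repeat)
    [0.80, 0.10, 0.05, 0.05],  # blank
]
decoded = ctc_greedy_decode(frames, "abc")
print(decoded)  # aab
```

The blank symbol is what lets the decoder keep a genuine double letter: the two runs of 'a' are separated by a blank frame, so they are not merged into one character.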
Pages: 19