Identification of Mongolian and Chinese Species in Natural Scenes Based on Convolutional Neural Network

被引:0
作者
Zhang, Jianxin
Hu, Chunxiao
机构
来源
2020 CHINESE AUTOMATION CONGRESS (CAC 2020) | 2020年
关键词
language recognition; LTCNN; text correction; TEXT;
D O I
10.1109/CAC51589.2020.9327774
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multilingual texts are very common in natural scenes. Different languages have different shapes and structure. According to the language characteristics to choose the appropriate method can reduce the recognition error rate during text detection and recognition. This paper mainly proposes a lightweight convolutional neural network LTCNN for natural scene images mixed Ns it h Chinese characters and Mongolian. Firstly, collecting pictures contained both Chinese characters and Mongolian; then using perspective transformation to process the slanted text image, and using gamma transformation to enhance the image of underexposed or overexposed pictures;nest using the improved EAST model for the preprocessed pictures to extract text makes the data set needed for training the network; finally, the pictures in the data set are fed into the LTCNN network for training and identification of Mongolian and Chinese species. Experimental results show that the accuracy of Chinese and Mongolian recognition using this method can be close to 90% accuracy.
引用
收藏
页码:2699 / 2704
页数:6
相关论文
共 15 条
  • [1] Ali A, 2018, 2018 IEEE 2ND INTERNATIONAL WORKSHOP ON ARABIC AND DERIVED SCRIPT ANALYSIS AND RECOGNITION (ASAR), P29, DOI 10.1109/ASAR.2018.8480202
  • [2] [Anonymous], 2009, CHIN C PATT REC
  • [3] Bazazian D., 2017, ARXIV170205089
  • [4] Chen XR, 2004, PROC CVPR IEEE, P366
  • [5] The PASCAL Visual Object Classes Challenge: A Retrospective
    Everingham, Mark
    Eslami, S. M. Ali
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) : 98 - 136
  • [6] Gonzalez R.C., 2005, Digital image processing
  • [7] Hou Yueyun, 2006, COMPUTER APPL, V26, P29
  • [8] Words Matter: Scene Text for Image Classification and Retrieval
    Karaoglu, Sezer
    Tao, Ran
    Gevers, Theo
    Smeulders, Arnold W. M.
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (05) : 1063 - 1076
  • [9] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [10] PERSPECTIVE TRANSFORMATION
    MEZIROW, J
    [J]. ADULT EDUCATION, 1978, 28 (02): : 100 - 110