Identification of Mongolian and Chinese Species in Natural Scenes Based on Convolutional Neural Network

被引：0

作者：

Zhang, Jianxin

Hu, Chunxiao

机构：

来源：

2020 CHINESE AUTOMATION CONGRESS (CAC 2020) | 2020年

关键词：

language recognition; LTCNN; text correction; TEXT;

D O I：

10.1109/CAC51589.2020.9327774

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multilingual texts are very common in natural scenes. Different languages have different shapes and structure. According to the language characteristics to choose the appropriate method can reduce the recognition error rate during text detection and recognition. This paper mainly proposes a lightweight convolutional neural network LTCNN for natural scene images mixed Ns it h Chinese characters and Mongolian. Firstly, collecting pictures contained both Chinese characters and Mongolian; then using perspective transformation to process the slanted text image, and using gamma transformation to enhance the image of underexposed or overexposed pictures;nest using the improved EAST model for the preprocessed pictures to extract text makes the data set needed for training the network; finally, the pictures in the data set are fed into the LTCNN network for training and identification of Mongolian and Chinese species. Experimental results show that the accuracy of Chinese and Mongolian recognition using this method can be close to 90% accuracy.

引用

页码：2699 / 2704

页数：6

共 15 条

[1] Ali A, 2018, 2018 IEEE 2ND INTERNATIONAL WORKSHOP ON ARABIC AND DERIVED SCRIPT ANALYSIS AND RECOGNITION (ASAR), P29, DOI 10.1109/ASAR.2018.8480202
[2] [Anonymous], 2009, CHIN C PATT REC
[3] Bazazian D., 2017, ARXIV170205089
[4] Chen XR, 2004, PROC CVPR IEEE, P366
[5] The PASCAL Visual Object Classes Challenge: A Retrospective
Everingham, Mark
Eslami, S. M. Ali
Van Gool, Luc
Williams, Christopher K. I.
Winn, John
Zisserman, Andrew
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) : 98 - 136
[6] Gonzalez R.C., 2005, Digital image processing
[7] Hou Yueyun, 2006, COMPUTER APPL, V26, P29
[8] Words Matter: Scene Text for Image Classification and Retrieval
Karaoglu, Sezer
Tao, Ran
Gevers, Theo
Smeulders, Arnold W. M.
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (05) : 1063 - 1076
[9] ImageNet Classification with Deep Convolutional Neural Networks
Krizhevsky, Alex
Sutskever, Ilya
Hinton, Geoffrey E.
[J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
[10] PERSPECTIVE TRANSFORMATION
MEZIROW, J
[J]. ADULT EDUCATION, 1978, 28 (02): : 100 - 110

← 1 2 →