Chinese Character CAPTCHA Recognition and performance estimation via deep neural network

被引:29
|
作者
Lin, Dazhen [1 ,2 ,4 ]
Lin, Fan [3 ]
Lv, Yanping [1 ,2 ]
Cai, Feipeng [1 ,2 ]
Cao, Donglin [1 ,2 ,4 ]
机构
[1] Xiamen Univ, Cognit Sci Dept, Xiamen, Peoples R China
[2] Fujian Key Lab Machine Intelligence & Robot, Xiamen, Peoples R China
[3] Xiamen Univ, Software Sch, Xiamen, Peoples R China
[4] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou, Fujian, Peoples R China
关键词
Chinese Character CAPTCHA Recognition; Web application; Performance estimation; Exponential relationship;
D O I
10.1016/j.neucom.2017.02.105
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To identify machine and human, Completely Automated Public Turing test to tell Computers and Humans Apart (CAPTCHA) is increasingly used in many web applications. The classical English and digital characters based CAPTCHAs are recognized with high accuracy. Due to the complication of Chinese characters which greatly enhance the difficulty of automatic recognition, an increasing number of Chinese web sites use Chinese Character CAPTCHAs. To recognize Chinese Character CAPTCHAs, we propose a Convolution Neural Network (CNN) based approach to learn strokes, radicals and character features of Chinese characters, and prove that our network structure is superior to LENET-5 in this task. Furthermore, we formulate the relation among accuracy, the number of training samples and iterations, which is used to estimate the performance of our approach. Firstly, this approach greatly improves the recognition accuracy of Chinese Character CAPTCHAs with distortion, rotation and background noise. Our experiments results show that this approach achieves over 95% accuracy for single Chinese character and 84% accuracy for three types of Chinese Character CAPTCHAs with four Chinese characters. Secondly, our experiment results and theoretical analysis show that the accuracy of recognition has the exponential relationship with the product of the number of training samples and iterations in the condition of enough and representative training samples. Therefore, we can estimate the training time for a certain accuracy. Finally, we certify that our approach is superior to the most famous Chinese Optical Character Recognition (OCR) software, Hanvon, in Chinese Character CAPTCHAs recognition. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:11 / 19
页数:9
相关论文
共 5 条
  • [1] Improving Performance Estimation for Design Space Exploration for Convolutional Neural Network Accelerators
    Ferianc, Martin
    Fan, Hongxiang
    Manocha, Divyansh
    Zhou, Hongyu
    Liu, Shuanglong
    Niu, Xinyu
    Luk, Wayne
    ELECTRONICS, 2021, 10 (04) : 1 - 14
  • [2] Automatic Generation of Fast and Accurate Performance Models for Deep Neural Network Accelerators
    Luebeck, Konstantin
    Jung, Alexander Louis-Ferdinand
    Wedlich, Felix
    Mueller, Mika Markus
    Peccia, Federico Nicolas
    Thoemmes, Felix
    Steinmetz, Jannik
    Biermaier, Valentin
    Frischknecht, Adrian
    Bernardo, Paul Palomero
    Bringmann, Oliver
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2025, 24 (02)
  • [3] Aries: Efficient Testing of Deep Neural Networks via Labeling-Free Accuracy Estimation
    Hu, Qiang
    Guo, Yuejun
    Xie, Xiaofei
    Cordy, Maxime
    Papadakis, Mike
    Ma, Lei
    Le Traon, Yves
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ICSE, 2023, : 1776 - 1787
  • [4] Classifier performance estimation under the constraint of a finite sample size: Resampling schemes applied to neural network classifiers
    Sahiner, Berkman
    Chan, Heang-Ping
    Hadjiiski, Lubomir
    NEURAL NETWORKS, 2008, 21 (2-3) : 476 - 483
  • [5] Experimental implementation of artificial neural network for cost effective and non-intrusive performance estimation of air conditioning systems
    Sholahudin
    Giannetti, Niccolo
    Yamaguchi, Seiichi
    Saito, Kiyoshi
    Miyaoka, Yoichi
    Tanaka, Katsuhiko
    Ogami, Hiroto
    APPLIED THERMAL ENGINEERING, 2020, 181