Principal Component 2-D Long Short-Term Memory for Font Recognition on Single Chinese Characters

被引:78
作者
Tao, Dapeng [1 ]
Lin, Xu [2 ]
Jin, Lianwen [2 ]
Li, Xuelong [3 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650091, Peoples R China
[2] S China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510640, Guangdong, Peoples R China
[3] Chinese Acad Sci, Xian Inst Opt & Precis Mech, State Key Lab Transient Opt & Photon, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian 710119, Shaanxi, Peoples R China
基金
中国国家自然科学基金; 高等学校博士学科点专项科研基金;
关键词
Font recognition; long short-term memory; neurodynamic models; optical character recognition; recurrent neural networks (RNNs); NEURAL-NETWORK; FEATURE-EXTRACTION; FEATURE-SELECTION; CLASSIFICATION; FEATURES; ALGORITHMS; ONLINE; IMAGES; SCALE;
D O I
10.1109/TCYB.2015.2414920
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Chinese character font recognition (CCFR) has received increasing attention as the intelligent applications based on optical character recognition becomes popular. However, traditional CCFR systems do not handle noisy data effectively. By analyzing in detail the basic strokes of Chinese characters, we propose that font recognition on a single Chinese character is a sequence classification problem, which can be effectively solved by recurrent neural networks. For robust CCFR, we integrate a principal component convolution layer with the 2-D long short-term memory (2DLSTM) and develop principal component 2DLSTM (PC-2DLSTM) algorithm. PC-2DLSTM considers two aspects: 1) the principal component layer convolution operation helps remove the noise and get a rational and complete font information and 2) simultaneously, 2DLSTM deals with the long-range contextual processing along scan directions that can contribute to capture the contrast between character trajectory and background. Experiments using the frequently used CCFR dataset suggest the effectiveness of PC-2DLSTM compared with other state-of-the-art font recognition methods.
引用
收藏
页码:756 / 765
页数:10
相关论文
共 59 条
[1]   Biometric Recognition Based on Free-Text Keystroke Dynamics [J].
Ahmed, Ahmed A. ;
Traore, Issa .
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (04) :458-472
[2]  
[Anonymous], 2005, PROC CVPR IEEE
[3]  
[Anonymous], 2008, Advances in neural information processing systems, DOI DOI 10.1007/978-1-4471-4072-6_12
[4]   Sparse Extreme Learning Machine for Classification [J].
Bai, Zuo ;
Huang, Guang-Bin ;
Wang, Danwei ;
Wang, Han ;
Westover, M. Brandon .
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (10) :1858-1870
[5]   Efficient Algorithms for Exact Inference in Sequence Labeling SVMs [J].
Bauer, Alexander ;
Goernitz, Nico ;
Biegler, Franziska ;
Mueller, Klaus-Robert ;
Kloft, Marius .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (05) :870-881
[6]   New features using fractal multi-dimensions for generalized Arabic font recognition [J].
Ben Moussa, Sami ;
Zahour, Abderrazak ;
Benabdelhafid, Abdellatif ;
Alimi, Adel M. .
PATTERN RECOGNITION LETTERS, 2010, 31 (05) :361-371
[7]   Manifold Adaptive Experimental Design for Text Categorization [J].
Cai, Deng ;
He, Xiaofei .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (04) :707-719
[8]   Robust Subspace Segmentation Via Low-Rank Representation [J].
Chen, Jinhui ;
Yang, Jian .
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (08) :1432-1445
[9]   Nonnegative Local Coordinate Factorization for Image Representation [J].
Chen, Yan ;
Zhang, Jiemi ;
Cai, Deng ;
Liu, Wei ;
He, Xiaofei .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (03) :969-979
[10]   Necessary and Sufficient Conditions for Consensus of Double-Integrator Multi-Agent Systems With Measurement Noises [J].
Cheng, Long ;
Hou, Zeng-Guang ;
Tan, Min ;
Wang, Xu .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2011, 56 (08) :1958-1963