Cellular wave computer algorithms with spatial semantic embedding for handwritten text recognition

Cited by: 3
Authors
Karacs, Kristof [1, 2]
Proszeky, Gabor [2, 3]
Roska, Tamas [1, 2]
Affiliations
[1] Hungarian Acad Sci, Anal & Neural Comp Lab, Comp & Automat Res Inst, H-1111 Budapest, Hungary
[2] Pazmany Peter Catholic Univ, Fac Informat Technol, Jedl Lab, H-1083 Budapest, Hungary
[3] MorphoLogic Ltd, H-1126 Budapest, Hungary
Keywords
handwriting recognition; lexicon reduction; cellular wave computer; semantic embedding; word recognition; neural networks; machine
DOI
10.1002/cta.485
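Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract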
The recognition of cursive handwritten texts is a complex, in some cases unsolvable, task. One problem is that in most cases it is difficult or impossible to identify each letter, even if the words are separated. In our new method, the identification of letters is not needed due to the extensive and iterative use of semantic and morphological information of a given language. We are using a spatial feature code, generated by a cellular nonlinear network (CNN) based cellular wave computer algorithm, and combine it with the linguistic properties of the given language. Most general-purpose handwriting recognition systems lack the ability to integrate linguistic background knowledge because they use it only for post-processing recognition results. The high-level a priori background knowledge is, however, crucial in human reading and similarly it can boost recognition rates dramatically in the case of recognition systems. In our new system we do not treat the visual source as the only input: geometric and linguistic information are given equal importance. On the geometric side we use word-level holistic feature detection without letter segmentation by analogic CNN algorithms designed for cellular wave computers (IEEE Trans. Circuits Syst. 1993; 40:163-173; Cellular Neural Networks and Visual Computing, Foundations and Applications. Cambridge University Press: Cambridge, U.K., New York, 2002). The linguistic side is based on a morpho-syntactic linguistic system (Proceedings of COLING-2002, vol. II, Taipei, Taiwan, 2002; 1263-1267). A novel shape coding method is used to interface them, and their interaction is enhanced via an inverse filtering technique based on features that are global or of a low confidence value. A statistical context selection method is also applied to further reduce the output word lists. Copyright (C) 2008 John Wiley & Sons, Ltd.
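Illustrative note (not part of the published abstract): the idea of reducing a lexicon with a word-level shape code, without segmenting letters, can be sketched roughly as follows. This is a minimal sketch under loud assumptions: the ascender/descender coding used here is a classic simplified stand-in and is not the CNN-based spatial feature code of the paper, and the names `shape_code`, `build_index`, and `reduce_lexicon` are hypothetical.

```python
from collections import defaultdict

# Assumption: a toy holistic code based on ascenders and descenders only.
# The paper's actual feature code comes from CNN wave computer algorithms.
ASCENDERS = set("bdfhklt")
DESCENDERS = set("gjpqy")


def shape_code(word: str) -> str:
    """Map a word to a coarse shape string: A = ascender, D = descender, x = neither."""
    code = []
    for ch in word.lower():
        if ch in ASCENDERS:
            symbol = "A"
        elif ch in DESCENDERS:
            symbol = "D"
        else:
            symbol = "x"
        # Collapse runs of the same symbol: a holistic detector sees zones,
        # not the number of letters inside each zone.
        if not code or code[-1] != symbol:
            code.append(symbol)
    return "".join(code)


def build_index(lexicon):
    """Group lexicon words by shape code so lookup by observed code is direct."""
    index = defaultdict(list)
    for word in lexicon:
        index[shape_code(word)].append(word)
    return index


def reduce_lexicon(index, observed_code):
    """Return only the words whose shape code matches the observed one."""
    return index.get(observed_code, [])


lexicon = ["hand", "land", "wave", "code", "text", "page"]
index = build_index(lexicon)
candidates = reduce_lexicon(index, "AxA")
print(candidates)
```

In this toy setting the observed code only narrows the word list; choosing among the remaining candidates would be left to linguistic context, which is the role the abstract assigns to the morpho-syntactic and statistical components.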
Pages: 1019-1050
Page count: 32
Related papers
30 entries in total
[1]  
Chua L. O., 2002, Cellular neural networks and visual computing: foundations and applications
[2]  
Cote M, 1997, PROC INT CONF DOC, P107, DOI 10.1109/ICDAR.1997.619823
[3]   Immune response inspired spatial-temporal target detection algorithms with CNN-UM [J].
Cserey, G ;
Falus, A ;
Roska, T .
INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS, 2006, 34 (01) :21-47
[4]   Computational auditory scene analysis in cellular wave computing framework [J].
Fodroczi, Zoltan ;
Radvanyi, Andras .
INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND APPLICATIONS, 2006, 34 (04) :489-515
[5]  
Gibbon D., 1997, Handbook of standards and resources for spoken language systems
[6]  
GUILLEVIC D, 2000, P 7 INT WORKSH FRONT, P373
[7]  
KARACS K, 2004, P 8 IEEE INT WORKSH, P364
[8]  
KARAES KG, 2003, P 16 EUR C CIRC THEO, P409
[9]  
Kek L., 2007, CELLULAR WAVE COMPUT
[10]   Large vocabulary off-line handwriting recognition: A survey [J].
Koerich, AL ;
Sabourin, R ;
Suen, CY .
PATTERN ANALYSIS AND APPLICATIONS, 2003, 6 (02) :97-121