RECOGNITION-BASED SEGMENTATION OF ONLINE RUN-ON HANDPRINTED WORDS - INPUT VS OUTPUT SEGMENTATION

Cited by: 11
Authors
WEISSMAN, H [1]
SCHENKEL, M [1]
GUYON, I [1]
NOHL, C [1]
HENDERSON, D [1]
Affiliation
[1] AT&T BELL LABS, HOLMDEL, NJ 07733
Keywords
CHARACTER RECOGNITION; ONLINE CHARACTER RECOGNITION; NEURAL NETWORKS; TIME DELAY NEURAL NETWORKS; SEGMENTATION; RUN-ON HANDWRITING
DOI
10.1016/0031-3203(94)90117-1
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The performance of two methods for recognition-based segmentation of strings of on-line handprinted capital Latin characters is reported. The input strings consist of a time-ordered sequence of X, Y coordinates, punctuated by pen-lifts. The methods are designed to work in "run-on mode", where there is no constraint on the spacing between characters. While both methods use a neural network recognition engine and a graph-algorithmic post-processor, their approaches to segmentation are quite different. The first method, which we call INSEG (for input segmentation), uses a combination of heuristics to identify particular pen-lifts as tentative segmentation points. The second method, which we call OUTSEG (for output segmentation), relies on the empirically trained recognition engine both for recognizing characters and for identifying relevant segmentation points. The best results are obtained with the INSEG method: 11% error on handprinted words from an 80,000-word dictionary.
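The input-segmentation idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the neural network recognizer is replaced by a hypothetical lookup table (`toy_recognize`), every pen-lift is treated as a tentative segmentation point, and the graph post-processor is reduced to a plain dynamic program over stroke boundaries with no dictionary. The names `best_segmentation`, `toy_recognize`, and `max_strokes` are assumptions introduced only for this example.

```python
import math
from typing import Callable, List, Sequence, Tuple

Stroke = List[Tuple[float, float]]  # X, Y points recorded between two pen-lifts
Recognizer = Callable[[Sequence[Stroke]], Tuple[str, float]]  # -> (label, probability)


def best_segmentation(strokes: Sequence[Stroke],
                      recognize: Recognizer,
                      max_strokes: int = 3) -> Tuple[str, float]:
    """Return the highest-scoring string and its log-probability.

    Each pen-lift is a tentative cut. A candidate character is any run of
    1..max_strokes consecutive strokes. The best path through the resulting
    graph maximizes the sum of log-probabilities of its candidates.
    """
    n = len(strokes)
    # best[i] holds (log-score, decoded text) for the first i strokes.
    best: List[Tuple[float, str]] = [(-math.inf, "")] * (n + 1)
    best[0] = (0.0, "")
    for end in range(1, n + 1):
        for start in range(max(0, end - max_strokes), end):
            prev_score, prev_text = best[start]
            if prev_score == -math.inf:
                continue  # no valid path reaches this boundary
            label, prob = recognize(strokes[start:end])
            if prob <= 0.0:
                continue  # candidate rejected by the recognizer
            cand = prev_score + math.log(prob)
            if cand > best[end][0]:
                best[end] = (cand, prev_text + label)
    return best[n][1], best[n][0]


def toy_recognize(segment: Sequence[Stroke]) -> Tuple[str, float]:
    """Hypothetical stand-in for a trained recognizer.

    It keys on the number of points in each stroke, which is meaningless
    for real handwriting and only serves to make the example runnable.
    """
    table = {
        (2,): ("I", 0.6),
        (2, 2): ("T", 0.9),
        (3,): ("C", 0.8),
    }
    key = tuple(len(s) for s in segment)
    return table.get(key, ("?", 0.01))


if __name__ == "__main__":
    strokes = [
        [(0.0, 1.0), (1.0, 1.0)],              # horizontal bar
        [(0.5, 1.0), (0.5, 0.0)],              # vertical bar
        [(2.0, 1.0), (1.5, 0.5), (2.0, 0.0)],  # open curve
    ]
    text, log_score = best_segmentation(strokes, toy_recognize)
    print(text, round(math.exp(log_score), 2))
```

In this toy setup, grouping the first two strokes into one character scores higher than reading them as two separate characters, so the path search prefers the two-character reading over the three-character one. A real system would also restrict paths to dictionary words, as the abstract's 80,000-word dictionary suggests.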
Pages: 405-420
Page count: 16