Off-line handwritten Chinese character recognition as a compound Bayes decision problem

被引:23
作者
Wong, PK [1 ]
Chan, CK [1 ]
机构
[1] Univ Hong Kong, Dept Comp Sci & Informat Syst, Pokfulam Rd, Hong Kong, Peoples R China
关键词
off-line handwritten Chinese character recognition; Chinese language modeling; compound Bayes decision; contextual vector quantization; Chinese word segmentation;
D O I
10.1109/34.713366
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A handwritten Chinese character off-line recognizer based on Contextual Vector Quantization (CVQ) of every pixel of an unknown character image has been constructed. Each template character is represented by a codebook. When an unknown image is matched against a template character, each pixel of the image is quantized according to the associated codebook by considering not just the feature vector observed at each pixel, but those observed at its neighbors and their quantizations as well. Structural information such as stroke counts observed at each pixel are captured to form a cellular feature vector. Supporting a vocabulary of 4,616 simplified Chinese characters and alphanumeric and punctuation symbols, the writer-independent recognizer has an average recognition rate of 77.2 percent. Three statistical language models for postprocessing have been studied for their effectiveness in upgrading the recognition rate of the system. Among them. the CVQ-based language model is the most effective one upgrading the recognition rate by 10.4 percent on the average.
引用
收藏
页码:1016 / 1023
页数:8
相关论文
共 22 条