Hmm-based system for transcribing Chinese handwriting

被引:0
作者
Su, Tong-Hua [1 ]
Zhang, Tian-Wen [1 ]
Qiu, Zhao-Wen [2 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Northeast Forestry Univ, Inst Informat & Comp Engn, Harbin 150040, Peoples R China
来源
PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7 | 2007年
关键词
Hidden Markov models; Chinese characters; optical character recognition; handwriting recognition; sliding window;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A novel recognition strategy is proposed for the transcription of Chinese handwritten documents. The recognizer adapts continuous density Hidden Markov Model (HMM) as the recognition engine. It incorporates character segmentation and recognition in one step avoiding character segmentation phase. Textline is extracted and converted to observation sequence by sliding windows first. Then Baum-Welch algorithm is used to train character HMMs. Finally, best character string in maximizing a posteriori criterion is found out through Viterbi algorithm as output. Experiments are conducted on a writer-dependent Chinese handwriting database with a 1,695 lexicon. The results show that our baseline recognizer outperforms much one popular commercial handwritten character recognition product and the strategy presented in this paper is a promising research direction.
引用
收藏
页码:3412 / +
页数:2
相关论文
共 15 条
[1]  
Chen YB, 1997, PROC INT CONF DOC, P206, DOI 10.1109/ICDAR.1997.619842
[2]   An HMM-based approach for off-line unconstrained handwritten word modeling and recognition [J].
El-Yacoubi, A ;
Gilloux, M ;
Sabourin, R ;
Suen, CY .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (08) :752-760
[3]  
FENG B, 2002, P 16 INT C PATT REC, P212
[4]  
LI Y, 2002, ACM T ASIAN LANGUAGE, V1, P297
[5]   Contextual post-processing based on the confusion matrix in offline handwritten Chinese script recognition [J].
Li, YX ;
Tan, CL ;
Ding, XQ ;
Liu, CS .
PATTERN RECOGNITION, 2004, 37 (09) :1901-1912
[6]  
Liu CL, 2005, PROC INT CONF DOC, P846
[7]   Using a statistical language model to improve the performance of an HMM-based cursive handwriting recognition system [J].
Marti, UV ;
Bunke, H .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2001, 15 (01) :65-90
[8]   Multilingual machine printed OCR [J].
Natarajan, P ;
Lu, ZD ;
Schwartz, R ;
Bazzi, I ;
Makhoul, J .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2001, 15 (01) :43-63
[9]   THRESHOLD SELECTION METHOD FROM GRAY-LEVEL HISTOGRAMS [J].
OTSU, N .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1979, 9 (01) :62-66
[10]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286