Tandem hidden Markov models using deep belief networks for offline handwriting recognition

被引:0
作者
Partha Pratim Roy
Guoqiang Zhong
Mohamed Cheriet
机构
[1] Indian Institute of Technology Roorkee,Department of Computer Science & Engineering
[2] Ocean University of China,Department of Computer Science and Technology
[3] École de Technologie Supérieure,Synchromedia Laboratory
来源
Frontiers of Information Technology & Electronic Engineering | 2017年 / 18卷
关键词
Handwriting recognition; Hidden Markov models; Deep learning; Deep belief networks; Tandem approach; TP391;
D O I
暂无
中图分类号
学科分类号
摘要
Unconstrained offline handwriting recognition is a challenging task in the areas of document analysis and pattern recognition. In recent years, to sufficiently exploit the supervisory information hidden in document images, much effort has been made to integrate multi-layer perceptrons (MLPs) in either a hybrid or a tandem fashion into hidden Markov models (HMMs). However, due to the weak learnability of MLPs, the learnt features are not necessarily optimal for subsequent recognition tasks. In this paper, we propose a deep architecture-based tandem approach for unconstrained offline handwriting recognition. In the proposed model, deep belief networks are adopted to learn the compact representations of sequential data, while HMMs are applied for (sub-)word recognition. We evaluate the proposed model on two publicly available datasets, i.e., RIMES and IFN/ENIT, which are based on Latin and Arabic languages respectively, and one dataset collected by ourselves called Devanagari (an Indian script). Extensive experiments show the advantage of the proposed model, especially over the MLP-HMMs tandem approaches.
引用
收藏
页码:978 / 988
页数:10
相关论文
共 57 条
[1]  
Baum L.E.(1970)A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains Ann. Math. Statist. 41 164-171
[2]  
Petrie T.(2008)Hidden Markov modelbased ensemble methods for offline handwritten text line recognition Patt. Recog. 41 3452-3460
[3]  
Soules G.(2011)Dynamic and contextual information in HMM modeling for handwritten word recognition IEEE Trans. Patt. Anal. Mach. Intell. 33 2066-2080
[4]  
Bertolami R.(2011)Confidence-and margin-based MMI/MPE discriminative training for offline handwriting recognition Int. J. Doc. Anal. Recog. 14 273-288
[5]  
Bunke H.(1999)An HMM-based approach for off-line unconstrained handwritten word modeling and recognition IEEE Trans. Patt. Anal. Mach. Intell. 21 752-760
[6]  
Bianne-Bernard A.L.(2011)Improving offline handwritten text recognition with hybrid HMM/ANN models IEEE Trans. Patt. Anal. Mach. Intell. 33 767-779
[7]  
Menasri F.(2008)Forty years of research in character and document recognition—an industrial perspective Patt. Recog. 41 2435-2446
[8]  
Mohamad R.A.H.(2009)A novel connectionist system for unconstrained handwriting recognition IEEE Trans. Patt. Anal. Mach. Intell. 31 855-868
[9]  
Dreuw P.(2002)Training products of experts by minimizing contrastive divergence Neur. Comput. 14 1771-1800
[10]  
Heigold G.(2006)A fast learning algorithm for deep belief nets Neur. Comput. 18 1527-1554