Tandem hidden Markov models using deep belief networks for offline handwriting recognition

被引:14
作者
Roy, Partha Pratim [1 ]
Zhong, Guoqiang [2 ]
Cheriet, Mohamed [3 ]
机构
[1] Indian Inst Technol Roorkee, Dept Comp Sci & Engn, Roorkee 247667, Uttar Pradesh, India
[2] Ocean Univ China, Dept Comp Sci & Technol, Qingdao 266100, Peoples R China
[3] Ecole Technol Super, Synchromedia Lab, Montreal, PQ H3C 1K3, Canada
基金
中国国家自然科学基金;
关键词
Handwriting recognition; Hidden Markov models; Deep learning; Deep belief networks; Tandem approach; CHARACTER;
D O I
10.1631/FITEE.1600996
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unconstrained offline handwriting recognition is a challenging task in the areas of document analysis and pattern recognition. In recent years, to sufficiently exploit the supervisory information hidden in document images, much effort has been made to integrate multi-layer perceptrons (MLPs) in either a hybrid or a tandem fashion into hidden Markov models (HMMs). However, due to the weak learnability of MLPs, the learnt features are not necessarily optimal for subsequent recognition tasks. In this paper, we propose a deep architecture-based tandem approach for unconstrained offline handwriting recognition. In the proposed model, deep belief networks are adopted to learn the compact representations of sequential data, while HMMs are applied for (sub-)word recognition. We evaluate the proposed model on two publicly available datasets, i.e., RIMES and IFN/ENIT, which are based on Latin and Arabic languages respectively, and one dataset collected by ourselves called Devanagari (an Indian script). Extensive experiments show the advantage of the proposed model, especially over the MLP-HMMs tandem approaches.
引用
收藏
页码:978 / 988
页数:11
相关论文
共 45 条
[1]  
[Anonymous], 2008, Proc. ICFHR
[2]  
[Anonymous], 2001, Neural Networks: A Comprehensive Foundation
[3]  
[Anonymous], 2009, Proceedings of the 4th Workshop on Statistical Machine Translation
[4]  
[Anonymous], 1994, Connectionist Speech Recognition: A Hybrid Approach
[5]  
Augustin E, 2006, P INT WORKSH FRONT H, P231
[6]   A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T ;
SOULES, G ;
WEISS, N .
ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&
[7]   Hidden Markov model-based ensemble methods for offline handwritten text line recognition [J].
Bertolami, Roman ;
Bunke, Horst .
PATTERN RECOGNITION, 2008, 41 (11) :3452-3460
[8]   Dynamic and Contextual Information in HMM Modeling for Handwritten Word Recognition [J].
Bianne-Bernard, Anne-Laure ;
Menasri, Fares ;
Mohamad, Rami Al-Hajj ;
Mokbel, Chafic ;
Kermorvant, Christopher ;
Likforman-Sulem, Laurence .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (10) :2066-2080
[9]  
Bunke H, 2003, PROC INT CONF DOC, P448
[10]  
Dahl GE, 2011, INT CONF ACOUST SPEE, P4688