HIERARCHICAL HYBRID MLP/HMM OR RATHER MLP FEATURES FOR A DISCRIMINATIVELY TRAINED GAUSSIAN HMM: A COMPARISON FOR OFFLINE HANDWRITING RECOGNITION

被引:0
作者
Dreuw, Philippe [1 ]
Doetsch, Patrick [1 ]
Plahl, Christian [1 ]
Ney, Hermann [1 ]
机构
[1] Rhein Westfal TH Aachen, Dept Comp Sci, D-52056 Aachen, Germany
来源
2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2011年
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We use neural network based features extracted by a hierarchical multilayer-perceptron (MLP) network either in a hybrid MLP/HMM approach or to discriminatively retrain a Gaussian hidden Markov model (GHMM) system in a tandem approach. MLP networks have been successfully used to model long-term and non-linear features dependencies in automatic speech and optical character recognition. In offline handwriting recognition, MLPs have been mostly used for isolated character and word recognition in hybrid approaches. Here we analyze MLPs within an LVCSR framework for continuous handwriting recognition using discriminative MMI/MPE training. Especially hybrid MLP/HMM and discriminatively retrained MLP-GHMM tandem approaches are evaluated. Significant improvements and competitive results are reported for a closed-vocabulary task on the IfN/ENIT Arabic handwriting database and for a large-vocabulary task using the IAM English handwriting database.
引用
收藏
页数:4
相关论文
共 17 条
[1]  
[Anonymous], INTERSPEECH
[2]   Hidden Markov model-based ensemble methods for offline handwritten text line recognition [J].
Bertolami, Roman ;
Bunke, Horst .
PATTERN RECOGNITION, 2008, 41 (11) :3452-3460
[3]  
Biem AE, 2001, INT CONF ACOUST SPEE, P1529, DOI 10.1109/ICASSP.2001.941223
[4]  
Bourlard H., 1994, SERIES ENG COMPUTER, P247
[5]  
Dreuw Philippe, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P596, DOI 10.1109/ICDAR.2009.116
[6]  
Dreuw P., 2008, ICPR
[7]  
Dreuw P., 2011, IJDAR IN PRESS APR
[8]  
Espana-Boquera S., 2010, IEEE TPAMI
[9]   A Novel Connectionist System for Unconstrained Handwriting Recognition [J].
Graves, Alex ;
Liwicki, Marcus ;
Fernandez, Santiago ;
Bertolami, Roman ;
Bunke, Horst ;
Schmidhuber, Juergen .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (05) :855-868
[10]   DISCRIMINATIVE HMMS, LOG-LINEAR MODELS, AND CRFS: WHAT IS THE DIFFERENCE? [J].
Heigold, G. ;
Wiesler, S. ;
Nussbaum-Thom, M. ;
Lehnen, P. ;
Schlueter, R. ;
Ney, H. .
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :5546-5549