TANDEM HMM WITH CONVOLUTIONAL NEURAL NETWORK FOR HANDWRITTEN WORD RECOGNITION

Cited by: 0
Authors
Bluche, Theodore
Ney, Hermann
Kermorvant, Christopher
Affiliations
Source
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013
Keywords
Handwriting recognition; Hidden Markov Model; Convolutional Neural Network
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we investigate the combination of hidden Markov models (HMMs) and convolutional neural networks for handwritten word recognition. Convolutional neural networks have been successfully applied to various computer vision tasks, including handwritten character recognition. In this work, we show that they can replace Gaussian mixtures to compute emission probabilities in hidden Markov models (hybrid combination), or serve as a feature extractor for a standard Gaussian HMM system (tandem combination). The proposed systems outperform a basic HMM based on either decorrelated pixels or handcrafted features. We validated the approach on two publicly available databases, and we report up to 60% (Rimes) and 35% (IAM) relative improvement compared to a Gaussian HMM based on pixel values. The final systems give results comparable to those of recurrent neural networks, which have been the best-performing systems since 2009.
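The hybrid combination mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how network outputs are turned into emission scores, so the code below assumes the conversion commonly used in hybrid NN/HMM systems: each state posterior p(s|x) is divided by the state prior p(s) to obtain a likelihood scaled by a state-independent constant. The function names, the toy two-state model, and all numeric values are hypothetical and are not taken from the paper.

```python
import numpy as np


def scaled_log_likelihoods(posteriors, state_priors, eps=1e-10):
    """Turn per-frame state posteriors p(s|x) into scaled log-likelihoods.

    Assumption (not stated in the abstract): p(x|s) is proportional to
    p(s|x) / p(s), so log p(s|x) - log p(s) can stand in for the
    Gaussian-mixture log-likelihood during decoding.
    """
    posteriors = np.asarray(posteriors, dtype=float)
    state_priors = np.asarray(state_priors, dtype=float)
    return np.log(posteriors + eps) - np.log(state_priors + eps)


def viterbi(log_emissions, log_trans, log_init):
    """Return the most likely state sequence under the given log scores."""
    n_frames, n_states = log_emissions.shape
    delta = log_init + log_emissions[0]
    backptr = np.zeros((n_frames, n_states), dtype=int)
    for t in range(1, n_frames):
        # scores[i, j]: best score ending in state i at t-1, then moving to j
        scores = delta[:, None] + log_trans
        backptr[t] = np.argmax(scores, axis=0)
        delta = scores[backptr[t], np.arange(n_states)] + log_emissions[t]
    # Trace the best path backwards from the best final state
    path = [int(np.argmax(delta))]
    for t in range(n_frames - 1, 0, -1):
        path.append(int(backptr[t, path[-1]]))
    return path[::-1]
```

In a hybrid system of this kind, the rows of `posteriors` would be the network's softmax outputs for successive frames of the word image, and `state_priors` would typically be estimated from state frequencies in the training alignment. The tandem combination is different: there the network's outputs or hidden activations are used as input features to a conventional Gaussian HMM, so no such conversion is involved.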
Pages: 2390-2394
Page count: 5