Fast HMM-Filler approach for Key Word Spotting in Handwritten Documents

被引:17
作者
Hector Toselli, Alejandro [1 ]
Vidal, Enrique [1 ]
机构
[1] Univ Politecn Valencia, Valencia 46022, Spain
来源
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR) | 2013年
关键词
D O I
10.1109/ICDAR.2013.106
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The so-called filler or garbage Hidden Markov Models (HMM) are among the most widely used models for lexicon-free, query by string key word spotting in the fields of speech recognition and (lately) handwritten text recognition. An important drawback of this approach is the large computational cost of the keyword-specific HMM Viterbi decoding process needed to obtain the confidence scores of each word to be spotted. This paper presents a novel way to compute such confidence scores, directly from character lattices produced during a single Viterbi decoding process using only the "filler" model (i.e. no explicit keyword-specific decoding is needed). Experiments show that, as compared with the classical HMM-filler approach, the proposed method obtains essentially the same spotting results, while requiring between one and two orders of magnitude less query computing time.
引用
收藏
页码:501 / 505
页数:5
相关论文
共 13 条
[1]  
[Anonymous], 1997, HTK BOOK HIDDEN MARK
[2]  
[Anonymous], 2008, Introduction to information retrieval
[3]  
[Anonymous], 2008, P 31 ANN INT ACM SIG, DOI DOI 10.1145/1390334.1390453
[4]   Lexicon-free handwritten word spotting using character HMMs [J].
Fischer, Andreas ;
Keller, Andreas ;
Frinken, Volkmar ;
Bunke, Horst .
PATTERN RECOGNITION LETTERS, 2012, 33 (07) :934-942
[5]   A Novel Word Spotting Method Based on Recurrent Neural Networks [J].
Frinken, Volkmar ;
Fischer, Andreas ;
Manmatha, R. ;
Bunke, Horst .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (02) :211-224
[6]  
Jelinek F., 1998, STAT METHODS SPEECH
[7]   Discriminative keyword spotting [J].
Keshet, Joseph ;
Grangier, David ;
Bengio, Samy .
SPEECH COMMUNICATION, 2009, 51 (04) :317-329
[8]  
Lleida E., 1993, P 3 EUR C SPEECH COM, P1265
[9]   The IAM-database: An English sentence database for offline handwriting recognition [J].
U.-V. Marti ;
H. Bunke .
International Journal on Document Analysis and Recognition, 2002, 5 (1) :39-46
[10]  
Romero V, 2007, LECT NOTES COMPUT SC, V4633, P1182