A Novel Word-Spotting Method for Handwritten Documents Using an Optimization-Based Classifier

Cited by: 2
Authors
Tavoli, Reza [1]
Keyvanpour, Mohammadreza [2]
Affiliations
[1] Islamic Azad Univ, Qazvin Branch, Dept Comp Engn, Qazvin, Iran
[2] Alzahra Univ, Dept Comp Engn, Deh E Vanak St, Tehran 1993893973, Iran
Keywords
RETRIEVAL;
DOI
10.1080/08839514.2017.1346964
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Word spotting answers the question of whether a document contains the user's query word. One of the main challenges of keyword spotting is that some non-keyword classes encountered at the testing stage are not included in the training classes. Hence, this paper presents a robust word-spotting method for handwritten documents using genetic programming (GP). Using this technique, a tree is created as a classifier that separates the target class (keyword) from the other classes (non-keyword). The new components of the proposed classifier are a suitable chromosome representation and a new classification fitness function. The proposed chromosome is based on the relationships between features: each chromosome (tree) maps the features to a real number, and a margin is then obtained from that real number. To evaluate the generality of the proposed method, several experiments were designed and carried out on three standard datasets (IFN/ENIT for Arabic, IFN/Farsi for Persian, and George Washington for English). The results on these three datasets show that the proposed method achieves much higher precision and recall than previous methods.
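The idea of a tree that maps a feature vector to a real number, which is then compared against a margin, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function set (`add`, `sub`, `mul`), the reading of the margin as a decision threshold on the tree output, the use of F1 as a stand-in fitness, and all names and sample values are hypothetical. The paper's actual chromosome design and fitness function are not described in the abstract.

```python
from dataclasses import dataclass
from typing import Sequence, Union

# Hypothetical function set for internal nodes (the paper's set is not given here).
OPS = {
    "add": lambda a, b: a + b,
    "sub": lambda a, b: a - b,
    "mul": lambda a, b: a * b,
}


@dataclass(frozen=True)
class Feature:
    index: int  # terminal node: reads one entry of the feature vector


@dataclass(frozen=True)
class Const:
    value: float  # terminal node: a fixed real constant


@dataclass(frozen=True)
class Node:
    op: str  # key into OPS
    left: "Tree"
    right: "Tree"


Tree = Union[Feature, Const, Node]


def evaluate(tree: Tree, x: Sequence[float]) -> float:
    """Map a feature vector to a single real number by walking the tree."""
    if isinstance(tree, Feature):
        return x[tree.index]
    if isinstance(tree, Const):
        return tree.value
    return OPS[tree.op](evaluate(tree.left, x), evaluate(tree.right, x))


def is_keyword(tree: Tree, x: Sequence[float], margin: float = 0.0) -> bool:
    """Assumed decision rule: keyword if the tree output exceeds the margin."""
    return evaluate(tree, x) > margin


def fitness(tree: Tree, samples, labels, margin: float = 0.0) -> float:
    """Stand-in fitness: F1 score of the keyword class (not the paper's function)."""
    tp = fp = fn = 0
    for x, y in zip(samples, labels):
        pred = is_keyword(tree, x, margin)
        if pred and y:
            tp += 1
        elif pred and not y:
            fp += 1
        elif y and not pred:
            fn += 1
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical chromosome encoding x[0] - 2 * x[1], with made-up samples.
tree = Node("sub", Feature(0), Node("mul", Const(2.0), Feature(1)))
samples = [(3.0, 1.0), (1.0, 1.0), (5.0, 2.0), (0.0, 0.5)]
labels = [True, False, True, False]  # True marks the keyword class
score = fitness(tree, samples, labels)
```

In a full GP loop, a population of such trees would be varied by crossover and mutation and ranked by the fitness function. Those steps are omitted here because the abstract does not specify them.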
Pages: 346-375
Page count: 30