A Novel Word-Spotting Method for Handwritten Documents Using an Optimization-Based Classifier

被引:2
作者
Tavoli, Reza [1 ]
Keyvanpour, Mohammadreza [2 ]
机构
[1] Islamic Azad Univ, Qazvin Branch, Dept Comp Engn, Qazvin, Iran
[2] Alzahra Univ, Dept Comp Engn, Deh E Vanak St, Tehran 1993893973, Iran
关键词
RETRIEVAL;
D O I
10.1080/08839514.2017.1346964
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word spotting is the answer to the question whether the document contains the user's query word. One of the main challenges of keyword spotting at the testing stage is that some testing non-classes are not included in training classes. Hence, this paper presents a robust handwritten word-spotting method for handwritten documents using genetic programming (GP). Using this technique, a tree is created as a classifier which separates the target class (keyword) from the other classes (non-keyword). The new components of the proposed classifier include proper chromosome and new classification fitness function. The proposed chromosome was based on the relationship between features and each chromosome (tree) mapped the features to a real number. Then, a margin was obtained from the real number. To evaluate the generality of the proposed method, several experiments have been designed and implemented on three standard datasets (namely IFN/ENIT Arabic for Arabic, IFN/Farsi for Persian, and George Washington for English). The results of experiments carried out on these three datasets show that the proposed method has much higher precision and recall than previous methods
引用
收藏
页码:346 / 375
页数:30
相关论文
共 53 条
[1]   Word matching using single closed contours for indexing handwritten historical documents [J].
Adamek, Tornasz ;
O'Connor, Noel E. ;
Smeaton, Alan F. .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2007, 9 (2-4) :153-165
[2]  
Al-Khayat M., 2014, THESIS
[3]   A study of Bag-of-Visual-Words representations for handwritten keyword spotting [J].
Aldavert, David ;
Rusinol, Marcal ;
Toledo, Ricardo ;
Llados, Josep .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2015, 18 (03) :223-234
[4]  
[Anonymous], THESIS
[5]  
[Anonymous], 2006, 10 INT WORKSH FRONT
[6]   Special issue on the analysis of historical documents [J].
Antonacopoulos, Apostolos ;
Downton, Andy C. .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2007, 9 (2-4) :75-77
[7]   Shape matching and object recognition using shape contexts [J].
Belongie, S ;
Malik, J ;
Puzicha, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522
[8]  
Bhardwaj A., 2008, 2nd Intl Workshop on Cross Lingual Information Access, P48
[9]  
Bin Zhang, 2003, Proceedings of the SPIE - The International Society for Optical Engineering, V5296, P45, DOI 10.1117/12.523968
[10]  
Cheriet M., 2012, GUIDE OCR ARABIC SCR, P453