BLSTM Neural Network based Word Retrieval for Hindi Documents

被引：7

作者：

Jain, Raman ^{[1
]}

Frinken, Volkmar ^{[2
]}

Jawahar, C. V. ^{[1
]}

Manmatha, R. ^{[3
]}

机构：

[1] Int Inst Informat Technol Hyderabad, Ctr Visual Informat Technol, Hyderabad, Andhra Pradesh, India

[2] Univ Bern, Inst Comp Sci & Appl Math, CH-3012 Bern, Switzerland

[3] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA

来源：

11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011) | 2011年

关键词：

BLSTM neural network; Word Image Retrieval; Indian languages; Hindi; RECOGNITION;

D O I：

10.1109/ICDAR.2011.26

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Retrieval from Hindi document image collections is a challenging task. This is partly due to the complexity of the script, which has more than 800 unique ligatures. In addition, segmentation and recognition of individual characters often becomes difficult due to the writing style as well as degradations in the print. For these reasons, robust OCRs are non existent for Hindi. Therefore, Hindi document repositories are not amenable to indexing and retrieval. In this paper, we propose a scheme for retrieving relevant Hindi documents in response to a query word. This approach uses BLSTM neural networks. Designed to take contextual information into account, these networks can handle word images that can not be robustly segmented into individual characters. By zoning the Hindi words, we simplify the problem and obtain high retrieval rates. Our simplification suits the retrieval problem, while it does not apply to recognition. Our scalable retrieval scheme avoids explicit recognition of characters. An experimental evaluation on a dataset of word images gathered from two complete books demonstrates good accuracy even in the presence of printing variations and degradations. The performance is compared with baseline methods.

引用

页码：83 / 87

页数：5

共 12 条

[1]

Balasubramanian A, 2006, LECT NOTES COMPUT SC, V3872, P1

[2]

Bhardwaj A., 2008, DRR, P68150

[3]

Chaudhuri BB, 1997, PROC INT CONF DOC, P1011, DOI 10.1109/ICDAR.1997.620662

[4]

Frinken V., 2010, Proceedings 2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010), P352, DOI 10.1109/ICFHR.2010.61

[5] A Novel Connectionist System for Unconstrained Handwriting Recognition [J].

Graves, Alex ;

Liwicki, Marcus ;

Fernandez, Santiago ;

Bertolami, Roman ;

Bunke, Horst ;

Schmidhuber, Juergen .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (05) :855-868

[6]

Jawahar CV, 2007, PROC INT CONF DOC, P799

[7] Devanagari OCR using a recognition driven segmentation framework and stochastic language models [J].

Kompalli, Suryaprakash ;

Setlur, Srirangaraj ;

Govindaraju, Venu .

INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2009, 12 (02) :123-138

[8] Holistic word recognition for handwritten historical documents [J].

Lavrenko, V ;

Rath, TM ;

Manmatha, R .

FIRST INTERNATIONAL WORKSHOP ON DOCUMENT IMAGE ANALYSIS FOR LIBRARIES, PROCEEDINGS, 2004, :278-287

[9]

Natarajan P., 2010, GUIDE TO OCR FOR IND

[10] Indian script character recognition: a survey [J].

Pal, U ;

Chaudhuri, BB .

PATTERN RECOGNITION, 2004, 37 (09) :1887-1899

← 1 2 →