Math Spotting: Retrieving Math in Technical Documents Using Handwritten Query Images

被引:9
作者
Zanibbi, Richard [1 ]
Yu, Li [2 ]
机构
[1] Rochester Inst Technol, Dept Comp Sci, Rochester, NY 14623 USA
[2] Illinois Inst Technol, Dept Comp Sci, Chicago, IL USA
来源
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011) | 2011年
基金
美国国家科学基金会;
关键词
Mathematical Information Retrieval; Math Recognition; Keyword Spotting; OCR;
D O I
10.1109/ICDAR.2011.96
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A method for locating mathematical expressions in document images without the use of optical character recognition is presented. An index of document regions is produced from recursive X-Y trees produced for each page in the corpus. Queries are provided as images of handwritten expressions, for which an X-Y tree is computed. During retrieval, the query is looked up in the document region index using features of its X-Y tree, producing a set of candidate regions. Candidate regions are ranked by the similarity of vertical pixel projections in their upper and lower halves with those of the query image, as computed using Dynamic Time Warping of the image columns. In an experiment, ten participants each wrote twenty queries from a 200-page corpus. On average, the top-10 retrieval candidates included a candidate covering 43.3% of the test query image (sigma = 14.0), with the correct page being returned between 30.0% and 85.0% of the time across participants (mu = 63.2%, sigma = 14.9%). When testing using the original query images, 90.0% of the queries were retrieved correctly.
引用
收藏
页码:446 / 451
页数:6
相关论文
共 28 条
[1]  
[Anonymous], 2008, LEARNING OPENCV COMP
[2]  
[Anonymous], 2010, THESIS
[3]  
Blostein Dorothea, 1997, Handbook of character recognition and document image analysis, P557
[4]   Image retrieval: Ideas, influences, and trends of the new age [J].
Datta, Ritendra ;
Joshi, Dhiraj ;
Li, Jia ;
Wang, James Z. .
ACM COMPUTING SURVEYS, 2008, 40 (02)
[5]   The indexing and retrieval of document images: A survey [J].
Doermann, D .
COMPUTER VISION AND IMAGE UNDERSTANDING, 1998, 70 (03) :287-298
[6]  
Einwohner T. H., 1995, Proceedings of the 1995 International Symposium on Symbolic and Algebraic Computation, ISSAC '95, P133, DOI 10.1145/220346.220364
[7]  
Garain Utpal, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P1340, DOI 10.1109/ICDAR.2009.203
[8]  
Garain U, 2007, ADV PATTERN RECOGNIT, P235, DOI 10.1007/978-1-84628-726-8_11
[9]  
Jaekyu Ha, 1995, Proceedings of the Third International Conference on Document Analysis and Recognition, P952, DOI 10.1109/ICDAR.1995.602059
[10]   Automatic extraction of printed mathematical formulas using fuzzy logic and propagation of context [J].
Kacem A. ;
Belaïd A. ;
Ben Ahmed M. .
International Journal on Document Analysis and Recognition, 2001, Springer Verlag (04) :97-108