A Survey on Document Image Processing Methods Useful for Assistive Technology for the Blind

被引:11
作者
Keefer, Robert [1 ]
Bourbakis, Nikolaos [2 ]
机构
[1] Pomiet LLC, Dayton, OH 45449 USA
[2] Wright State Univ, Dayton, OH 45449 USA
关键词
Document image processing; skew correction; page curl correction; document image segmentation;
D O I
10.1142/S0219467815500059
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper offers a review of the state-of-the-art document image processing methods and their classification by identifying new trends for automatic document processing and understanding. Document image processing (DIP) is an important problem related with most of the challenges coming from the image processing field and with applications to digital document summarization, readers for the visually impaired etc. Difficulties in the processing of documents can arise from lighting conditions, page curl, page rotation in 3D, and page layout segmentation. Document image processing is usually performed in the context of higher-level applications that require an undistorted document image such as optical character recognition and document restoration/preservation. Typically, assumptions are made to constrain the processing problem in the context of a particular application. In this survey, we categorize document image processing methods on the basis of the technique, provide detailed descriptions of representative methods in each category, and examine their pros and cons. It important to notice here that the DIP field is broad, thus we try to provide a top-down/horizontal survey rather a bottom up. At the same time, we target the area of document readers for the blind, and use this application to guide us in a top-down survey of DIP. Moreover, we present a comparative survey based on important aspects of a marketable system that is dependent on document image processing techniques.
引用
收藏
页数:35
相关论文
共 59 条
[1]  
Baird H.S., 1987, P C SOC PHOT SCI ENG, P14
[2]  
Barkan E., 2007, U.S. Patent, Patent No. [7,204,420, 7204420]
[3]   A methodology of separating images from text using an OCR approach [J].
Bourbakis, NG .
IEEE INTERNATIONAL JOINT SYMPOSIA ON INTELLIGENCE AND SYSTEMS, PROCEEDINGS, 1996, :311-317
[4]  
Breuel TM, 2002, P SOC PHOTO-OPT INS, V4670, P20
[5]   Image restoration of arbitrarily warped documents [J].
Brown, MS ;
Seales, WB .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (10) :1295-1306
[6]  
Cao HG, 2003, PROC INT CONF DOC, P71
[7]   Document page segmentation using neuro-fuzzy approach [J].
Caponetti, Laura ;
Castiello, Ciro ;
Gorecki, Przemyslaw .
APPLIED SOFT COMPUTING, 2008, 8 (01) :118-126
[8]  
CARBERRY S, 2003, P 4 SIGDIAL WORKSH D, P1
[9]   Rectifying perspective views of text in 3D scenes using vanishing points [J].
Clark, P ;
Mirmehdi, M .
PATTERN RECOGNITION, 2003, 36 (11) :2673-2686
[10]   Recognising text in real scenes [J].
Clark P. ;
Mirmehdi M. .
International Journal on Document Analysis and Recognition, 2002, 4 (4) :243-257