Document dewarping via text-line based optimization

被引:42
作者
Kim, Beom Su [1 ]
Koo, Hyung Il [2 ]
Cho, Nam Ik [1 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, INMC, Seoul 151, South Korea
[2] Ajou Univ, Dept Elect & Comp Engn, Suwon 441749, South Korea
基金
新加坡国家研究基金会;
关键词
Document image; Document rectification; Document dewarping; Generalized cylindrical surface; Optical character recognition; Text-line based dewarping; SHADING CORRECTION; PRINTED MATERIALS; IMAGES; RECTIFICATION; RESTORATION; IDENTIFICATION;
D O I
10.1016/j.patcog.2015.04.026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new document image dewarping method that removes geometric distortions in camera-captured document images. The proposed method does not directly use the text-line which has been the most widely used feature for the document dewarping. Instead, we use the discrete representation of text-lines and text-blocks which are the sets of connected components. Also, we model the geometric distortions caused by page curl and perspective view as the generalized cylindrical surfaces and camera rotation respectively. With these distortion models and the discrete representation of the features, we design a cost function whose minimization yields the parameters of the distortion model. In the cost function, we encode the properties of the pages such as text-block alignment, line-spacing, and the straightness of text-lines. By describing the text features using the sets of discrete points, the cost function can be easily defined and efficiently solved by Levenberg-Marquadt algorithm. Experiments show that the proposed method works well for the various layouts and curved surfaces, and compares favorably with the conventional methods on the standard dataset. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3600 / 3614
页数:15
相关论文
共 44 条
[1]  
[Anonymous], P 2 INT WORKSH CAM B
[2]  
[Anonymous], 2004, Multiple view geometry in computer vision, DOI DOI 10.1017/CBO9780511811685
[3]  
[Anonymous], 2009, P 3 INT WORKSH CAM B
[4]   Automatic panoramic image stitching using invariant features [J].
Brown, Matthew ;
Lowe, David G. .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 74 (01) :59-73
[5]   Restoring 2D content from distorted documents [J].
Brown, Michael S. ;
Sun, Mingxuan ;
Yang, Ruigang ;
Yun, Lin ;
Seales, W. Brent .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (11) :1904-1916
[6]   Geometric and shading correction for images of printed materials using boundary [J].
Brown, Michael S. ;
Tsoi, Yau-Chat .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (06) :1544-1554
[7]   A cylindrical surface model to rectify the bound document image [J].
Cao, HG ;
Ding, XQ ;
Liu, CS .
NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, 2003, :228-233
[8]   Shape from shading for the digitization of curved documents [J].
Courteille, Frederic ;
Crouzil, Alain ;
Durou, Jean-Denis ;
Gurdjos, Pierre .
MACHINE VISION AND APPLICATIONS, 2007, 18 (05) :301-316
[9]   Towards Mobile Document Image Retrieval for Digital Library [J].
Duan, Ling-Yu ;
Ji, Rongrong ;
Chen, Zhang ;
Huang, Tiejun ;
Gao, Wen .
IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (02) :346-359
[10]   SOME IMPLEMENTATIONS OF THE BOXPLOT [J].
FRIGGE, M ;
HOAGLIN, DC ;
IGLEWICZ, B .
AMERICAN STATISTICIAN, 1989, 43 (01) :50-54