Robust Document Image Dewarping Method using Text-lines and Line Segments

Cited by: 25
Authors
Kil, Taeho [1, 2]
Seo, Wonkyo [1, 2]
Koo, Hyung Il [3]
Cho, Nam Ik [1, 2]
Affiliations
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul, South Korea
[2] Seoul Natl Univ, INMC, Seoul, South Korea
[3] Ajou Univ, Dept Elect & Comp Engn, Suwon, South Korea
Source
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017
Keywords
ALGORITHM;
DOI
10.1109/ICDAR.2017.146
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Conventional text-line based document dewarping methods struggle with complex layouts and with images that contain very few text-lines. Few aligned text-lines usually means that photos, graphics, and/or tables take up a large portion of the input. Hence, for robust document dewarping, we propose to use line segments in the image in addition to the aligned text-lines. Based on the assumption and observation that many line segments are horizontally or vertically aligned in well-rectified images, we encode this property into the cost function in addition to the text-line alignment cost. By minimizing the function, we obtain transformation parameters for camera pose, page curve, etc., which are used for document rectification. Because line segment directions contain many outliers and some text-lines may be missed, the overall algorithm is designed in an iterative manner. At each step, we remove text components and line segments that are not well aligned, and then minimize the cost function with the updated information. Experimental results show that the proposed method is robust to a variety of page layouts.
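The iterative scheme described in the abstract (minimize an alignment cost, discard poorly aligned items, minimize again) can be illustrated with a deliberately reduced toy problem. The sketch below is not the authors' method: it assumes the only unknown is a single in-plane rotation angle, whereas the paper optimizes camera pose and page-curve parameters and also includes a text-line term. The cost `sin^2(2 * (angle - theta))`, the grid search, the threshold value, and the names `alignment_cost` and `estimate_rotation` are all assumptions made for illustration.

```python
import numpy as np


def alignment_cost(theta, angles):
    """Toy cost: zero when every segment, rotated by -theta, is horizontal or vertical."""
    return np.sum(np.sin(2.0 * (angles - theta)) ** 2)


def estimate_rotation(angles, n_iters=5, inlier_thresh=0.2):
    """Alternate between minimizing the cost and dropping misaligned segments.

    angles: segment directions in radians (hypothetical input format).
    Returns the estimated rotation (radians) and a boolean inlier mask.
    """
    angles = np.asarray(angles, dtype=float)
    keep = np.ones(angles.shape, dtype=bool)
    # Horizontal/vertical alignment is 90-degree periodic, so search [-45, 45) degrees.
    candidates = np.deg2rad(np.arange(-45.0, 45.0, 0.1))
    theta = 0.0
    for _ in range(n_iters):
        # Step 1: minimize the cost using only the segments currently kept.
        costs = [alignment_cost(t, angles[keep]) for t in candidates]
        theta = candidates[int(np.argmin(costs))]
        # Step 2: re-evaluate every segment and keep those that are well aligned.
        residual = np.abs(np.sin(2.0 * (angles - theta)))
        new_keep = residual < inlier_thresh
        if not new_keep.any() or np.array_equal(new_keep, keep):
            break  # nothing left to use, or the inlier set has stabilized
        keep = new_keep
    return theta, keep
```

As a usage example under the same assumptions, passing eight segment directions clustered around 10 and 100 degrees together with three stray directions (40, 62, and 75 degrees) should yield an estimate close to 10 degrees, with the three stray segments marked as outliers after the first pass.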
Pages: 865-870
Number of pages: 6
Related papers
28 in total
[1]   Rectification of planar targets using line segments [J].
An, Jaehyun ;
Koo, Hyung Il ;
Cho, Nam Ik .
MACHINE VISION AND APPLICATIONS, 2017, 28 (1-2) :91-100
[2]  
[Anonymous], 2017, IEEE T PATTERN ANAL
[3]  
[Anonymous], 1965, Problems Inf. Transm
[4]   Restoring 2D content from distorted documents [J].
Brown, Michael S. ;
Sun, Mingxuan ;
Yang, Ruigang ;
Yun, Lin ;
Seales, W. Brent .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (11) :1904-1916
[5]  
BUKHARI S.S., 2009, P 3 INT WORKSHOP CAM, P34
[6]   Shape from shading for the digitization of curved documents [J].
Courteille, Frederic ;
Crouzil, Alain ;
Durou, Jean-Denis ;
Gurdjos, Pierre .
MACHINE VISION AND APPLICATIONS, 2007, 18 (05) :301-316
[7]   Towards Mobile Document Image Retrieval for Digital Library [J].
Duan, Ling-Yu ;
Ji, Rongrong ;
Chen, Zhang ;
Huang, Tiejun ;
Gao, Wen .
IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (02) :346-359
[8]  
Fu B, 2007, PROC 2 INT WORKSHOP, P63
[9]  
Gatos B, 2007, PROC INT CONF DOC, P989
[10]   Document dewarping via text-line based optimization [J].
Kim, Beom Su ;
Koo, Hyung Il ;
Cho, Nam Ik .
PATTERN RECOGNITION, 2015, 48 (11) :3600-3614