Footnote-based Document Image Classification using 1D Convolutional Neural Networks and Histograms

被引:0
作者
Mhiri, Mohamed [1 ]
Abuelwafa, Sherif [1 ]
Desrosiers, Christian [1 ]
Cheriet, Mohamed [1 ]
机构
[1] ETS, Montreal, PQ, Canada
来源
PROCEEDINGS OF THE 2017 SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA 2017) | 2017年
关键词
Footnote detection; Histograms classification; 1D Convolutional neural network;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classifying historical document images is a challenging task due to the high variability of their content and the common presence of degradation in these documents. For scholars, footnotes are essential to analyze and investigate historical documents. In this work, a novel classification method is proposed for detecting and segmenting footnotes from document images. Our proposed method utilizes horizontal histograms of text lines as inputs to a 1D Convolutional Neural Network (CNN). Experiments on a dataset of historical documents show the proposed method to be effective in dealing with the high variability of footnotes, even while using a small training set. Our method yielded an overall F-measure of 5636% and a precision of 89.76 %, outperforming significantly existing approaches for this task.
引用
收藏
页数:5
相关论文
共 14 条
[1]  
Abuelwafa Sherif, 2017, INT C IM AN REC ICIA
[2]  
[Anonymous], 2006, P 9 EUR C COMP VIS 1
[3]  
[Anonymous], 2013, IEEE T PATTERN ANAL
[4]  
[Anonymous], 2005, PUTER VISION IMAGE U, DOI DOI 10.1016/J.CVIU.2007.09.014
[5]  
[Anonymous], 1999, P 7 IEEE INT C COMPU
[6]  
[Anonymous], 2009, FDN TRENDS MACHINE L
[7]  
Bordes Antoine, 2011, INT C ART INT STAT
[8]  
Cheriet M., 2013, IEEE GCC C EXH GCC
[9]  
dos Santos Rodolfo P., 2009, INT C DOC AN REC ICD
[10]  
Grafton Anthony., 1999, The Footnote: A Curious History