A comprehensive survey of mostly textual document segmentation algorithms since 2008

Cited by: 68
Authors
Eskenazi, Sebastien [1]
Gomez-Kramer, Petra [1]
Ogier, Jean-Marc [1]
Affiliations
[1] La Rochelle Univ, Lab L3i, Ave Michel Crepeau, F-17042 La Rochelle, France
Keywords
Document; Segmentation; Survey; Evaluation; Trends; Typology; OF-THE-ART; LAYOUT ANALYSIS; LINE SEGMENTATION; PAGE SEGMENTATION; IMAGE-ANALYSIS; HANDWRITTEN; EXTRACTION; MODEL; CRF;
DOI
10.1016/j.patcog.2016.10.023
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In document image analysis, segmentation is the task that identifies the regions of a document. The increasing number of applications of document analysis requires a good knowledge of the available technologies. This survey highlights the variety of the approaches that have been proposed for document image segmentation since 2008. It provides a clear typology of documents and of document image segmentation algorithms. We also discuss the technical limitations of these algorithms, the way they are evaluated and the general trends of the community.
Pages: 1-14
Page count: 14