Table Detection from Slide Images

被引:0
作者
Che, Xiaoyin [1 ]
Yang, Haojin [1 ]
Meinel, Christoph [1 ]
机构
[1] Univ Potsdam, Hasso Plattner Inst, Prof Dr Helmert Str 2-3, D-14482 Potsdam, Germany
来源
IMAGE AND VIDEO TECHNOLOGY, PSIVT 2015 | 2016年 / 9431卷
关键词
Table detection; Slide image; Table structure; DOCUMENTS;
D O I
10.1007/978-3-319-29451-3_60
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a solution to detect tables from slide images. Presentation slides are one type of document with growing importance. But the layout difference between slides and traditional documents makes many existing table detection methods less effective on slides. The proposed solution works with both high-resolution slide images from digital files and low-resolution slide screenshots from videos. By taking OCR (Optical Character Recognition) as initial step, a heuristic analysis on page layout focuses not only on the table structure but also the textual content. The evaluation result shows that the proposed solution achieves an approximate accuracy of 80 %. It is way better than the open-source academic solution Tesseract and also outperforms the commercial software ABBYY FineReader, which is supposed to be one of the best table detection tools.
引用
收藏
页码:762 / 774
页数:13
相关论文
共 21 条
  • [1] Ocropodium: open source OCR for small-scale historical archives
    Blanke, Tobias
    Bryant, Michael
    Hedges, Mark
    [J]. JOURNAL OF INFORMATION SCIENCE, 2012, 38 (01) : 76 - 86
  • [2] Canós JH, 2010, LECT NOTES COMPUT SC, V6273, P453, DOI 10.1007/978-3-642-15464-5_55
  • [3] Chattopadhyay T., 2011, 2011 Proceedings of International Conference on Computational Intelligence and Communication Networks (CICN 2011), P606, DOI 10.1109/CICN.2011.131
  • [4] A Table Detection Method for Multipage PDF Documents via Visual Seperators and Tabular Structures
    Fang, Jing
    Gao, Liangcai
    Bai, Kun
    Qiu, Ruiheng
    Tao, Xin
    Tang, Zhi
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 779 - 783
  • [5] Gatos B, 2005, LECT NOTES COMPUT SC, V3686, P609
  • [6] Table detection in handwritten chemistry documents using conditional random fields
    Ghanmi, Nabil
    Belaid, Abdel
    [J]. 2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 146 - 151
  • [7] ICDAR 2013 Table Competition
    Goebel, Max
    Hassan, Tamir
    Oro, Ermelinda
    Orsi, Giorgio
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1449 - 1453
  • [8] "I'm Ambivalent about It": The Dilemmas of PowerPoint
    Hill, Andrea
    Arford, Tammi
    Lubitow, Amy
    Smollin, Leandra M.
    [J]. TEACHING SOCIOLOGY, 2012, 40 (03) : 242 - 256
  • [9] Learning to Detect Tables in Scanned Document Images using Line Information
    Kasar, T.
    Barlas, P.
    Adam, S.
    Chatelain, C.
    Paquet, T.
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1185 - 1189
  • [10] Table structure recognition based on robust block segmentation
    Kieninger, TG
    [J]. DOCUMENT RECOGNITION V, 1998, 3305 : 22 - 32