CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

被引：39

作者：

Agarwal, Madhav ^{[1
]}

Mondal, Ajoy ^{[1
]}

Jawahar, C., V ^{[1
]}

机构：

[1] IIIT, CVIT, Hyderabad, India

来源：

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021年

关键词：

Page object; table detection; Cascade Mask R-CNN; deformable convolution; single model;

D O I：

10.1109/ICPR48806.2021.9411922

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Localizing page elements/objects such as tables, figures, equations, etc. is the primary step in extracting information from document images. We propose a novel endto-end trainable deep network, (epee-Net) for detecting tables present in the documents. The proposed network consists of a multistage extension of Mask R-CNN with a dual backbone having deformable convolution for detecting tables varying in scale with high detection accuracy at higher iou threshold. We empirically evaluate epee-Net on the publicly available benchmark datasets with extensive experiments. Our solution has three important properties: (i) a single trained model CDeC-NeT double dagger that performs well across all the popular benchmark datasets; (ii) we report excellent performances across multiple, including higher, thresholds of iou; (iii) by following the same protocol of the recent papers for each of the benchmarks, we consistently demonstrate the superior quantitative performance. Our code and models are publicly available at https://githuh.com/mdv3101/CDeCNet for enabling reproducibility of the results.

引用

页码：9491 / 9498

页数：8

共 50 条

[1]

Arif S., 2018, DICTA

[2]

Cai Z., 2019, IEEE T PAMI

[3]

Casado-Garc<prime>ia C., 2019, BENEFITS CLOSE DOMAI

[4]

Cesarini F., 2002, OBJECT RECOGNITION S

[5]

Chandran S., 1993, ICDAR

[6]

Chen K., 2019, arXiv:1906.07155

[7] Deformable Convolutional Networks [J].

Dai, Jifeng ;

Qi, Haozhi ;

Xiong, Yuwen ;

Li, Yi ;

Zhang, Guodong ;

Hu, Han ;

Wei, Yichen .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773

[8]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[9]

e Silva A. C., 2009, ICDAR

[10]

Fan M., 2015, CORR

← 1 2 3 4 5 →