CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

被引:39
作者
Agarwal, Madhav [1 ]
Mondal, Ajoy [1 ]
Jawahar, C., V [1 ]
机构
[1] IIIT, CVIT, Hyderabad, India
来源
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021年
关键词
Page object; table detection; Cascade Mask R-CNN; deformable convolution; single model;
D O I
10.1109/ICPR48806.2021.9411922
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Localizing page elements/objects such as tables, figures, equations, etc. is the primary step in extracting information from document images. We propose a novel endto-end trainable deep network, (epee-Net) for detecting tables present in the documents. The proposed network consists of a multistage extension of Mask R-CNN with a dual backbone having deformable convolution for detecting tables varying in scale with high detection accuracy at higher iou threshold. We empirically evaluate epee-Net on the publicly available benchmark datasets with extensive experiments. Our solution has three important properties: (i) a single trained model CDeC-NeT double dagger that performs well across all the popular benchmark datasets; (ii) we report excellent performances across multiple, including higher, thresholds of iou; (iii) by following the same protocol of the recent papers for each of the benchmarks, we consistently demonstrate the superior quantitative performance. Our code and models are publicly available at https://githuh.com/mdv3101/CDeCNet for enabling reproducibility of the results.
引用
收藏
页码:9491 / 9498
页数:8
相关论文
共 50 条
[1]  
Arif S., 2018, DICTA
[2]  
Cai Z., 2019, IEEE T PAMI
[3]  
Casado-Garc<prime>ia C., 2019, BENEFITS CLOSE DOMAI
[4]  
Cesarini F., 2002, OBJECT RECOGNITION S
[5]  
Chandran S., 1993, ICDAR
[6]  
Chen K., 2019, arXiv:1906.07155
[7]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[8]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[9]  
e Silva A. C., 2009, ICDAR
[10]  
Fan M., 2015, CORR