Object Detection Based Handwriting Localization

被引:3
作者
Wu, Yuli [1 ]
Hu, Yucheng [2 ]
Miao, Suting [3 ]
机构
[1] Rheinisch Westfalische TH Aachen, Aachen, Germany
[2] Nanjing Normal Univ, Nanjing, Peoples R China
[3] SAP Innovat Ctr Network ICN Nanjing, Nanjing, Peoples R China
来源
DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT II | 2021年 / 12917卷
关键词
Handwriting localization; Object detection; Regional convolutional neural network; Anonymization enhancement;
D O I
10.1007/978-3-030-86159-9_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an object detection based approach to localize handwritten regions from documents, which initially aims to enhance the anonymization during the data transmission. The concatenated fusion of original and preprocessed images containing both printed texts and handwritten notes or signatures are fed into the convolutional neural network, where the bounding boxes are learned to detect the handwriting. Afterwards, the handwritten regions can be processed (e.g. replaced with redacted signatures) to conceal the personally identifiable information (PII). This processing pipeline based on the deep learning network Cascade R-CNN works at 10 fps on a GPU during the inference, which ensures the enhanced anonymization with minimal computational overheads. Furthermore, the impressive generalizability has been empirically showcased: the trained model based on the English-dominant dataset works well on the fictitious unseen invoices, even in Chinese. The proposed approach is also expected to facilitate other tasks such as handwriting recognition and signature verification.
引用
收藏
页码:225 / 239
页数:15
相关论文
共 25 条
  • [1] Cascade R-CNN: High Quality Object Detection and Instance Segmentation
    Cai, Zhaowei
    Vasconcelos, Nuno
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1483 - 1498
  • [3] Chen Kai, 2019, arXiv preprint arXiv:1906.07155
  • [4] USE OF HOUGH TRANSFORMATION TO DETECT LINES AND CURVES IN PICTURES
    DUDA, RO
    HART, PE
    [J]. COMMUNICATIONS OF THE ACM, 1972, 15 (01) : 11 - &
  • [5] The PASCAL Visual Object Classes Challenge: A Retrospective
    Everingham, Mark
    Eslami, S. M. Ali
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) : 98 - 136
  • [6] LocNet: Improving Localization Accuracy for Object Detection
    Gidaris, Spyros
    Komodakis, Nikos
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 789 - 798
  • [7] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
  • [8] Harley AW, 2015, PROC INT CONF DOC, P991, DOI 10.1109/ICDAR.2015.7333910
  • [9] He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
  • [10] Joseph R., 2018, YOLOV3 INCREMENTAL I, DOI DOI 10.48550/ARXIV.1804.02767