LDRNet: Enabling Real-Time Document Localization on Mobile Devices

被引:0
|
作者
Wu, Han [1 ]
Qian, Holland [2 ]
Wu, Huaming [3 ]
van Moorsel, Aad [4 ]
机构
[1] Newcastle Univ, Newcastle Upon Tyne, Tyne & Wear, England
[2] Tencent, Shenzhen, Peoples R China
[3] Tianjin Univ, Tianjin, Peoples R China
[4] Univ Birmingham, Birmingham, W Midlands, England
来源
MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT I | 2023年 / 1752卷
关键词
Document localization; Real time; Mobile devices;
D O I
10.1007/978-3-031-23618-1_42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modern online services often require mobile devices to convert paper-based information into its digital counterpart, e.g., passport, ownership documents, etc. This process relies on Document Localization (DL) technology to detect the outline of a document within a photograph. In recent years, increased demand for real-time DL in live video has emerged, especially in financial services. However, existing machinelearning approaches to DL cannot be easily applied due to the large size of the underlying models and the associated long inference time. In this paper, we propose a lightweight DL model, LDRNet, to localize documents in real-time video captured on mobile devices. On the basis of a lightweight backbone neural network, we design three prediction branches for LDRNet: (1) corner points prediction; (2) line borders prediction and (3) document classification. To improve the accuracy, we design novel supplementary targets, the equal-division points, and use a new loss function named Line Loss. We compare the performance of LDRNet with other popular approaches on localization for general documents in a number of datasets. The experimental results show that LDRNet takes significantly less inference time, while still achieving comparable accuracy.
引用
收藏
页码:618 / 629
页数:12
相关论文
共 50 条
  • [41] Real-time Face Recognition with SIFT-based Local Feature Points for Mobile Devices
    Park, Sohee
    Yoo, Jang-Hee
    2013 FIRST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, MODELLING AND SIMULATION (AIMS 2013), 2013, : 304 - 308
  • [42] A Study of Users' Intention to Voluntarily Contribute Real-Time Traffic Information through Mobile Devices
    Zhu, Chen
    Wat, Kai Kwong
    Ren, Chao
    Liao, Stephen Shaoyi
    E-LIFE: WEB-ENABLED CONVERGENCE OF COMMERCE, WORK, AND SOCIAL LIFE, 2012, 108 : 421 - 428
  • [43] Linux real-time framework for fusion devices
    Neto, Andre
    Sartori, Filippo
    Piccolo, Fabio
    Barbalace, Antonio
    Vitelli, Riccardo
    Fernandes, Horacio
    FUSION ENGINEERING AND DESIGN, 2009, 84 (7-11) : 1408 - 1411
  • [44] Mobile Modeling with Real-Time Collaboration Support
    Haertwig, Max
    Goetz, Sebastian
    JOURNAL OF OBJECT TECHNOLOGY, 2022, 21 (03):
  • [45] Real-Time Data Prefetching in Mobile Computing
    Issam, Khalloufi
    Omar, El Beqqali
    2015 IEEE/ACS 12TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2015,
  • [46] Enabling Real-time Intelligent Decision Support in Intensive Care
    Portela, Filipe
    Santos, Manuel Filipe
    Gago, Pedro
    Silva, Alvaro
    Rua, Fernando
    Abelha, Antonio
    Machado, Jose
    Neves, Jose
    EUROPEAN SIMULATION AND MODELLING CONFERENCE 2011, 2011, : 419 - +
  • [47] Real-Time Virtual Shared Disk: Enabling multimedia on clusters
    Mukherjee, R
    INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 371 - 378
  • [48] eBiometrics: an enhanced multi-biometrics authentication technique for real-time remote applications on mobile devices
    Kuseler, Torben
    Lami, Ihsan
    Jassim, Sabah
    Sellahewa, Harin
    MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2010, 2010, 7708
  • [49] Energy-Efficient Interactive 360° Video Streaming with Real-Time Gaze Tracking on Mobile Devices
    Shen, Linfeng
    Chen, Yuchi
    Liu, Jiangchuan
    2021 IEEE 18TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2021), 2021, : 243 - 251
  • [50] PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
    Niu, Wei
    Ma, Xiaolong
    Lin, Sheng
    Wang, Shihao
    Qian, Xuehai
    Lin, Xue
    Wang, Yanzhi
    Ren, Bin
    TWENTY-FIFTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXV), 2020, : 907 - 922