ERCS: An Efficient and Robust Card Recognition System for Camera-Based Image

被引:0
作者
Ou, Zhonghong [1 ]
Xiong, Baiqiao [1 ]
Xiao, Fenrui [1 ]
Song, Meina [1 ]
机构
[1] Beijing Univ Posts & Telecommun, 10 Xitucheng Rd, Beijing 100876, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
card localization; card recognition; optical character recognition; CNN; SCENE TEXT DETECTION; BUSINESS; LOCALIZATION;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Cards Recognition Systems, (CRSs) are representative computer vision-based applications. They have a broad range of usage scenarios. For example, they can be used to recognize images containing business cards, personal identification cards, and bank cards etc. Even though CRSs have been studied for many years, it is still difficult to recognize cards in camera-based images taken by ordinary devices, e.g., mobile phones. Diversity of viewpoints and complex backgrounds in the images make the recognition task challenging. Existing systems employing traditional image processing schemes are not robust to varied environment, and are inefficient in dealing with natural images, e.g., taken by mobile phones. To tackle the problem, we propose a novel framework for card recognition by employing a Convolutional Neutral Network (CNN) based approach. The system localizes the foreground of the image by utilizing a Fully Convolutional Network (FCN). With the help of the foreground map, the system localizes the corners of the card region and employs perspective transformation to alleviate the effects from distortion. Text lines in the card region are detected and recognized by utilizing CNN and Long Short Term Memory, (LSTM).To evaluate the proposed scheme, we collect a large dataset which contains 4,065 images in a variety of shooting scenarios. Experimental results demonstrate the efficacy of the proposed scheme. Specifically, it is able to achieve an accuracy of 90.62% in the end-toend test ? outperforming the state-of-the-art.
引用
收藏
页码:247 / 264
页数:18
相关论文
共 50 条
  • [1] [Anonymous], 2005, P CAMERA BASED DOCUM
  • [2] [Anonymous], 2007, P 2 INT WORKSH CAM B
  • [3] Large-Scale Machine Learning with Stochastic Gradient Descent
    Bottou, Leon
    [J]. COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 177 - 186
  • [4] Burie JC, 2015, PROC INT CONF DOC, P1161, DOI 10.1109/ICDAR.2015.7333943
  • [5] Chiou YH, 1997, PROC INT CONF DOC, P1028, DOI 10.1109/ICDAR.1997.620665
  • [6] Rectifying perspective views of text in 3D scenes using vanishing points
    Clark, P
    Mirmehdi, M
    [J]. PATTERN RECOGNITION, 2003, 36 (11) : 2673 - 2686
  • [7] Text Detection and Character Recognition in Scene Images with Unsupervised Feature Learning
    Coates, Adam
    Carpenter, Blake
    Case, Carl
    Satheesh, Sanjeev
    Suresh, Bipin
    Wang, Tao
    Wu, David J.
    Ng, Andrew Y.
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 440 - 445
  • [8] Dangiwa BA, 2018, 2018 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INFORMATION SECURITY (ICSPIS), P93
  • [9] Deng D, 2018, AAAI CONF ARTIF INTE, P6773
  • [10] Elagouni K., 2012, Proceedings of the 10th IAPR International Workshop on Document Analysis Systems (DAS 2012), P120, DOI 10.1109/DAS.2012.26