Handwritten/Printed Receipt Classification using Attention-Based Convolutional Neural Network

被引:0
作者
Yang, Fan [1 ]
Jin, Lianwen [1 ]
Yang, Weixin [1 ]
Feng, Ziyong [1 ]
Zhang, Shuye [1 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
来源
PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR) | 2016年
关键词
Handwritten/printed receipt classification; convolutional neural network; attention-based approach; region of interest; PRINTED TEXT;
D O I
10.1109/ICFHR.2016.73
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an approach for the classification of handwritten and printed receipts based on a convolutional neural network (CNN). One of the main challenges related to such classification is the diversity of the background interference in the receipt images. To overcome this problem, we propose a new technique named "attention-based CNN" (ABCNN), inspired by the concept of "attention" in visual neuroscience. This approach helps us to focus on the receipt in an image without bounding box annotation. Our experimental results showed that the proposed ABCNN (i) significantly improves the classification accuracy compared to normal CNN (from 95% to 98.25%), and (ii) enables the network to process images directly without object detection, and (iii) it is faster to train and test the network.
引用
收藏
页码:384 / 389
页数:6
相关论文
共 22 条
[1]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[2]  
[Anonymous], P IEEE C COMP VIS PA
[3]  
[Anonymous], 2012, COMPUTER VISION PATT
[4]  
[Anonymous], 1989, NEURAL COMPUTATION
[5]   A backward progression of attentional effects in the ventral stream [J].
Buffalo, Elizabeth A. ;
Fries, Pascal ;
Landman, Rogier ;
Liang, Hualou ;
Desimone, Robert .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (01) :361-365
[6]  
Chatfield K., 2014, ARXIV14053531V2
[7]  
Ciresan D., INT JOINT C ART INT, P1237
[8]   Simultaneous Detection and Segmentation [J].
Hariharan, Bharath ;
Arbelaez, Pablo ;
Girshick, Ross ;
Malik, Jitendra .
COMPUTER VISION - ECCV 2014, PT VII, 2014, 8695 :297-312
[9]  
Jia Y., 2014, P 22 ACM INT C MULT, P675, DOI DOI 10.1145/2647868.2654889
[10]  
Jindal A, 2014, IEEE INT ADV COMPUT, P1028, DOI 10.1109/IAdCC.2014.6779466