Learning Document Image Features With SqueezeNet Convolutional Neural Network

被引:16
作者
Hassanpour, M. [1 ]
Malek, H. [1 ]
机构
[1] Shahid Beheshti Univ, Dept Comp Sci Engn, Tehran, Iran
来源
INTERNATIONAL JOURNAL OF ENGINEERING | 2020年 / 33卷 / 07期
关键词
Squeezenet; Convolutional Neural; Network; Document Image; Classification;
D O I
10.5829/ije.2020.33.07a.05
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The classification of various document image classes is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for training, and their very large number of weights. Previous successful attempts at learning document image features have been based on training very large CNNs. SqueezeNet is a CNN architecture that achieves accuracies comparable to other state of the art CNNs while containing up to 50 times less weights, but never before experimented on document image classification tasks. In this research we have taken a novel approach towards learning these document image features by training on a very small CNN network such as SqueezeNet. We show that an ImageNet pretrained SqueezeNet achieves an accuracy of approximately 75 percent over 10 classes on the Tobacco-3482 dataset, which is comparable to other state of the art CNN. We then visualize saliency maps of the gradient of our trained SqueezeNet's output to input, which shows that the network is able to learn meaningful features that are useful for document classification. Previous works in this field have made no emphasis on visualizing the learned document features. The importance of features such as the existence of handwritten text, document titles, text alignment and tabular structures in the extracted saliency maps, proves that the network does not overfit to redundant representations of the rather small Tobacco-3482 dataset, which contains only 3482 document images over 10 classes.
引用
收藏
页码:1201 / 1207
页数:7
相关论文
共 50 条
  • [41] Dynamic Learning Convolutional Network with Skip Layers for Image Segmentation
    Lyu, Chengzhi
    Hu, Guoqing
    Wang, Dan
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 2466 - 2470
  • [42] IoT individual privacy features analysis based on convolutional neural network
    Meng Xi
    Nie Lingyu
    Song Jiapeng
    COGNITIVE SYSTEMS RESEARCH, 2019, 57 : 126 - 130
  • [43] Facial Expression Recognition Using Salient Features and Convolutional Neural Network
    Uddin, Md. Zia
    Khaksar, Weria
    Torresen, Jim
    IEEE ACCESS, 2017, 5 : 26146 - 26161
  • [44] Adaptive learning cost-sensitive convolutional neural network
    Hou, Yun
    Fan, Hong
    Li, Li
    Li, Bailin
    IET COMPUTER VISION, 2021, 15 (05) : 346 - 355
  • [45] POLSAR IMAGE CLASSIFICATION VIA COMPLEX-VALUED CONVOLUTIONAL NEURAL NETWORK COMBINING MEASURED DATA AND ARTIFICIAL FEATURES
    Qin, Xianxiang
    Hu, Tao
    Zou, Huanxin
    Yu, Wangsheng
    Wang, Peng
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 3209 - 3212
  • [46] A robust document image watermarking scheme using deep neural network
    Ge, Sulong
    Xia, Zhihua
    Fei, Jianwei
    Tong, Yao
    Weng, Jian
    Li, Ming
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (25) : 38589 - 38612
  • [47] A generalized framework of feature learning enhanced convolutional neural network for pathology-image-oriented cancer diagnosis
    Li, Han
    Wu, Peishu
    Wang, Zidong
    Mao, Jingfeng
    Alsaadi, Fuad E.
    Zeng, Nianyin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [48] Analysis and Recognition of Clinical Features of Diabetes Based on Convolutional Neural Network
    Wang, Rui
    Li, Ping
    Yang, Zhengfei
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
  • [49] Image Based ECG Signal Classification Using Convolutional Neural Network
    Hadiyoso, Sugondo
    Fahrozi, Farrel
    Hariyani, Yuli Sun
    Sulistyo, Mahmud Dwi
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2022, 18 (04) : 64 - 78
  • [50] Sequence to Image Transform Based Convolutional Neural Network for Load Forecasting
    Imani, Maryam
    Ghassemian, Hassan
    2019 27TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2019), 2019, : 1362 - 1366