A Pre-Training Strategy for Convolutional Neural Network Applied to Chinese Digital Gesture Recognition

被引:0
作者
Li, Yawei [1 ]
Yang, Yuliang [1 ]
Chen, Yueyun [1 ]
Zhu, Mengyu [2 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China
[2] Beijing Inst Technol, Sch Life Sci & Technol, Beijing, Peoples R China
来源
PROCEEDINGS OF 2016 8TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2016) | 2016年
关键词
convolutional neural network; Chinese digital gesture recognition; principal component analysis; convolution kernels;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present an approach to classify Chinese digital gesture based on convolutional neural network (CNN). Principal Component Analysis (PCA) is employed to learn convolution kernels as the pre-training strategy. The learned convolution kernels are used for extracting features instead of the random convolution kernels. The convolutional layers can be directly implemented without any further training, such as Back Propagation (BP). For better understanding, we name the proposed architecture for PCA-based Convolutional Neural Network (PCNN). The dataset is divided into six gesture classes including 14500 gesture images, with 12000 images for training and 2500 images for testing. We examine the robustness of the PCNN against noises and distortions. In addition, the MNIST database of handwritten digits is employed to assess the suitability of the PCNN. Different from the CNN, the PCNN reduces the high computational cost of convolution kernels training. About one-fifth of the training time is shortened. The result shows that our approach classifies six gesture classes with 99.92% accuracy. Multiple experiments manifest the PCNN serving as an efficient approach for image processing and object recognition.
引用
收藏
页码:620 / 624
页数:5
相关论文
共 12 条
  • [1] [Anonymous], 2014, PROC INT C LEARN REP
  • [2] Bluche T, 2013, INT CONF ACOUST SPEE, P2390, DOI 10.1109/ICASSP.2013.6638083
  • [3] PCANet: A Simple Deep Learning Baseline for Image Classification?
    Chan, Tsung-Han
    Jia, Kui
    Gao, Shenghua
    Lu, Jiwen
    Zeng, Zinan
    Ma, Yi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) : 5017 - 5032
  • [4] Deng-Yuan Huang, 2009, Proceedings of the 2009 Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing. IIH-MSP 2009, P1, DOI 10.1109/IIH-MSP.2009.96
  • [5] Duffner S, 2014, INT CONF ACOUST SPEE, DOI 10.1109/ICASSP.2014.6854641
  • [6] Francke H, 2007, LECT NOTES COMPUT SC, V4872, P533
  • [7] A trainable feature extractor for handwritten digit recognition
    Lauer, Fabien
    Suen, Ching Y.
    Bloch, Gerard
    [J]. PATTERN RECOGNITION, 2007, 40 (06) : 1816 - 1824
  • [8] Gradient-based learning applied to document recognition
    Lecun, Y
    Bottou, L
    Bengio, Y
    Haffner, P
    [J]. PROCEEDINGS OF THE IEEE, 1998, 86 (11) : 2278 - 2324
  • [9] Liu WH, 2014, 2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), P480, DOI 10.1109/ChinaSIP.2014.6889289
  • [10] A multi-scale approach to gesture detection and recognition
    Neverova, Natalia
    Wolf, Christian
    Paci, Giulio
    Sommavilla, Giacomo
    Taylor, Graham W.
    Nebout, Florian
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 484 - 491