A snapshot of image pre-processing for convolutional neural networks: case study of MNIST

被引:51
作者
Tabik, Siham [1 ]
Peralta, Daniel [2 ,3 ]
Herrera-Poyatos, Andres [1 ]
Herrera, Francisco [1 ,4 ]
机构
[1] Univ Granada, Res Grp Soft Comp & Intelligent Informat Syst, E-18071 Granada, Spain
[2] VIB, Data Min & Modelling Biomed Grp, Inflammat Res Ctr, Ghent, Belgium
[3] Univ Ghent, Dept Internal Med, Ghent, Belgium
[4] Univ Granada, Dept Comp Sci & Artificial Intelligence, E-18071 Granada, Spain
关键词
Classification; Deep learning; Convolutional Neural Networks (CNNs); preprocessing; handwritten digits; data augmentation;
D O I
10.2991/ijcis.2017.10.1.38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last five years, deep learning methods and particularly Convolutional Neural Networks (CNNs) have exhibited excellent accuracies in many pattern classification problems. Most of the state-of-the-art models apply data-augmentation techniques at the training stage. This paper provides a brief tutorial on data preprocessing and shows its benefits by using the competitive MNIST handwritten digits classification problem. We show and analyze the impact of different preprocessing techniques on the performance of three CNNs, LeNet, Network3 and DropConnect, together with their ensembles. The analyzed transformations are, centering, elastic deformation, translation, rotation and different combinations of them. Our analysis demonstrates that data-preprocessing techniques, such as the combination of elastic deformation and rotation, together with ensembles have a high potential to further improve the state-of-the-art accuracy in MNIST classification.
引用
收藏
页码:555 / 568
页数:14
相关论文
共 30 条
[1]  
[Anonymous], 2015, Batch-normalized maxout network in network
[2]  
[Anonymous], MNIST DATABASE HANDW
[3]  
[Anonymous], 2015, ARXIV150503229
[4]  
[Anonymous], ABS160502688 ARXIV T
[5]  
Benenson R., Classification datasets results
[6]  
Ciresan D, 2012, PROC CVPR IEEE, P3642, DOI 10.1109/CVPR.2012.6248110
[7]   Multi-column deep neural network for traffic sign classification [J].
Ciresan, Dan ;
Meier, Ueli ;
Masci, Jonathan ;
Schmidhuber, Juergen .
NEURAL NETWORKS, 2012, 32 :333-338
[8]  
Dahl GE, 2013, INT CONF ACOUST SPEE, P8609, DOI 10.1109/ICASSP.2013.6639346
[9]  
Duda R O., 2012, Pattern Classification
[10]  
Garcia S, 2015, INTEL SYST REF LIBR, V72, P1, DOI 10.1007/978-3-319-10247-4