Image based fruit category classification by 13-layer deep convolutional neural network and data augmentation

被引:222
作者
Zhang, Yu-Dong [1 ,2 ]
Dong, Zhengchao [3 ,4 ,5 ]
Chen, Xianqing [6 ]
Jia, Wenjuan [7 ]
Du, Sidan [8 ]
Muhammad, Khan [9 ]
Wang, Shui-Hua [1 ]
机构
[1] Henan Polytech Univ, Sch Comp Sci & Technol, Jiaozuo 454000, Henan, Peoples R China
[2] Jiangsu Key Lab Adv Mfg Technol, Huaiyin 223003, Jiangsu, Peoples R China
[3] Columbia Univ, Translat Imaging Div, New York, NY 10032 USA
[4] Columbia Univ, MRI Unit, New York, NY 10032 USA
[5] New York State Psychiat Inst & Hosp, New York, NY 10032 USA
[6] Zhejiang Normal Univ, Dept Elect Engn, Coll Engn, Jinhua 321004, Zhejiang, Peoples R China
[7] Nanjing Normal Univ, Sch Comp Sci & Technol, Nanjing 210023, Jiangsu, Peoples R China
[8] Nanjing Univ, Sch Elect Sci & Engn, Nanjing 210046, Jiangsu, Peoples R China
[9] Sejong Univ, Coll Software Convergence, Seoul, South Korea
关键词
Convolutional neural network; Fully connected layer; Softmax; Fruit category identification; ENTROPY;
D O I
10.1007/s11042-017-5243-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fruit category identification is important in factories, supermarkets, and other fields. Current computer vision systems used handcrafted features, and did not get good results. In this study, our team designed a 13-layer convolutional neural network (CNN). Three types of data augmentation method was used: image rotation, Gamma correction, and noise injection. We also compared max pooling with average pooling. The stochastic gradient descent with momentum was used to train the CNN with minibatch size of 128. The overall accuracy of our method is 94.94%, at least 5 percentage points higher than state-of-the-art approaches. We validated this 13-layer is the optimal structure. The GPU can achieve a 177x acceleration on training data, and a 175x acceleration on test data. We observed using data augmentation can increase the overall accuracy. Our method is effective in image-based fruit classification.
引用
收藏
页码:3613 / 3632
页数:20
相关论文
共 43 条
[1]   Convolutional neural networks for vibrational spectroscopic data analysis [J].
Acquarelli, Jacopo ;
van Laarhoven, Twan ;
Gerretzen, Jan ;
Tran, Thanh N. ;
Buydens, Lutgarde M. C. ;
Marchiori, Elena .
ANALYTICA CHIMICA ACTA, 2017, 954 :22-31
[2]   Classification of E-Nose Aroma Data of Four Fruit Types by ABC-Based Neural Network [J].
Adak, M. Fatih ;
Yumusak, Nejat .
SENSORS, 2016, 16 (03)
[3]   Efficient object-based surveillance image search using spatial pooling of convolutional features [J].
Ahmad, Jamil ;
Mehmood, Irfan ;
Baik, Sung Wook .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 45 :62-76
[4]  
[Anonymous], 2014, J FOOD ENG, DOI DOI 10.1016/j.jfoodeng.2014.07.001
[5]  
[Anonymous], 2016, IEEE ACM T NETWORK, DOI DOI 10.1109/TNET.2015.2425146
[6]  
[Anonymous], EXP SYST, DOI DOI 10.1111/EXSY.12146
[7]  
[Anonymous], 2016, BMC PLANT BIOL
[8]  
[Anonymous], 2017, ADV SOC SCI EDUC HUM
[9]  
[Anonymous], 2017, 5 INT C IND APPL ENG
[10]   Text/non-text image classification in the wild with convolutional neural networks [J].
Bai, Xiang ;
Shi, Baoguang ;
Zhang, Chengquan ;
Cai, Xuan ;
Qi, Li .
PATTERN RECOGNITION, 2017, 66 :437-446