Towards improving the convolutional neural networks for deep learning using the distributed artificial bee colony method

被引:32
作者
Banharnsakun, Anan [1 ]
机构
[1] Kasetsart Univ, Fac Engn Sriracha, Comp Engn Dept, Computat Intelligence Res Lab CIRLab, Sriracha Campus, Chon Buri 20230, Thailand
关键词
Deep learning; Convolution neural networks; Distributed artificial bee colony; Pattern recognition; Classification; ALGORITHM; CLASSIFIERS;
D O I
10.1007/s13042-018-0811-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
During the past decade, the dramatic increase in the computational capabilities of chip processing and the lower costs of computing hardware have led to the emergence of deep learning, which refers to a sub-field of machine learning that focuses on learning features extracted from data and classifying them through multiple layers in the hierarchical architectures of neural networks. Using convolution neural networks (CNN) is one of the most promising deep learning methods for dealing with several pattern recognition tasks. However, as with most artificial neural networks, CNNs are susceptible to multiple local optima. Hence, in order to avoid becoming trapped within the local optima, improvement of the CNNs is thus required. The optimization methods based on a metaheuristic are very powerful in solving optimization problems. However, research on the use of metaheuristics to optimize CNNs is rarely conducted. In this work, the artificial bee colony (ABC) method, one of the most popular metaheuristic methods, is proposed as an alternative approach to optimizing the performance of a CNN. In other words, we aim to minimize the classification errors by initializing the weights of the CNN classifier based on solutions generated by the ABC method. Moreover, the distributed ABC is also presented as a method to maintain the amount of time needed to execute the process when working with large training datasets. The results of the experiment demonstrate that the proposed method can improve the performance of the ordinary CNNs in both recognition accuracy and computing time.
引用
收藏
页码:1301 / 1311
页数:11
相关论文
共 39 条
[21]   Human action recognition using genetic algorithms and convolutional neural networks [J].
Ijjina, Earnest Paul ;
Chalavadi, Krishna Mohan .
PATTERN RECOGNITION, 2016, 59 :199-212
[22]   A comprehensive survey: artificial bee colony (ABC) algorithm and applications [J].
Karaboga, Dervis ;
Gorkemli, Beyza ;
Ozturk, Celal ;
Karaboga, Nurhan .
ARTIFICIAL INTELLIGENCE REVIEW, 2014, 42 (01) :21-57
[23]  
Kennedy J, 1995, 1995 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS PROCEEDINGS, VOLS 1-6, P1942, DOI 10.1109/icnn.1995.488968
[24]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[25]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324
[26]  
LeCun Y, 2010, IEEE INT SYMP CIRC S, P253, DOI 10.1109/ISCAS.2010.5537907
[27]   ON THE LIMITED MEMORY BFGS METHOD FOR LARGE-SCALE OPTIMIZATION [J].
LIU, DC ;
NOCEDAL, J .
MATHEMATICAL PROGRAMMING, 1989, 45 (03) :503-528
[28]  
Ng AY, 2002, ADV NEUR IN, V14, P841
[29]  
Radzi SA, 2011, LECT NOTES COMPUT SC, V7066, P45, DOI 10.1007/978-3-642-25191-7_6
[30]  
Ramadhan I, 2016, 2016 4TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT)