Handling dropout probability estimation in convolution neural networks using meta-heuristics

被引:33
作者
de Rosa, Gustavo H. [1 ]
Papa, Joao P. [1 ]
Yang, Xin-S [2 ]
机构
[1] Sao Paulo State Univ, Dept Comp, BR-17033360 Bauru, SP, Brazil
[2] Middlesex Univ, Sch Sci & Technol, London NW4 4BT, England
基金
巴西圣保罗研究基金会;
关键词
Convolutional neural networks; Dropout; Meta-heuristic optimization; RECOGNITION;
D O I
10.1007/s00500-017-2678-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning-based approaches have been paramount in recent years, mainly due to their outstanding results in several application domains, ranging from face and object recognition to handwritten digit identification. Convolutional neural networks (CNNs) have attracted a considerable attention since they model the intrinsic and complex brain working mechanisms. However, one main shortcoming of such models concerns their overfitting problem, which prevents the network from predicting unseen data effectively. In this paper, we address this problem by means of properly selecting a regularization parameter known as dropout in the context of CNNs using meta-heuristic-driven techniques. As far as we know, this is the first attempt to tackle this issue using this methodology. Additionally, we also take into account a default dropout parameter and a dropout-less CNN for comparison purposes. The results revealed that optimizing dropout-based CNNs is worthwhile, mainly due to the easiness in finding suitable dropout probability values, without needing to set new parameters empirically.
引用
收藏
页码:6147 / 6156
页数:10
相关论文
共 32 条
[1]  
[Anonymous], 2010, P ADV NEUR INF PROC
[2]  
[Anonymous], 2015, P GENETIC EVOLUTIONA
[3]   Deep Machine Learning-A New Frontier in Artificial Intelligence Research [J].
Arel, Itamar ;
Rose, Derek C. ;
Karnowski, Thomas P. .
IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2010, 5 (04) :13-18
[4]  
Bishop C.M., 1995, Neural networks for pattern recognition
[5]  
Collobert R., 2008, P 25 ICML, P160, DOI [DOI 10.1145/1390156.1390177, 10.1145/1390156.1390177]
[6]  
Cox D., 2011, Proceedings 2011 IEEE International Conference on Automatic Face & Gesture Recognition (FG 2011), P8, DOI 10.1109/FG.2011.5771385
[7]  
Dahl GE, 2013, INT CONF ACOUST SPEE, P8609, DOI 10.1109/ICASSP.2013.6639346
[8]  
Eberhart R.C., 2001, Swarm Intelligence
[9]   Learning Hierarchical Features for Scene Labeling [J].
Farabet, Clement ;
Couprie, Camille ;
Najman, Laurent ;
LeCun, Yann .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1915-1929
[10]   NEOCOGNITRON - A NEW ALGORITHM FOR PATTERN-RECOGNITION TOLERANT OF DEFORMATIONS AND SHIFTS IN POSITION [J].
FUKUSHIMA, K ;
MIYAKE, S .
PATTERN RECOGNITION, 1982, 15 (06) :455-469