Particle swarm optimization-based automatic parameter selection for deep neural networks and its applications in large-scale and high-dimensional data

被引:54
作者
Ye, Fei [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu, Sichuan, Peoples R China
关键词
LEFT-VENTRICLE; SEGMENTATION; MODEL; ALGORITHM;
D O I
10.1371/journal.pone.0188746
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper, we propose a new automatic hyperparameter selection approach for determining the optimal network configuration (network structure and hyperparameters) for deep neural networks using particle swarm optimization (PSO) in combination with a steepest gradient descent algorithm. In the proposed approach, network configurations were coded as a set of real-number m-dimensional vectors as the individuals of the PSO algorithm in the search procedure. During the search procedure, the PSO algorithm is employed to search for optimal network configurations via the particles moving in a finite search space, and the steepest gradient descent algorithm is used to train the DNN classifier with a few training epochs (to find a local optimal solution) during the population evaluation of PSO. After the optimization scheme, the steepest gradient descent algorithm is performed with more epochs and the final solutions (pbest and gbest) of the PSO algorithm to train a final ensemble model and individual DNN classifiers, respectively. The local search ability of the steepest gradient descent algorithm and the global search capabilities of the PSO algorithm are exploited to determine an optimal solution that is close to the global optimum. We constructed several experiments on hand-written characters and biological activity prediction datasets to show that the DNN classifiers trained by the network configurations expressed by the final solutions of the PSO algorithm, employed to construct an ensemble model and individual classifier, outperform the random approach in terms of the generalization performance. Therefore, the proposed approach can be regarded an alternative tool for automatic network structure and parameter selection for deep neural networks.
引用
收藏
页数:36
相关论文
共 62 条
[1]   A genetic algorithm for shortest path routing problem and the sizing of populations [J].
Ahn, CW ;
Ramakrishna, RS .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2002, 6 (06) :566-579
[2]   Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning [J].
Alipanahi, Babak ;
Delong, Andrew ;
Weirauch, Matthew T. ;
Frey, Brendan J. .
NATURE BIOTECHNOLOGY, 2015, 33 (08) :831-+
[3]   Deep learning for computational biology [J].
Angermueller, Christof ;
Parnamaa, Tanel ;
Parts, Leopold ;
Stegle, Oliver .
MOLECULAR SYSTEMS BIOLOGY, 2016, 12 (07)
[4]  
[Anonymous], HUMAN ACTION RECOGNI
[5]  
[Anonymous], 2013, arXiv
[6]   Lung Pattern Classification for Interstitial Lung Diseases Using a Deep Convolutional Neural Network [J].
Anthimopoulos, Marios ;
Christodoulidis, Stergios ;
Ebner, Lukas ;
Christe, Andreas ;
Mougiakakou, Stavroula .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (05) :1207-1216
[7]   A combined deep-learning and deformable-model approach to fully automatic segmentation of the left ventricle in cardiac MRI [J].
Avendi, M. R. ;
Kheradvar, Arash ;
Jafarkhani, Hamid .
MEDICAL IMAGE ANALYSIS, 2016, 30 :108-119
[8]   A Meta-Framework for Modeling the Human Reading Process in Sentiment Analysis [J].
Baly, Ramy ;
Hobeica, Roula ;
Hajj, Hazem ;
El-Hajj, Wassim ;
Shaban, Khaled Bashir ;
Al-Sallab, Ahmad .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2016, 35 (01)
[9]  
Carneiro G., 2012, SEGMENTATION LEFT VE
[10]   Urinary bladder segmentation in CT urography using deep-learning convolutional neural network and level sets [J].
Cha, Kenny H. ;
Hadjiiski, Lubomir ;
Samala, Ravi K. ;
Chan, Heang-Ping ;
Caoili, Elaine M. ;
Cohan, Richard H. .
MEDICAL PHYSICS, 2016, 43 (04) :1882-1896