Neural Network Structure Optimization by Simulated Annealing

被引:19
作者
Kuo, Chun Lin [1 ]
Kuruoglu, Ercan Engin [1 ]
Chan, Wai Kin Victor [1 ]
机构
[1] Tsinghua Berkeley Shenzhen Inst, Shenzhen 518071, Peoples R China
基金
中国国家自然科学基金;
关键词
neural network; pruning; structure optimization; heuristics; simulated annealing; ALGORITHM;
D O I
10.3390/e24030348
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
A critical problem in large neural networks is over parameterization with a large number of weight parameters, which limits their use on edge devices due to prohibitive computational power and memory/storage requirements. To make neural networks more practical on edge devices and real-time industrial applications, they need to be compressed in advance. Since edge devices cannot train or access trained networks when internet resources are scarce, the preloading of smaller networks is essential. Various works in the literature have shown that the redundant branches can be pruned strategically in a fully connected network without sacrificing the performance significantly. However, majority of these methodologies need high computational resources to integrate weight training via the back-propagation algorithm during the process of network compression. In this work, we draw attention to the optimization of the network structure for preserving performance despite compression by pruning aggressively. The structure optimization is performed using the simulated annealing algorithm only, without utilizing back-propagation for branch weight training. Being a heuristic-based, non-convex optimization method, simulated annealing provides a globally near-optimal solution to this NP-hard problem for a given percentage of branch pruning. Our simulation results have shown that simulated annealing can significantly reduce the complexity of a fully connected network while maintaining the performance without the help of back-propagation.
引用
收藏
页数:18
相关论文
共 50 条
[41]   GLOBAL OPTIMIZATION AND SIMULATED ANNEALING [J].
DEKKERS, A ;
AARTS, E .
MATHEMATICAL PROGRAMMING, 1991, 50 (03) :367-393
[42]   Robust optimization with simulated annealing [J].
Bertsimas, Dimitris ;
Nohadani, Omid .
JOURNAL OF GLOBAL OPTIMIZATION, 2010, 48 (02) :323-334
[43]   A Simulated Annealing Accelerated Bayesian Optimization (SAABO) for Real-Time Deep Neural Network Model with Range Constraints [J].
Madasu, Srinath ;
Rangarajan, Keshava P. .
2018 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2018, :626-631
[44]   Hybrid Neural Network And Regression Tree Ensemble Pruned By Simulated Annealing For Virtual Flow Metering Application [J].
AL-Qutami, Tareq Aziz ;
Ibrahim, Rosdiazli ;
Ismail, Idris .
2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (ICSIPA), 2017, :304-309
[45]   Evolving Neural Network Using Hybrid Genetic Algorithm and Simulated Annealing for Rainfall-Runoff Forecasting [J].
Ding, Hong ;
Wu, Jiansheng ;
Li, Xianghui .
ADVANCES IN SWARM INTELLIGENCE, ICSI 2012, PT I, 2012, 7331 :444-451
[46]   Application of neural network based on simulated annealing to classification of remote sensing image [J].
Pang, Xiaoqiong ;
Chen, Lichao ;
Chen, Wenjun .
WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, :2874-2877
[47]   Building Logistics Cost Forecast Based on Improved Simulated Annealing Neural Network [J].
Tian, Jingwen ;
Gao, Meijuan .
ICICTA: 2009 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL III, PROCEEDINGS, 2009, :914-917
[48]   Dynamic balance of a bipedal robot using neural network training with simulated annealing [J].
Angeles-Garcia, Yoqsan ;
Calvo, Hiram ;
Sossa, Humberto ;
Anzueto-Rios, Alvaro .
FRONTIERS IN NEUROROBOTICS, 2022, 16
[49]   Timing-Driven Simulated Annealing for FPGA Placement in Neural Network Realization [J].
Yu, Le ;
Guo, Baojin .
ELECTRONICS, 2023, 12 (17)
[50]   Application of neural network based on the genetic simulated annealing algorithm in failure diagnosis [J].
Hu Yu-lan ;
Fu Wen ;
Li Xue-mei .
Proceedings of 2006 Chinese Control and Decision Conference, 2006, :615-618