Neural Network Structure Optimization by Simulated Annealing

被引:16
作者
Kuo, Chun Lin [1 ]
Kuruoglu, Ercan Engin [1 ]
Chan, Wai Kin Victor [1 ]
机构
[1] Tsinghua Berkeley Shenzhen Inst, Shenzhen 518071, Peoples R China
基金
中国国家自然科学基金;
关键词
neural network; pruning; structure optimization; heuristics; simulated annealing; ALGORITHM;
D O I
10.3390/e24030348
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
A critical problem in large neural networks is over parameterization with a large number of weight parameters, which limits their use on edge devices due to prohibitive computational power and memory/storage requirements. To make neural networks more practical on edge devices and real-time industrial applications, they need to be compressed in advance. Since edge devices cannot train or access trained networks when internet resources are scarce, the preloading of smaller networks is essential. Various works in the literature have shown that the redundant branches can be pruned strategically in a fully connected network without sacrificing the performance significantly. However, majority of these methodologies need high computational resources to integrate weight training via the back-propagation algorithm during the process of network compression. In this work, we draw attention to the optimization of the network structure for preserving performance despite compression by pruning aggressively. The structure optimization is performed using the simulated annealing algorithm only, without utilizing back-propagation for branch weight training. Being a heuristic-based, non-convex optimization method, simulated annealing provides a globally near-optimal solution to this NP-hard problem for a given percentage of branch pruning. Our simulation results have shown that simulated annealing can significantly reduce the complexity of a fully connected network while maintaining the performance without the help of back-propagation.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Bayesian network structure learning combining K2 with simulated annealing
    Hu, Y. (hya507@sina.com), 1600, Southeast University, 2 Sipailou, Nanjing, 210096, China (42): : 82 - 86
  • [22] Optimization of neural networks: A comparative analysis of the genetic algorithm and simulated annealing
    Sexton, RS
    Dorsey, RE
    Johnson, JD
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1999, 114 (03) : 589 - 601
  • [23] SIDE-CHAIN PREDICTION BY NEURAL NETWORKS AND SIMULATED ANNEALING OPTIMIZATION
    HWANG, JK
    LIAO, WF
    PROTEIN ENGINEERING, 1995, 8 (04): : 363 - 370
  • [24] Shape Optimization of Soil-Steel Structure by Simulated Annealing
    Sobotka, Maciej
    Lyszba, Dariusz
    XXIII R-S-P SEMINAR, THEORETICAL FOUNDATION OF CIVIL ENGINEERING (23RSP) (TFOCE 2014), 2014, 91 : 304 - 309
  • [25] Simulated annealing and artificial neural network as optimization tools to enhance yields of bioemulsifier and exopolysaccharides by thermophilic Brevibacillus borstelensis
    Dhagat, Swasti
    Jujjavarapu, Satya Eswari
    JOURNAL OF ENVIRONMENTAL CHEMICAL ENGINEERING, 2021, 9 (04):
  • [26] Sample design optimization for soil mapping using improved artificial neural networks and simulated annealing
    Shao, Shuangshuang
    Su, Baowei
    Zhang, Yalu
    Gao, Chao
    Zhang, Ming
    Zhang, Huan
    Yang, Lin
    GEODERMA, 2022, 413
  • [27] Optimal selection of neural network architecture for GAD using simulated annealing
    Gurcan, MN
    Sahiner, B
    Chan, HP
    Hadjiiski, L
    Petrick, N
    PROCEEDINGS OF THE 22ND ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4, 2000, 22 : 3052 - 3055
  • [28] Chaotic Simulated Annealing by a Neural Network with a Variable Delay: Design and Application
    Chen, Shyan-Shiou
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (10): : 1557 - 1565
  • [29] A hybrid neural network and simulated annealing approach to the unit commitment problem
    Nayak, R
    Sharma, JD
    COMPUTERS & ELECTRICAL ENGINEERING, 2000, 26 (06) : 461 - 477
  • [30] Improved simulated annealing mechanics in transiently chaotic neural network.
    Kang, B
    Li, XY
    Lu, BC
    2004 INTERNATIONAL CONFERENCE ON COMMUNICATION, CIRCUITS, AND SYSTEMS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEMS - VOL 2: SIGNAL PROCESSING, CIRCUITS AND SYSTEMS, 2004, : 1057 - 1060