Optimized Parameter-Efficient Deep Learning Systems via Reversible Jump Simulated Annealing

被引:0
|
作者
Marsh, Peter [1 ]
Kuruoglu, Ercan Engin [1 ]
机构
[1] Univ Town Shenzhen, Tsinghua Berkeley Shenzhen Inst, Tsinghua Shenzhen Int Grad Sch, Shenzhen 518055, Peoples R China
关键词
Simulated annealing; Optimization; Neural networks; Long short term memory; Image recognition; Task analysis; Data models; Deep learning systems; model selection; reversible jump Markov chain Monte Carlo; simulated annealing; NEURAL-NETWORK; ALGORITHM;
D O I
10.1109/JSTSP.2024.3428355
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We utilize the non-convex optimization method simulated annealing enriched with reversible jumps to enable a model selection capacity for deep learning models in a model size aware context. By using simulated annealing enriched with reversible jumps, we can yield a robust stochastic learning of the hidden posterior distribution of the structure, simultaneously constructing a more focused and certain estimate of the structure, all while making use of all the data. Being based upon Markov-chain learning methods, we constructed our priors to favor smaller and simpler architectures, allowing us to converge on the set of globally optimal models that are additionally parameter-efficient, seeking low parameter count deep models that retain good predictive accuracy. We demonstrate the capability on standard image recognition with CIFAR-10, as well as performing model selection on time-series tasks, realizing networks with competitive performance as compared to competing non-convex optimization methods such as genetic algorithms, random search, and Gaussian process based Bayesian optimization, while being less than half the size.
引用
收藏
页码:1010 / 1023
页数:14
相关论文
共 50 条
  • [41] An Efficient Approach for Automatic Complex Fractured Networks Parameter Inversion Based on Surrogate Model and Deep Reinforcement Learning
    Chen, Zhiming
    Dong, Peng
    Li, Dexuan
    WATER RESOURCES RESEARCH, 2022, 58 (12)
  • [42] Prediction model for methanation reaction conditions based on a state transition simulated annealing algorithm optimized extreme learning machine
    Shen, Yadi
    Dong, Yingchao
    Han, Xiaoxia
    Wu, Jinde
    Xue, Kun
    Jin, Meizhu
    Xie, Gang
    Xu, Xinying
    INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2023, 48 (64) : 24560 - 24573
  • [43] ResumeGAN: An Optimized Deep Representation Learning Framework for Talent-Job Fit via Adversarial Learning
    Luo, Yong
    Zhang, Huaizheng
    Wen, Yonggang
    Zhang, Xinwen
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1101 - 1110
  • [44] Efficient Collision Risk Prediction Model for Autonomous Vehicle Using Novel Optimized LSTM Based Deep Learning Framework
    Hema, D. Deva
    Jaison, T. Rajeeth
    INTERNATIONAL JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS RESEARCH, 2024, 22 (02) : 352 - 362
  • [45] PSO-Pelican Arrhythmia Optimize: Revolutionizing Arrhythmia Detection via Automated Deep Learning Parameter Tuning
    Jayanthi
    Devi, Prasanna
    Zakariah, Mohammed
    Almazyad, Abdulaziz S.
    TRAITEMENT DU SIGNAL, 2024, 41 (06) : 3011 - 3026
  • [46] A Physics-Based Hyper Parameter Optimized Federated Multi-Layered Deep Learning Model for Intrusion Detection in IoT Networks
    Chandnani, Chirag Jitendra
    Agarwal, Vedik
    Kulkarni, Shlok Chetan
    Aren, Aditya
    Amali, D. Geraldine Bessie
    Srinivasan, Kathiravan
    IEEE ACCESS, 2025, 13 : 21992 - 22010
  • [47] Towards efficient and effective renewable energy prediction via deep learning
    Khan, Zulfiqar Ahmad
    Hussain, Tanveer
    Ul Haq, Ijaz
    Ullah, Fath U. Min
    Baik, Sung Wook
    ENERGY REPORTS, 2022, 8 : 10230 - 10243
  • [48] Deep Learning Based Fusion Model for Multivariate LTE Traffic Forecasting and Optimized Radio Parameter Estimation
    Nabi, Syed Tauhidun
    Islam, Md. Rashidul
    Alam, Md. Golam Rabiul
    Hassan, Mohammad Mehedi
    AlQahtani, Salman A.
    Aloi, Gianluca
    Fortino, Giancarlo
    IEEE ACCESS, 2023, 11 : 14533 - 14549
  • [49] Towards Efficient Mapless Navigation Using Deep Reinforcement Learning with Parameter Space Noise
    Liu, Xiaoyun
    Zhou, Qingrui
    Wang, Hui
    Yang, Ying
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8833 - 8837
  • [50] An Efficient Multimodal Emotion Identification Using FOX Optimized Double Deep Q-Learning
    Selvi, R.
    Vijayakumaran, C.
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 132 (04) : 2387 - 2406