Saving Memory Space in Deep Neural Networks by Recomputing: A Survey

被引:0
|
作者
Ulidowski, Irek [1 ,2 ]
机构
[1] Univ Leicester, Sch Comp & Math Sci, Leicester, Leics, England
[2] AGH Univ Sci & Technol, Dept Appl Informat, Krakow, Poland
来源
REVERSIBLE COMPUTATION, RC 2023 | 2023年 / 13960卷
关键词
Deep Neural Networks; recomputing activations;
D O I
10.1007/978-3-031-38100-3_7
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Training a multilayered neural network involves execution of the network on the training data, followed by calculating the error between the predicted and actual output, and then performing backpropagation to update the network's weights in order to minimise the overall error. This process is repeated many times, with the network updating its weights until it produces the desired output with a satisfactory level of accuracy. It requires storage in memory of activation and gradient data for each layer during each training run of the network. This paper surveys the main approaches to recomputing the needed activation and gradient data instead of storing it in memory. We discuss how these approaches relate to reversible computation techniques.
引用
收藏
页码:89 / 105
页数:17
相关论文
共 50 条
  • [11] Facial Age Estimation using Deep neural networks: A Survey
    Badr, Marwa Mahmoud
    Sarhan, Amany Mahmoud
    Elbasiony, Reda M.
    2019 15TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO 2019), 2019, : 183 - 191
  • [12] A Survey of Sparse-learning Methods for Deep Neural Networks
    Ma, Rongrong
    Niu, Lingfeng
    2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 647 - 650
  • [13] Approximations with deep neural networks in Sobolev time-space
    Abdeljawad, Ahmed
    Grohs, Philipp
    ANALYSIS AND APPLICATIONS, 2022, 20 (03) : 499 - 541
  • [14] Automated Design of Deep Neural Networks: A Survey and Unified Taxonomy
    Talbi, El-Ghazali
    ACM COMPUTING SURVEYS, 2021, 54 (02)
  • [15] Vesti: An In-Memory Computing Processor for Deep Neural Networks Acceleration
    Jiang, Zhewei
    Yin, Shihui
    Kim, Minkyu
    Gupta, Tushar
    Seok, Mingoo
    Seo, Jae-sun
    CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 1516 - 1521
  • [16] A Stochastic Modified Limited Memory BFGS for Training Deep Neural Networks
    Yousefi, Mahsa
    Calomardo, Angeles Martinez
    INTELLIGENT COMPUTING, VOL 2, 2022, 507 : 9 - 28
  • [17] Discriminative feature-space transforms using deep neural networks
    Saon, George
    Kingsbury, Brian
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 14 - 17
  • [18] UAV sensor data applications with deep neural networks: A comprehensive survey
    Dudukcu, Hatice Vildan
    Taskiran, Murat
    Kahraman, Nihan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [19] SAVING LIVES FROM ABOVE: PERSON DETECTION IN DISASTER RESPONSE USING DEEP NEURAL NETWORKS
    Bahmanyar, Reza
    Merkle, Nina
    GEOSPATIAL WEEK 2023, VOL. 10-1, 2023, : 343 - 350
  • [20] A Survey on Attacks and Their Countermeasures in Deep Learning: Applications in Deep Neural Networks, Federated, Transfer, and Deep Reinforcement Learning
    Ali, Haider
    Chen, Dian
    Harrington, Matthew
    Salazar, Nathaniel
    Al Ameedi, Mohannad
    Khan, Ahmad Faraz
    Butt, Ali R.
    Cho, Jin-Hee
    IEEE ACCESS, 2023, 11 : 120095 - 120130