Literature Review of Deep Network Compression

被引:19
作者
Alqahtani, Ali [1 ,2 ]
Xie, Xianghua [1 ]
Jones, Mark W. [1 ]
机构
[1] Swansea Univ, Dept Comp Sci, Swansea SA2 8PP, W Glam, Wales
[2] King Khalid Univ, Dept Comp Sci, Abha 62529, Saudi Arabia
来源
INFORMATICS-BASEL | 2021年 / 8卷 / 04期
关键词
deep learning; neural networks pruning; model compression; ACCELERATION; PRINCIPLES;
D O I
10.3390/informatics8040077
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Deep networks often possess a vast number of parameters, and their significant redundancy in parameterization has become a widely-recognized property. This presents significant challenges and restricts many deep learning applications, making the focus on reducing the complexity of models while maintaining their powerful performance. In this paper, we present an overview of popular methods and review recent works on compressing and accelerating deep neural networks. We consider not only pruning methods but also quantization methods, and low-rank factorization methods. This review also intends to clarify these major concepts, and highlights their characteristics, advantages, and shortcomings.
引用
收藏
页数:12
相关论文
共 76 条
  • [1] Neuron-based Network Pruning Based on Majority Voting
    Alqahtani, Ali
    Xie, Xianghua
    Essa, Ehab
    Jones, Mark W.
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3090 - 3097
  • [2] Pruning CNN filters via quantifying the importance of deep visual representations
    Alqahtani, Ali
    Xie, Xianghua
    Jones, Mark W.
    Essa, Ehab
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 208
  • [3] Accelerated implementation of FQSqueezer novel genomic compression method
    Amich, Monica
    De Luca, Pasquale
    Fiscale, Stefano
    [J]. 2020 19TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC 2020), 2020, : 158 - 163
  • [4] Arora S, 2018, PR MACH LEARN RES, V80
  • [5] On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
    Bach, Sebastian
    Binder, Alexander
    Montavon, Gregoire
    Klauschen, Frederick
    Mueller, Klaus-Robert
    Samek, Wojciech
    [J]. PLOS ONE, 2015, 10 (07):
  • [6] Chen WL, 2015, PR MACH LEARN RES, V37, P2285
  • [7] Model Compression and Acceleration for Deep Neural Networks The principles, progress, and challenges
    Cheng, Yu
    Wang, Duo
    Zhou, Pan
    Zhang, Tao
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (01) : 126 - 136
  • [8] Courbariaux M., 2015, CoRR, P3123
  • [9] Courbariaux M., 2016, P INT C LEARN REPR S
  • [10] Denil M, 2013, NIPS