Literature Review of Deep Network Compression

被引：24

作者：

Alqahtani, Ali ^{[1
,2
]}

Xie, Xianghua ^{[1
]}

Jones, Mark W. ^{[1
]}

机构：

[1] Swansea Univ, Dept Comp Sci, Swansea SA2 8PP, W Glam, Wales

[2] King Khalid Univ, Dept Comp Sci, Abha 62529, Saudi Arabia

来源：

INFORMATICS-BASEL | 2021年 / 8卷 / 04期

关键词：

deep learning; neural networks pruning; model compression; ACCELERATION; PRINCIPLES;

D O I：

10.3390/informatics8040077

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Deep networks often possess a vast number of parameters, and their significant redundancy in parameterization has become a widely-recognized property. This presents significant challenges and restricts many deep learning applications, making the focus on reducing the complexity of models while maintaining their powerful performance. In this paper, we present an overview of popular methods and review recent works on compressing and accelerating deep neural networks. We consider not only pruning methods but also quantization methods, and low-rank factorization methods. This review also intends to clarify these major concepts, and highlights their characteristics, advantages, and shortcomings.

引用

页数：12

共 76 条

[1] Neuron-based Network Pruning Based on Majority Voting [J].