Quantized Guided Pruning for Efficient Hardware Implementations of Deep Neural Networks

Cited by: 0
Authors
Hacene, Ghouthi Boukli [1 ,2 ]
Gripon, Vincent [2 ]
Arzel, Matthieu [2 ]
Farrugia, Nicolas [2 ]
Bengio, Yoshua [1 ]
Affiliations
[1] Univ Montreal, MILA, Montreal, PQ, Canada
[2] IMT Atlantique, Lab STICC, Nantes, France
Source
2020 18TH IEEE INTERNATIONAL NEW CIRCUITS AND SYSTEMS CONFERENCE (NEWCAS'20) | 2020
DOI
10.1109/newcas49341.2020.9159769
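Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes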
0808 ; 0809 ;
Abstract
Deep Neural Networks (DNNs) in general, and Convolutional Neural Networks (CNNs) in particular, are state of the art in numerous computer vision tasks such as object classification and detection. However, the large number of parameters they contain leads to high computational complexity and strongly limits their usability in budget-constrained devices such as embedded systems. In this paper, we propose a combination of a pruning technique and a quantization scheme that effectively reduces the complexity and memory usage of the convolutional layers of CNNs by replacing the complex convolutional operation with a low-cost multiplexer. We perform experiments on the CIFAR10, CIFAR100 and SVHN datasets and show that the proposed method achieves almost state-of-the-art accuracy while drastically reducing the computational and memory footprints compared to the baselines. We also propose an efficient hardware architecture, implemented on Field Programmable Gate Arrays (FPGAs), to accelerate inference; it works as a pipeline and accommodates multiple layers operating at the same time. In contrast with most proposed approaches, which have used external memory or software-defined memory controllers, our work is based on algorithmic optimization and a full-hardware design, enabling a direct, on-chip memory implementation of a DNN while staying close to state-of-the-art accuracy.
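The abstract does not spell out the pruning or quantization rules, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes (hypothetically) that each 3x3 kernel is pruned to its single largest-magnitude weight and that this weight is quantized to +1 or -1. Under those assumptions, the multiply-accumulate of a convolution collapses to selecting one shifted copy of the input and optionally negating it, which is the kind of operation a multiplexer can implement. The function names `prune_and_quantize`, `mux_conv` and `dense_conv` are made up for this example.

```python
import numpy as np


def prune_and_quantize(kernel):
    """Assumed rule: keep the largest-magnitude weight, quantize it to +1/-1.

    Returns the (row, col) position of the surviving weight and its sign.
    """
    flat = int(np.argmax(np.abs(kernel)))
    row, col = np.unravel_index(flat, kernel.shape)
    sign = 1 if kernel[row, col] >= 0 else -1
    return (int(row), int(col)), sign


def mux_conv(x, idx, sign, k=3):
    """Apply a one-hot +/-1 kernel of size k x k without any multiplication.

    With zero padding and 'same' output size, the result is a shifted copy
    of x (selected by idx), negated when sign is -1.
    """
    pad = k // 2
    xp = np.pad(x, pad)  # zero padding on every side
    row, col = idx
    h, w = x.shape
    shifted = xp[row:row + h, col:col + w]
    return shifted if sign > 0 else -shifted


def dense_conv(x, kernel):
    """Reference sliding-window cross-correlation with zero padding."""
    k = kernel.shape[0]
    pad = k // 2
    xp = np.pad(x, pad)
    h, w = x.shape
    out = np.zeros((h, w), dtype=float)
    for i in range(h):
        for j in range(w):
            out[i, j] = np.sum(xp[i:i + k, j:j + k] * kernel)
    return out


# Usage: compare the multiplexer-style selection with a dense convolution
# that uses the pruned, quantized kernel.
kernel = np.array([[0.1, -0.2, 0.05],
                   [0.3, -0.9, 0.2],
                   [0.0, 0.4, -0.1]])
x = np.arange(16, dtype=float).reshape(4, 4)

idx, sign = prune_and_quantize(kernel)
one_hot = np.zeros_like(kernel)
one_hot[idx] = sign

same = np.allclose(mux_conv(x, idx, sign), dense_conv(x, one_hot))
```

In this sketch the only per-kernel state left after pruning is a position and a sign, which hints at why such a scheme could shrink both computation and on-chip memory; the actual savings and accuracy depend on the paper's own procedure.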
Pages: 206-209
Page count: 4