Rate-Accuracy Optimization of Deep Convolutional Neural Network Models

被引:0
|
作者
Filini, Alessandro [1 ]
Ascenso, Joao [2 ]
Leonardi, Riccardo [1 ]
机构
[1] Univ Brescia, Dipartimento Ingn Informaz, Brescia, Italy
[2] Inst Super Tecn, Inst Telecomunicacoes, Lisbon, Portugal
关键词
D O I
10.1109/ISM.2017.121
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, deep learning has enjoyed a great deal of success for computer vision problems due to its capability to model highly complex tasks, such as image classification, object detection, face recognition, among many others. Although these neural networks are nowadays very powerful, there is a huge amount of parameters (i.e. the model) that need to be learned and require considerable storage space and bandwidth during transmission. This paper addresses the problems of storage and transmission of large deep learning models by proposing a compression solution that is independent of the model being trained as well as the data used for training. An efficient compression framework for the parameters of a neural network, more precisely the weights that interconnect. the different neurons, which consume a significant amount of resources (memory, storage and bandwidth) is proposed. Several quantization strategies are considered as well as a statistical models 14 the different layers of a neural network, which are exploited by an arithmetic coding engine. Experimental results show that up to 92% bitrate savings can he obtained with minimal impact in terms of image classification accuracy.
引用
收藏
页码:91 / 98
页数:8
相关论文
共 50 条
  • [21] FP-DCNN: a parallel optimization algorithm for deep convolutional neural network
    Ye Le
    Y. A. Nanehkaran
    Deborah Simon Mwakapesa
    Ruipeng Zhang
    Jianbing Yi
    Yimin Mao
    The Journal of Supercomputing, 2022, 78 : 3791 - 3813
  • [22] Hybrid optimization assisted deep convolutional neural network for hardening prediction in steel
    Li, Changhong
    Yin, Chenbo
    Xu, Xingtian
    JOURNAL OF KING SAUD UNIVERSITY SCIENCE, 2021, 33 (06)
  • [23] DeepFrag: a deep convolutional neural network for fragment-based lead optimization
    Green, Harrison
    Koes, David R.
    Durrant, Jacob D.
    CHEMICAL SCIENCE, 2021, 12 (23) : 8036 - 8047
  • [24] FP-DCNN: a parallel optimization algorithm for deep convolutional neural network
    Le, Ye
    Nanehkaran, Y. A.
    Mwakapesa, Deborah Simon
    Zhang, Ruipeng
    Yi, Jianbing
    Mao, Yimin
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (03): : 3791 - 3813
  • [25] Memory Bandwidth and Energy Efficiency Optimization of Deep Convolutional Neural Network Accelerators
    Nie, Zikai
    Li, Zhisheng
    Wang, Lei
    Guo, Shasha
    Dou, Qiang
    ADVANCED COMPUTER ARCHITECTURE, 2018, 908 : 15 - 29
  • [26] Squirrel Search Optimization with Deep Convolutional Neural Network for Human Pose Estimation
    Ishwarya, K.
    Nithya, A. Alice
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (03): : 6081 - 6099
  • [27] Detection of plant leaf diseases using deep convolutional neural network models
    Singla, Puja
    Kalavakonda, Vijaya
    Senthil, Ramalingam
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (24) : 64533 - 64549
  • [28] Transferring deep convolutional neural network models for generalization mapping of autumn crops
    Zhang F.
    Zhang J.
    Duan Y.
    Yang Z.
    National Remote Sensing Bulletin, 2024, 28 (03) : 661 - 676
  • [29] Structural Damage Identification Using Ensemble Deep Convolutional Neural Network Models
    Barkhordari, Mohammad Sadegh
    Armaghani, Danial Jahed
    Asteris, Panagiotis G.
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 134 (02): : 835 - 855
  • [30] Brain Stroke Detection Using Convolutional Neural Network and Deep Learning Models
    Gaidhani, Bhagyashree Rajendra
    Rajamenakshi, R.
    Sonavane, Samadhan
    2019 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT COMMUNICATION AND COMPUTATIONAL TECHNIQUES (ICCT), 2019, : 242 - 249