Rate-Accuracy Optimization of Deep Convolutional Neural Network Models

Authors
Filini, Alessandro [1]
Ascenso, Joao [2]
Leonardi, Riccardo [1]
Affiliations
[1] Univ Brescia, Dipartimento Ingn Informaz, Brescia, Italy
[2] Inst Super Tecn, Inst Telecomunicacoes, Lisbon, Portugal
DOI
10.1109/ISM.2017.121
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Recently, deep learning has enjoyed a great deal of success in computer vision problems due to its capability to model highly complex tasks, such as image classification, object detection and face recognition, among many others. Although these neural networks are nowadays very powerful, there is a huge number of parameters (i.e. the model) that need to be learned and that require considerable storage space and bandwidth during transmission. This paper addresses the problems of storage and transmission of large deep learning models by proposing a compression solution that is independent of the model being trained as well as of the data used for training. An efficient compression framework is proposed for the parameters of a neural network, more precisely the weights that interconnect the different neurons, which consume a significant amount of resources (memory, storage and bandwidth). Several quantization strategies are considered, as well as statistical models for the different layers of a neural network, which are exploited by an arithmetic coding engine. Experimental results show that up to 92% bitrate savings can be obtained with minimal impact in terms of image classification accuracy.
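The pipeline outlined in the abstract (quantize the weights, model their statistics per layer, entropy-code the result) can be illustrated with a minimal sketch. It is not the authors' method: the uniform quantizer, the 6-bit setting, the synthetic Gaussian layer, the 32-bit float baseline, and all function names are assumptions made for illustration. Instead of running an arithmetic coder, it reports the empirical zeroth-order entropy of the quantization indices, which is the rate an ideal arithmetic coder driven by a static per-layer model would approach.

```python
import math
import random
from collections import Counter


def quantize_uniform(weights, num_bits):
    """Map each weight to an integer index on a uniform grid of 2**num_bits levels.

    Assumption: a plain min/max uniform quantizer; the paper evaluates several
    quantization strategies that are not specified in the abstract.
    """
    lo, hi = min(weights), max(weights)
    levels = 2 ** num_bits
    step = (hi - lo) / (levels - 1) if hi > lo else 1.0
    indices = [round((w - lo) / step) for w in weights]
    return indices, lo, step


def dequantize(indices, lo, step):
    """Reconstruct approximate weights from the quantization indices."""
    return [lo + i * step for i in indices]


def entropy_bits_per_symbol(indices):
    """Empirical zeroth-order entropy of the indices, in bits per weight."""
    counts = Counter(indices)
    n = len(indices)
    return -sum(c / n * math.log2(c / n) for c in counts.values())


# Hypothetical layer: 10,000 weights drawn from a zero-mean Gaussian.
random.seed(0)
layer_weights = [random.gauss(0.0, 0.05) for _ in range(10_000)]

indices, lo, step = quantize_uniform(layer_weights, num_bits=6)
rate = entropy_bits_per_symbol(indices)

# Savings relative to storing each weight as a 32-bit float (assumed baseline).
savings = 1.0 - rate / 32.0

# Worst-case reconstruction error; bounded by half a quantization step.
reconstructed = dequantize(indices, lo, step)
max_err = max(abs(w - r) for w, r in zip(layer_weights, reconstructed))

print(f"rate: {rate:.2f} bits/weight, savings vs float32: {savings:.1%}")
print(f"max reconstruction error: {max_err:.5f} (step = {step:.5f})")
```

Because the weight distribution is peaked around zero, the entropy falls below the nominal 6 bits per index, which is the gain a per-layer statistical model offers over fixed-length coding. The savings printed here depend entirely on the assumed distribution and bit depth and should not be compared with the 92% figure reported in the abstract.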
Pages: 91-98
Number of pages: 8