The Possibility of Combining and Implementing Deep Neural Network Compression Methods

被引:12
作者
Predic, Bratislav [1 ]
Vukic, Uros [1 ]
Saracevic, Muzafer [2 ]
Karabasevic, Darjan [3 ]
Stanujkic, Dragisa [4 ]
机构
[1] Univ Nis, Fac Elect Engn, Aleksandra Medvedeva 14, Nish 18000, Serbia
[2] Univ Novi Pazar, Dept Comp Sci, Novi Pazar 36300, Serbia
[3] Univ Business Acad Novi Sad, Fac Appl Management Econ & Finance, Jevrejska 24, Belgrade 11000, Serbia
[4] Univ Belgrade, Tech Fac Bor, Vojske Jugoslavije 12, Bor 19210, Serbia
关键词
deep learning; convolutional neural networks; deep neural network model; combining methods; implementation; SYSTEM; IOT;
D O I
10.3390/axioms11050229
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In the paper, the possibility of combining deep neural network (DNN) model compression methods to achieve better compression results was considered. To compare the advantages and disadvantages of each method, all methods were applied to the ResNet18 model for pretraining to the NCT-CRC-HE-100K dataset while using CRC-VAL-HE-7K as the validation dataset. In the proposed method, quantization, pruning, weight clustering, QAT (quantization-aware training), preserve cluster QAT (hereinafter PCQAT), and distillation were performed for the compression of ResNet18. The final evaluation of the obtained models was carried out on a Raspberry Pi 4 device using the validation dataset. The greatest model compression result on the disk was achieved by applying the PCQAT method, whose application led to a reduction in size of the initial model by as much as 45 times, whereas the greatest model acceleration result was achieved via distillation on the MobileNetV2 model. All methods led to the compression of the initial size of the model, with a slight loss in the model accuracy or an increase in the model accuracy in the case of QAT and weight clustering. INT8 quantization and knowledge distillation also led to a significant decrease in the model execution time.
引用
收藏
页数:21
相关论文
共 42 条
  • [21] Kather Jakob Nikolas., 2018, Zenodo
  • [22] Static Analysis of Information Systems for IoT Cyber Security: A Survey of Machine Learning Approaches
    Kotenko, Igor
    Izrailov, Konstantin
    Buinevich, Mikhail
    [J]. SENSORS, 2022, 22 (04)
  • [23] Deep neural network-based application partitioning and scheduling for hospitals and medical enterprises using IoT assisted mobile fog cloud
    Lakhan, Abdullah
    Mastoi, Qurat-Ul-Ain
    Elhoseny, Mohamed
    Memon, Muhammad Suleman
    Mohammed, Mazin Abed
    [J]. ENTERPRISE INFORMATION SYSTEMS, 2022, 16 (07)
  • [24] Li H., 2016, ARXIV
  • [25] ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression
    Luo, Jian-Hao
    Wu, Jianxin
    Lin, Weiyao
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5068 - 5076
  • [26] A METHOD FOR NORMALIZING HISTOLOGY SLIDES FOR QUANTITATIVE ANALYSIS
    Macenko, Marc
    Niethammer, Marc
    Marron, J. S.
    Borland, David
    Woosley, John T.
    Guan, Xiaojun
    Schmitt, Charles
    Thomas, Nancy E.
    [J]. 2009 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, VOLS 1 AND 2, 2009, : 1107 - +
  • [27] IoT based hydroponics system using Deep Neural Networks
    Mehra, Manav
    Saxena, Sameer
    Sankaranarayanan, Suresh
    Tom, Rijo Jackson
    Veeramanikandan, M.
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2018, 155 : 473 - 486
  • [28] Mohammadi F. G., 2022, ARXIV
  • [29] Mohammed C.M., 2021, Int. J. Sci. Bus, V5, P42
  • [30] Murdoch W.J., 2019, P NATL ACAD SCI USA, DOI DOI 10.1073/PNAS.1900654116