aHCQ: Adaptive Hierarchical Clustering Based Quantization Framework for Deep Neural Networks

Authors
Hu, Jiaxin [1]
Rao, Weixiong [1]
Zhao, Qinpei [1]
Affiliation
[1] Tongji Univ, Sch Software Engn, Shanghai, Peoples R China
Source
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT II | 2021 / Vol. 12713
Funding
National Natural Science Foundation of China; Natural Science Foundation of Shanghai;
Keywords
Deep neural network; Hierarchical clustering; Network quantization; Compression rate;
DOI
10.1007/978-3-030-75765-6_17
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
For deep neural networks (DNNs), high model accuracy is usually the main focus. However, millions of model parameters commonly lead to high space overhead, especially through parameter redundancy. By storing network weights with smaller bit-widths, network quantization has been used to compress DNNs and lower space costs. However, existing quantization methods cannot optimally balance model size against accuracy, so they suffer some degree of accuracy loss. Moreover, although a few existing quantization techniques can adaptively determine per-layer quantization bit-widths, they either give little consideration to the relations between different DNN layers or are designed for special hardware environments that are not universal across computing fields. To overcome these issues, we propose an adaptive Hierarchical Clustering based Quantization (aHCQ) framework. aHCQ can find a highly compressed model through the quantization of each layer while incurring only a small loss in model accuracy. Experiments show that aHCQ can achieve 11.4x and 8.2x model compression rates with only around a 0.5% drop in model accuracy.
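The abstract does not spell out the aHCQ procedure, so the following is only a generic sketch of clustering-based weight quantization, not the authors' method. It assumes a single layer with a fixed bit-width and bottom-up (agglomerative) merging of one-dimensional weights by centroid distance. The adaptive, per-layer bit-width selection described in the abstract is not modeled. All function names are hypothetical.

```python
import random


def hierarchical_codebook(weights, bits):
    """Merge sorted 1-D weights bottom-up until 2**bits clusters remain.

    Returns the sorted list of cluster centroids (the codebook).
    Assumption: centroid linkage; the paper may use a different criterion.
    """
    k = 2 ** bits
    # Each cluster is stored as (sum of members, member count).
    clusters = [(w, 1) for w in sorted(weights)]
    while len(clusters) > k:
        # In one dimension the closest pair of centroids is always adjacent.
        gaps = [
            clusters[i + 1][0] / clusters[i + 1][1] - clusters[i][0] / clusters[i][1]
            for i in range(len(clusters) - 1)
        ]
        i = gaps.index(min(gaps))
        merged = (clusters[i][0] + clusters[i + 1][0],
                  clusters[i][1] + clusters[i + 1][1])
        clusters[i:i + 2] = [merged]
    return [s / c for s, c in clusters]


def quantize(weights, codebook):
    """Map each weight to the index of its nearest centroid."""
    return [
        min(range(len(codebook)), key=lambda j: abs(w - codebook[j]))
        for w in weights
    ]


def compression_rate(n_weights, bits, full_bits=32):
    """Full-precision size divided by (index storage + codebook storage)."""
    original = n_weights * full_bits
    compressed = n_weights * bits + (2 ** bits) * full_bits
    return original / compressed


# Toy layer: 512 synthetic weights, quantized to 3 bits (8 shared values).
random.seed(0)
layer = [random.gauss(0.0, 0.1) for _ in range(512)]
codebook = hierarchical_codebook(layer, bits=3)
indices = quantize(layer, codebook)
restored = [codebook[j] for j in indices]
rate = compression_rate(len(layer), bits=3)
```

In this toy setup each weight is stored as a 3-bit index into a shared table of eight 32-bit values, which is where the space saving comes from. The compression rates reported in the abstract come from the authors' own experiments and are not reproduced by this sketch.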
Pages: 207 - 218
Page count: 12
Related papers
50 in total
  • [41] Adaptive Search-based Repair of Deep Neural Networks
    Calsi, Davide Li
    Duran, Matias
    Laurent, Thomas
    Zhang, Xiao-Yi
    Arcaini, Paolo
    Ishikawa, Fuyuki
    PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2023, 2023, : 1527 - 1536
  • [42] Adaptive Robust Watermarking Method Based on Deep Neural Networks
    Li, Fan
    Wan, Chen
    Huang, Fangjun
    DIGITAL FORENSICS AND WATERMARKING, IWDW 2022, 2023, 13825 : 162 - 173
  • [43] RETRIEVAL PROPERTIES OF NEURAL NETWORKS WITH HIERARCHICAL-CLUSTERING
    IDIART, MAP
    THEUMANN, A
    JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1991, 24 (11): : L649 - L658
  • [44] Learning Hierarchical Graph Neural Networks for Image Clustering
    Xing, Yifan
    He, Tong
    Xiao, Tianjun
    Wang, Yongxin
    Xiong, Yuanjun
    Xia, Wei
    Wipf, David
    Zhang, Zheng
    Soatto, Stefano
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3447 - 3457
  • [45] Context Quantization based on Minimum Description Length and Hierarchical Clustering
    Chen, Hui
    Chen, Jianhua
    2016 8TH INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2016), 2016, 56
  • [46] Clustering-based classification of road traffic accidents using hierarchical clustering and artificial neural networks
    Taamneh, Madhar
    Taamneh, Salah
    Alkheder, Sharaf
    INTERNATIONAL JOURNAL OF INJURY CONTROL AND SAFETY PROMOTION, 2017, 24 (03) : 388 - 395
  • [47] HIERARCHICAL PART DETECTION WITH DEEP NEURAL NETWORKS
    Cervantes, Esteve
    Yu, Long Long
    Bagdanov, Andrew D.
    Masana, Marc
    van de Weijer, Joost
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1933 - 1937
  • [48] A Study on Deep Neural Networks Framework
    Huang Yi
    Duan Xiusheng
    Sun Shiyu
    Chen Zhigang
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 1519 - 1522
  • [49] Hierarchical Weight Averaging for Deep Neural Networks
    Gu, Xiaozhe
    Zhang, Zixun
    Jiang, Yuncheng
    Luo, Tao
    Zhang, Ruimao
    Cui, Shuguang
    Li, Zhen
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12276 - 12287
  • [50] GRAPHNET: GRAPH CLUSTERING WITH DEEP NEURAL NETWORKS
    Zhang, Xianchao
    Mu, Jie
    Liu, Han
    Zhang, Xiaotong
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3800 - 3804