Generative Data Free Model Quantization With Knowledge Matching for Classification

Cited by: 7
Authors
Xu, Shoukai [1, 2]
Zhang, Shuhai [1 ]
Liu, Jing [3 ]
Zhuang, Bohan [3 ]
Wang, Yaowei [2 ]
Tan, Mingkui [1, 4]
Affiliations
[1] South China Univ Technol, Sch Software Engn, Guangzhou 510006, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[3] Monash Univ, Fac Informat Technol, Clayton, Vic 3800, Australia
[4] South China Univ Technol, Key Lab Big Data & Intelligent Robot, Minist Educ, Guangzhou 510006, Peoples R China
Keywords
Data privacy and security; model compression; data free quantization; data generation; BINARY NEURAL-NETWORKS; IMAGE; SEGMENTATION; CONVOLUTION; ACCURATE; CNN;
DOI
10.1109/TCSVT.2023.3279281
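Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;
Abstract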
Neural network quantization aims to reduce model size, computational complexity, and memory consumption by mapping weights and activations from full precision to low precision. However, many existing quantization methods, whether post-training quantization with calibration or quantization-aware training with fine-tuning, require the original data for better performance, and that data may be unavailable due to confidentiality or privacy constraints. This lack of data can lead to a significant decline in performance. In this paper, we propose a universal and effective method called Generative Data Free Model Quantization with Knowledge Matching for Classification (KMDFQ) that removes the dependence on data for neural network quantization. To achieve this, we propose a knowledge matching generator that produces meaningful fake data based on the latent knowledge in the pre-trained model, including classification boundary knowledge and data distribution information. Based on this generator, we propose a fake-data-driven data-free quantization method that uses the generated data to exploit this latent knowledge for quantization. Furthermore, we introduce Mean Square Error alignment during the fine-tuning of the quantized model so that it learns this knowledge more strictly and directly, which makes the method better suited to data-free quantization. Extensive experiments on image classification demonstrate the effectiveness of our method, which achieves higher accuracy than existing data-free quantization methods, particularly as the quantization bit-width decreases. For example, on ImageNet, the 4-bit data-free quantized ResNet-18 has less than a 1.2% accuracy decline compared to quantization with real data. The source code is available at https://github.com/ZSHsh98/KMDFQ.
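As a rough illustration of two ideas the abstract mentions, mapping full-precision values to a low-precision grid and measuring the gap between full-precision and quantized values with a mean square error, the following minimal sketch may help. It is not the KMDFQ implementation (see the linked repository for that); it assumes a simple symmetric uniform quantizer, and the function names `quantize_uniform` and `mse` are hypothetical, chosen only for this example.

```python
from typing import List


def quantize_uniform(values: List[float], bits: int) -> List[float]:
    """Symmetric uniform "fake" quantization (an assumed scheme, not KMDFQ's).

    Each value is rounded to the nearest point of a signed integer grid
    with 2**(bits - 1) - 1 levels on each side of zero, then mapped back
    to a float so the rounding error can be inspected directly.
    """
    if bits < 2:
        raise ValueError("bits must be at least 2 for a signed grid")
    max_abs = max(abs(v) for v in values)
    if max_abs == 0.0:
        return [0.0 for _ in values]
    q_max = 2 ** (bits - 1) - 1   # largest integer level, e.g. 7 for 4 bits
    scale = max_abs / q_max       # grid step size in full-precision units
    quantized = []
    for v in values:
        level = round(v / scale)                 # nearest integer level
        level = max(-q_max, min(q_max, level))   # clamp to the grid
        quantized.append(level * scale)          # back to a float
    return quantized


def mse(a: List[float], b: List[float]) -> float:
    """Mean square error between two equally long lists of floats."""
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


# Hypothetical full-precision weights, used only for illustration.
weights = [7.0, -3.2, 1.6, 0.0]
weights_4bit = quantize_uniform(weights, bits=4)
weights_8bit = quantize_uniform(weights, bits=8)
error_4bit = mse(weights, weights_4bit)
error_8bit = mse(weights, weights_8bit)
```

On these example weights, the 4-bit grid has a step of 1.0 and the 8-bit grid a much finer one, so `error_8bit` comes out smaller than `error_4bit`. This is only the generic trade-off between bit-width and rounding error, not a reproduction of any result reported in the paper.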
Pages: 7296-7309
Number of pages: 14