Generative Data Free Model Quantization With Knowledge Matching for Classification

被引:7
|
作者
Xu, Shoukai [1 ,2 ]
Zhang, Shuhai [1 ]
Liu, Jing [3 ]
Zhuang, Bohan [3 ]
Wang, Yaowei [2 ]
Tan, Mingkui [1 ,4 ]
机构
[1] South China Univ Technol, Sch Software Engn, Guangzhou 510006, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[3] Monash Univ, Fac Informat Technol, Clayton, Vic 3800, Australia
[4] South China Univ Technol, Key Lab Big Data & Intelligent Robot, Minist Educ, Guangzhou 510006, Peoples R China
关键词
Data privacy and security; model compression; data free quantization; data generation; BINARY NEURAL-NETWORKS; IMAGE; SEGMENTATION; CONVOLUTION; ACCURATE; CNN;
D O I
10.1109/TCSVT.2023.3279281
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Neural network quantization aims to reduce the model size, computational complexity, and memory consumption by mapping weights and activations from full-precision to low-precision. However, many existing quantization methods, either post-training with calibration or quantization-aware training with fine-tuning, require original data for better performance, which may not be available due to confidentiality or privacy constraints. This lack of data can lead to a significant decline in performance. In this paper, we propose a universal and effective method called Generative Data Free Model Quantization with Knowledge Matching for Classification(KMDFQ) that removes the dependence on data for neural network quantization. To achieve this, we propose a knowledge matching generator that produces meaningful fake data based on the latent knowledge in the pre-trained model, including classification boundary knowledge and data distribution information. Based on this generator, we propose a fake-data driven data free quantization method that uses the generated data to take advantage of the latent knowledge for quantization. Furthermore, we introduce Mean Square Error alignment during the fine-tuning of the quantized model to more strictly and directly learn knowledge, making it more suitable for data free quantization. Extensive experiments on image classification demonstrate the effectiveness of our method, achieving higher accuracy than existing data free quantization methods, particularly as the quantization bit decreases. For example, on ImageNet, the 4-bit data free quantized ResNet-18 has less than a 1.2% accuracy decline compared to quantization with real data. The source code is available at https://github.com/ZSHsh98/KMDFQ.
引用
收藏
页码:7296 / 7309
页数:14
相关论文
共 50 条
  • [41] Conditional pseudo-supervised contrast for data-Free knowledge distillation
    Shao, Renrong
    Zhang, Wei
    Wang, Jun
    PATTERN RECOGNITION, 2023, 143
  • [42] CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-training Quantization of ViTs
    Ramachandran, Akshat
    Kundu, Souvik
    Krishna, Tushar
    COMPUTER VISION - ECCV 2024, PT LXVII, 2025, 15125 : 307 - 325
  • [43] A Category-Aware Curriculum Learning for Data-Free Knowledge Distillation
    Li, Xiufang
    Jiao, Licheng
    Sun, Qigong
    Liu, Fang
    Liu, Xu
    Li, Lingling
    Chen, Puhua
    Yang, Shuyuan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9603 - 9618
  • [44] Advancing Trans-Domain Classification With Knowledge Distillation: Bridging LIDAR and Image Data
    Ortiz, Jesus Eduardo
    Creixell, Werner
    IEEE ACCESS, 2025, 13 : 20574 - 20583
  • [45] A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs
    George, Dileep
    Lehrach, Wolfgang
    Kansky, Ken
    Lazaro-Gredilla, Miguel
    Laan, Christopher
    Marthi, Bhaskara
    Lou, Xinghua
    Meng, Zhaoshi
    Liu, Yi
    Wang, Huayan
    Lavin, Alex
    Phoenix, D. Scott
    SCIENCE, 2017, 358 (6368)
  • [46] TasselGAN: An Application of the Generative Adversarial Model for Creating Field-Based Maize Tassel Data
    Shete, Snehal
    Srinivasan, Srikant
    Gonsalves, Timothy A.
    PLANT PHENOMICS, 2020, 2020 (2020):
  • [47] HDKD: Hybrid data-efficient knowledge distillation network for medical image classification
    EL-Assiouti, Omar S.
    Hamed, Ghada
    Khattab, Dina
    Ebied, Hala M.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [48] Human motion classification using 2D stick-model matching regression coefficients
    Chan, C. K.
    Loh, W. P.
    Abd Rahim, I.
    APPLIED MATHEMATICS AND COMPUTATION, 2016, 283 : 70 - 89
  • [49] An Industrial Short Text Classification Method Based on Large Language Model and Knowledge Base
    Yin, Haoran
    2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,
  • [50] Data-Free Knowledge Distillation for Privacy-Preserving Efficient UAV Networks
    Yu, Guyang
    2022 6TH INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION SCIENCES (ICRAS 2022), 2022, : 52 - 56