Generative Data Free Model Quantization With Knowledge Matching for Classification

被引:7
|
作者
Xu, Shoukai [1 ,2 ]
Zhang, Shuhai [1 ]
Liu, Jing [3 ]
Zhuang, Bohan [3 ]
Wang, Yaowei [2 ]
Tan, Mingkui [1 ,4 ]
机构
[1] South China Univ Technol, Sch Software Engn, Guangzhou 510006, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[3] Monash Univ, Fac Informat Technol, Clayton, Vic 3800, Australia
[4] South China Univ Technol, Key Lab Big Data & Intelligent Robot, Minist Educ, Guangzhou 510006, Peoples R China
关键词
Data privacy and security; model compression; data free quantization; data generation; BINARY NEURAL-NETWORKS; IMAGE; SEGMENTATION; CONVOLUTION; ACCURATE; CNN;
D O I
10.1109/TCSVT.2023.3279281
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Neural network quantization aims to reduce the model size, computational complexity, and memory consumption by mapping weights and activations from full-precision to low-precision. However, many existing quantization methods, either post-training with calibration or quantization-aware training with fine-tuning, require original data for better performance, which may not be available due to confidentiality or privacy constraints. This lack of data can lead to a significant decline in performance. In this paper, we propose a universal and effective method called Generative Data Free Model Quantization with Knowledge Matching for Classification(KMDFQ) that removes the dependence on data for neural network quantization. To achieve this, we propose a knowledge matching generator that produces meaningful fake data based on the latent knowledge in the pre-trained model, including classification boundary knowledge and data distribution information. Based on this generator, we propose a fake-data driven data free quantization method that uses the generated data to take advantage of the latent knowledge for quantization. Furthermore, we introduce Mean Square Error alignment during the fine-tuning of the quantized model to more strictly and directly learn knowledge, making it more suitable for data free quantization. Extensive experiments on image classification demonstrate the effectiveness of our method, achieving higher accuracy than existing data free quantization methods, particularly as the quantization bit decreases. For example, on ImageNet, the 4-bit data free quantized ResNet-18 has less than a 1.2% accuracy decline compared to quantization with real data. The source code is available at https://github.com/ZSHsh98/KMDFQ.
引用
收藏
页码:7296 / 7309
页数:14
相关论文
共 50 条
  • [21] A Comprehensive Survey on Model Quantization for Deep Neural Networks in Image Classification
    Rokh, Babak
    Azarpeyvand, Ali
    Khanteymoori, Alireza
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (06)
  • [22] Insufficient Data Generative Model for Pipeline Network Leak Detection Using Generative Adversarial Networks
    Zhang, Huaguang
    Hu, Xuguang
    Ma, Dazhong
    Wang, Rui
    Xie, Xiangpeng
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) : 7107 - 7120
  • [23] Data-Free Ensemble Knowledge Distillation for Privacy-conscious Multimedia Model Compression
    Hao, Zhiwei
    Luo, Yong
    Hu, Han
    An, Jianping
    Wen, Yonggang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1803 - 1811
  • [24] HaCk: Hand Gesture Classification Using a Convolutional Neural Network and Generative Adversarial Network-Based Data Generation Model
    Chatterjee, Kalyan
    Raju, M.
    Selvamuthukumaran, N.
    Pramod, M.
    Kumar, B. Krishna
    Bandyopadhyay, Anjan
    Mallik, Saurav
    INFORMATION, 2024, 15 (02)
  • [25] Adversarial Attacks and Defense on an Aircraft Classification Model Using a Generative Adversarial Network
    Colter, Jamison
    Kinnison, Matthew
    Henderson, Alex
    Harbour, Steven
    2023 IEEE/AIAA 42ND DIGITAL AVIONICS SYSTEMS CONFERENCE, DASC, 2023,
  • [26] Towards Feature Distribution Alignment and Diversity Enhancement for Data-Free Quantization
    Gao, Yangcheng
    Zhang, Zhao
    Hong, Richang
    Zhang, Haijun
    Fan, Jicong
    Yan, Shuicheng
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 141 - 150
  • [27] Generative Large Model-Driven Methodology for Color Matching and Shape Design in IP Products
    Wu, Fan
    Lu, Peng
    Hsiao, Shih-Wen
    ENTROPY, 2025, 27 (03)
  • [28] PSAQ-ViT V2: Toward Accurate and General Data-Free Quantization for Vision Transformers
    Li, Zhikai
    Chen, Mengjuan
    Xiao, Junrui
    Gu, Qingyi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 17227 - 17238
  • [29] Classification of hyperspectral urban data using adaptive simultaneous orthogonal matching pursuit
    Zou, Jinyi
    Li, Wei
    Huang, Xin
    Du, Qian
    JOURNAL OF APPLIED REMOTE SENSING, 2014, 8
  • [30] Data-Free Low-Bit Quantization for Remote Sensing Object Detection
    Zhang, Ruiyan
    Jiang, Xiujie
    An, Junshe
    Cui, Tianshu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19