A NOVEL SENSITIVITY METRIC FOR MIXED-PRECISION QUANTIZATION WITH SYNTHETIC DATA GENERATION

被引:1
|
作者
Lee, Donghyun [1 ]
Cho, Minkyoung [1 ]
Lee, Seungwon [1 ]
Song, Joonho [1 ]
Choi, Changkyu [1 ]
机构
[1] Samsung Elect, Samsung Adv Inst Technol, Suwon, South Korea
关键词
Deep Learning; Quantization; Data Free;
D O I
10.1109/ICIP42928.2021.9506527
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Post-training quantization is a representative technique for compressing neural networks, making them smaller and more efficient for deployment on edge devices. However, an inaccessible user dataset often makes it difficult to ensure the quality of the quantized neural network in practice. In addition, existing approaches may use a single uniform bit-width across the network, resulting in significant accuracy degradation at extremely low bit-widths. To utilize multiple bit-width, sensitivity metric plays a key role in balancing accuracy and compression. In this paper, we propose a novel sensitivity metric that considers the effect of quantization error on task loss and interaction with other layers. Moreover, we develop labeled data generation methods that are not dependent on a specific operation of the neural network. Our experiments show that the proposed metric better represents quantization sensitivity, and generated data are more feasible to apply to mixed-precision quantization.
引用
收藏
页码:1294 / 1298
页数:5
相关论文
共 50 条
  • [21] Mixed-precision Quantization with Dynamical Hessian Matrix for Object Detection Network
    Yang, Zerui
    Fei, Wen
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [22] Mixed-precision quantization-aware training for photonic neural networks
    Kirtas, Manos
    Passalis, Nikolaos
    Oikonomou, Athina
    Moralis-Pegios, Miltos
    Giamougiannis, George
    Tsakyridis, Apostolos
    Mourgias-Alexandris, George
    Pleros, Nikolaos
    Tefas, Anastasios
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (29): : 21361 - 21379
  • [23] Mixed-precision quantization-aware training for photonic neural networks
    Manos Kirtas
    Nikolaos Passalis
    Athina Oikonomou
    Miltos Moralis-Pegios
    George Giamougiannis
    Apostolos Tsakyridis
    George Mourgias-Alexandris
    Nikolaos Pleros
    Anastasios Tefas
    Neural Computing and Applications, 2023, 35 : 21361 - 21379
  • [24] Mixed-Precision Quantization of U-Net for Medical Image Segmentation
    Guo, Liming
    Fei, Wen
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 2871 - 2875
  • [25] Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization
    Chen, Weihan
    Wang, Peisong
    Cheng, Jian
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5330 - 5339
  • [26] Mixed-precision Deep Neural Network Quantization With Multiple Compression Rates
    Wang, Xuanda
    Fei, Wen
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 371 - 371
  • [27] Mixed-precision quantization for neural networks based on error limit (Invited)
    Li Y.
    Guo Z.
    Liu K.
    Sun X.
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2022, 51 (04):
  • [28] Entropy-Driven Mixed-Precision Quantization for Deep Network Design
    Sun, Zhenhong
    Ge, Ce
    Wang, Junyan
    Lin, Ming
    Chen, Hesen
    Li, Hao
    Sun, Xiuyu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [29] Neuroevolution-Enhanced Multi-Objective Optimization for Mixed-Precision Quantization
    Mira, Santiago
    Chua, Vui Seng
    Marder, Mattias
    Phiellip, Mariano
    Jain, Nilesh
    Majumdar, Somdeb
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'22), 2022, : 1057 - 1065
  • [30] Hardware-Friendly Logarithmic Quantization with Mixed-Precision for MobileNetV2
    Choi, Dahun
    Kim, Hyun
    2022 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2022): INTELLIGENT TECHNOLOGY IN THE POST-PANDEMIC ERA, 2022, : 348 - 351