NICE: Noise Injection and Clamping Estimation for Neural Network Quantization

被引:5
|
作者
Baskin, Chaim [1 ]
Zheltonozhkii, Evgenii [1 ]
Rozen, Tal [2 ]
Liss, Natan [2 ]
Chai, Yoav [3 ]
Schwartz, Eli [3 ]
Giryes, Raja [3 ]
Bronstein, Alexander M. [1 ]
Mendelson, Avi [1 ]
机构
[1] Technion, Dept Comp Sci, IL-3200003 Haifa, Israel
[2] Technion, Dept Elect Engn, IL-3200003 Haifa, Israel
[3] Tel Aviv Univ, Sch Elect Engn, IL-6997801 Tel Aviv, Israel
关键词
neural networks; low power; quantization; CNN architecture;
D O I
10.3390/math9172144
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Convolutional Neural Networks (CNNs) are very popular in many fields including computer vision, speech recognition, natural language processing, etc. Though deep learning leads to groundbreaking performance in those domains, the networks used are very computationally demanding and are far from being able to perform in real-time applications even on a GPU, which is not power efficient and therefore does not suit low power systems such as mobile devices. To overcome this challenge, some solutions have been proposed for quantizing the weights and activations of these networks, which accelerate the runtime significantly. Yet, this acceleration comes at the cost of a larger error unless spatial adjustments are carried out. The method proposed in this work trains quantized neural networks by noise injection and a learned clamping, which improve accuracy. This leads to state-of-the-art results on various regression and classification tasks, e.g., ImageNet classification with architectures such as ResNet-18/34/50 with as low as 3 bit weights and activations. We implement the proposed solution on an FPGA to demonstrate its applicability for low-power real-time applications. The quantization code will become publicly available upon acceptance.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Radial basis function neural network predictor for parameter estimation in chaotic noise
    Xie, Hongmei
    Feng, Xiaoyi
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 2, PROCEEDINGS, 2007, 4492 : 135 - +
  • [22] Application of Artificial Neural Network for Image Noise Level Estimation in the SVD domain
    Turajlic, Emir
    Begovic, Alen
    Skaljo, Namir
    ELECTRONICS, 2019, 8 (02)
  • [23] Parallel Verification for δ-Equivalence of Neural Network Quantization
    Huang, Pei
    Yang, Yuting
    Wu, Haoze
    Daukantas, Ieva
    Wu, Min
    Jia, Fuqi
    Barrett, Clark
    AI VERIFICATION, SAIV 2024, 2024, 14846 : 78 - 99
  • [24] LEARNING VECTOR QUANTIZATION FOR THE PROBABILISTIC NEURAL NETWORK
    BURRASCANO, P
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1991, 2 (04): : 458 - 461
  • [25] A Quantization Framework for Neural Network Adaption at the Edge
    Li, Mengyuan
    Hu, Xiaobo Sharon
    PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 402 - 407
  • [26] Mirror Descent View for Neural Network Quantization
    Ajanthan, Thalaiyasingam
    Gupta, Kartik
    Torr, Philip H. S.
    Hartley, Richard
    Dokania, Puneet K.
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [27] Neural network quantization in federated learning at the edge
    Tonellotto, Nicola
    Gotta, Alberto
    Nardini, Franco Maria
    Gadler, Daniele
    Silvestri, Fabrizio
    Information Sciences, 2021, 575 : 417 - 436
  • [28] DEPENDENT SCALAR QUANTIZATION FOR NEURAL NETWORK COMPRESSION
    Haase, Paul
    Schwarz, Heiko
    Kirchhoffer, Heiner
    Wiedemann, Simon
    Marinc, Talmaj
    Marban, Arturo
    Mueller, Karsten
    Samek, Wojciech
    Marpe, Detlev
    Wiegand, Thomas
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 36 - 40
  • [29] Learnable Lookup Table for Neural Network Quantization
    Wang, Longguang
    Dong, Xiaoyu
    Wang, Yingqian
    Liu, Li
    An, Wei
    Guo, Yulan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 12413 - 12423
  • [30] Counterexample Guided Neural Network Quantization Refinement
    Matos, Joao Batista P.
    de Lima Filho, Eddie B.
    Bessa, Iury
    Manino, Edoardo
    Song, Xidan
    Cordeiro, Lucas C.
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (04) : 1121 - 1134