NICE: Noise Injection and Clamping Estimation for Neural Network Quantization

被引:5
|
作者
Baskin, Chaim [1 ]
Zheltonozhkii, Evgenii [1 ]
Rozen, Tal [2 ]
Liss, Natan [2 ]
Chai, Yoav [3 ]
Schwartz, Eli [3 ]
Giryes, Raja [3 ]
Bronstein, Alexander M. [1 ]
Mendelson, Avi [1 ]
机构
[1] Technion, Dept Comp Sci, IL-3200003 Haifa, Israel
[2] Technion, Dept Elect Engn, IL-3200003 Haifa, Israel
[3] Tel Aviv Univ, Sch Elect Engn, IL-6997801 Tel Aviv, Israel
关键词
neural networks; low power; quantization; CNN architecture;
D O I
10.3390/math9172144
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Convolutional Neural Networks (CNNs) are very popular in many fields including computer vision, speech recognition, natural language processing, etc. Though deep learning leads to groundbreaking performance in those domains, the networks used are very computationally demanding and are far from being able to perform in real-time applications even on a GPU, which is not power efficient and therefore does not suit low power systems such as mobile devices. To overcome this challenge, some solutions have been proposed for quantizing the weights and activations of these networks, which accelerate the runtime significantly. Yet, this acceleration comes at the cost of a larger error unless spatial adjustments are carried out. The method proposed in this work trains quantized neural networks by noise injection and a learned clamping, which improve accuracy. This leads to state-of-the-art results on various regression and classification tasks, e.g., ImageNet classification with architectures such as ResNet-18/34/50 with as low as 3 bit weights and activations. We implement the proposed solution on an FPGA to demonstrate its applicability for low-power real-time applications. The quantization code will become publicly available upon acceptance.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Layer Importance Estimation with Imprinting for Neural Network Quantization
    Liu, Hongyang
    Elkerdawy, Sara
    Ray, Nilanjan
    Elhoushi, Mostafa
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 2408 - 2417
  • [2] UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks
    Baskin, Chaim
    Liss, Natan
    Schwartz, Eli
    Zheltonozhskii, Evgenii
    Giryes, Raja
    Bronstein, Alex M.
    Mendelson, Avi
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2021, 37 (1-4): : 1 - 4
  • [3] MORE ON ESTIMATION OF QUANTIZATION NOISE
    DAVYDOV, YG
    TELECOMMUNICATIONS AND RADIO ENGINEERING, 1972, (06) : 18 - 20
  • [4] Search region prediction for motion estimation based on neural network vector quantization
    Ryu, DaeHyun
    Kim, HyungJun
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 2, PROCEEDINGS, 2006, 3972 : 460 - 467
  • [5] QUANTIZATION NOISE ESTIMATION FOR LOG-PCM
    Konate, Mohamed
    Kabal, Peter
    2011 24TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2011, : 1337 - 1341
  • [6] Estimation of fractal parameters in the presence of quantization and noise
    Longo, M
    Marano, S
    Percannella, R
    Restaino, R
    IGARSS 2001: SCANNING THE PRESENT AND RESOLVING THE FUTURE, VOLS 1-7, PROCEEDINGS, 2001, : 1979 - 1981
  • [7] Robust Noise Estimation Based on Noise Injection
    Tang, Chongwu
    Yang, Xiaokang
    Zhai, Guangtao
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 74 (01): : 69 - 78
  • [8] Robust Noise Estimation Based on Noise Injection
    Chongwu Tang
    Xiaokang Yang
    Guangtao Zhai
    Journal of Signal Processing Systems, 2014, 74 : 69 - 78
  • [9] Adaptive Quantization for Deep Neural Network
    Zhou, Yiren
    Moosavi-Dezfooli, Seyed-Mohsen
    Cheung, Ngai-Man
    Frossard, Pascal
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4596 - 4604
  • [10] Trainable Thresholds for Neural Network Quantization
    Goncharenko, Alexander
    Denisov, Andrey
    Alyamkin, Sergey
    Terentev, Evgeny
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2019, PT II, 2019, 11507 : 302 - 312