A FIXED-POINT QUANTIZATION TECHNIQUE FOR CONVOLUTIONAL NEURAL NETWORKS BASED ON WEIGHT SCALING

被引:0
作者
Mitschke, Norbert [1 ]
Heizmann, Michael [1 ]
Noffz, Klaus-Henning [2 ]
Wittmann, Ralf [2 ]
机构
[1] Karlsruhe Inst Technol, IIIT, Hertzstr 16, D-76187 Karlsruhe, Germany
[2] Silicon Software, K Zuse Ring 28, D-68163 Mannheim, Germany
来源
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019年
关键词
CNNs; Fixed Point Quantization; Image Processing; Machine Vision; Deep Learning;
D O I
10.1109/icip.2019.8803490
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
In order to make convolutional neural networks (CNNs) usable on smaller or mobile devices, it is necessary to reduce the computing, energy and storage requirements of these networks. One can achieved this by a fixed-point quantization of weights and activations of a CNN, which are usually represented by 32-bit floating-point. In this paper, we present an adaption of convolutional and fully connected layers in order to obtain a high usage of the given value range of activations and weights. Therefore, we introduce scaling factors obtained by moving average to limit the weights and activations. Our model, quantized to 8 bit, outperforms the 7-layer baseline model from which it is derived and the naive quantization by several percentage points. Our method does not require any additional operations in the inference and both the weights and activations have a fixed radix point.
引用
收藏
页码:3836 / 3840
页数:5
相关论文
共 18 条
[1]  
[Anonymous], 2016, P ICLR
[2]  
[Anonymous], 2017, ARXIV170404861
[3]  
[Anonymous], 2014, Training Deep Neural Networks with Low Precision Multiplications
[4]  
[Anonymous], 2010, AT T LABS
[5]  
[Anonymous], ADV NEURAL INFORM PR
[6]  
[Anonymous], 2017, CORR
[7]  
Chen Q, 2017, INT CONF ASIC, P148, DOI 10.1109/ASICON.2017.8252433
[8]  
Chollet F., 2015, Keras
[9]  
Dozat T., 2016, P ICLR 2016 WORKSH T, P1
[10]   Angel-Eye: A Complete Design Flow for Mapping CNN onto Customized Hardware [J].
Guo, Kaiyuan ;
Sui, Lingzhi ;
Qiu, Jiantao ;
Yao, Song ;
Han, Song ;
Wang, Yu ;
Yang, Huazhong .
2016 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI), 2016, :24-29