Adaptive Integer Quantisation for Convolutional Neural Networks through Evolutionary Algorithms

Cited by: 0

Authors:

Wang, Ziwei [1]

Trefzer, Martin A. [1]

Bale, Simon J. [1]

Tyrrell, Andy M. [1]

Affiliations:

[1] Univ York, Dept Elect Engn, York, N Yorkshire, England

Source:

2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021) | 2021

Keywords:

Convolutional Neural Networks; Quantisation; Evolutionary Algorithms; Deep Learning;

DOI:

10.1109/SSCI50451.2021.9659887

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

State-of-the-art Convolutional Neural Networks (CNNs) have become increasingly accurate. However, storing them requires hundreds or thousands of megabytes of data, and the networks are also computationally expensive. For certain applications, such as the Internet of Things (IoT), where CNNs are to be implemented on resource-constrained and memory-constrained platforms, including Field-Programmable Gate Arrays (FPGAs) and embedded devices, CNN architectures and parameters have to be small and efficient. In this paper, an evolutionary algorithm (EA) based adaptive integer quantisation method is proposed to reduce network size. The proposed method uses a single-objective, rank-based evolutionary strategy to find the best quantisation bin boundaries for a fixed quantised bit width. The performance of the proposed method is evaluated on a small CNN, the LeNet-5 architecture, using the CIFAR-10 dataset. The aim is to devise a methodology that allows adaptive quantisation of both weights and biases from a 32-bit floating-point to an 8-bit integer representation for LeNet-5, while retaining accuracy. The experiments compare straightforward (linear) quantisation from 32 bits to 8 bits with the proposed adaptive quantisation method. The results show that the proposed method is capable of quantising CNNs to a lower bit-width representation with only a slight loss in classification accuracy.
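
The abstract contrasts linear quantisation with an evolutionary search over quantisation bin boundaries at a fixed bit width. The following Python sketch illustrates that general idea only; it is not the authors' implementation. The fitness used here (mean squared reconstruction error), the (mu + lambda) rank-based selection, the Gaussian mutation, and all function and parameter names (`linear_boundaries`, `quantise`, `reconstruction_error`, `evolve_boundaries`, `mu`, `lam`, `sigma`) are assumptions introduced for illustration. The abstract reports results as classification accuracy of LeNet-5 on CIFAR-10, which this sketch does not reproduce.

```python
import bisect
import random


def linear_boundaries(lo, hi, bits):
    """Evenly spaced interior bin boundaries for a 2**bits-level quantiser."""
    levels = 2 ** bits
    step = (hi - lo) / levels
    return [lo + step * i for i in range(1, levels)]


def quantise(values, boundaries):
    """Map each float to an integer code in 0 .. 2**bits - 1 (its bin index)."""
    return [bisect.bisect_right(boundaries, v) for v in values]


def reconstruction_error(values, boundaries, lo, hi):
    """Mean squared error when each value is replaced by its bin midpoint.

    Assumption: this stands in for the fitness; it is not the paper's metric.
    """
    edges = [lo] + list(boundaries) + [hi]
    total = 0.0
    for v in values:
        k = bisect.bisect_right(boundaries, v)
        centre = 0.5 * (edges[k] + edges[k + 1])
        total += (v - centre) ** 2
    return total / len(values)


def evolve_boundaries(values, bits=8, mu=4, lam=12, generations=30,
                      sigma=0.002, seed=0):
    """Single-objective (mu + lambda) search over bin boundaries.

    Starts from the linear boundaries, mutates them with Gaussian noise,
    ranks parents and offspring by fitness, and keeps the best mu.
    """
    rng = random.Random(seed)
    lo, hi = min(values), max(values)
    span = hi - lo
    parents = [linear_boundaries(lo, hi, bits)]
    for _ in range(generations):
        offspring = []
        for _ in range(lam):
            parent = rng.choice(parents)
            # Perturb every boundary, clamp to [lo, hi], and restore ordering.
            child = sorted(
                min(hi, max(lo, b + rng.gauss(0.0, sigma * span)))
                for b in parent
            )
            offspring.append(child)
        pool = parents + offspring
        pool.sort(key=lambda b: reconstruction_error(values, b, lo, hi))
        parents = pool[:mu]  # rank-based truncation; parents survive (elitism)
    return parents[0], lo, hi
```

Because the linear boundaries seed the population and parents compete with their offspring, the returned boundaries are never worse than the linear baseline under the chosen fitness. Calling `evolve_boundaries(weights, bits=8)` on a list of floating-point weights and then `quantise(weights, best)` yields integer codes in the range 0 to 255.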

Pages: 7
