A novel structured sparse fully connected layer in convolutional neural networks

被引：8

作者：

Matsumura, Naoki ^{[1
]}

Ito, Yasuaki ^{[1
]}

Nakano, Koji ^{[1
]}

Kasagi, Akihiko ^{[2
]}

Tabaru, Tsuguchika ^{[2
]}

机构：

[1] Hiroshima Univ, Dept Informat Engn, Higashihiroshima, Japan

[2] Fujitsu Labs Ltd, Kawasaki, Kanagawa, Japan

来源：

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2023年 / 35卷 / 11期

关键词：

convolutional neural network; GPU; model compression;

D O I：

10.1002/cpe.6213

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Convolutional Neural Networks (CNNs) are one of the factors supporting the rapid development of artificial intelligent techniques. However, as the ability of the network increases, the size of the network becomes larger. Thus far, several works related to reduction of the network size have been tackled. In many cases, these approaches produce an unstructured network which prevents efficient parallel computation. To avoid this problem, we propose a novel structured sparse fully connected layer (FCL) in the CNNs. The aim of our proposed approach is reduction of the number of network parameters in the FCLs which occupy a large part of network parameters. Unlike the general FCLs used in the popular CNNs such as VGG-16, the proposed approach reduces the connection between the last convolutional layer and the first FCL. In addition, we show an implementation for the proposed sparse FCLs on the GPU using cuBLAS. As a result for ILSVRC-2012 dataset, the proposed approach achieves a 21.3 times compression with 0.68% top-1 accuracy and 0.31% top-5 accuracy decreases for VGG-16. The implementation of the proposed FCLs achieves speed-up factor 14.97 and 16.67 for forward and backward propagation compared to that for the noncompressed FCLs, respectively.

引用

页数：19

共 22 条

[1]

Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265

[2]

[Anonymous], 2017, IEEE INT C COMPUT VI, DOI [10.1109/iccv.201, DOI 10.1109/ICCV.2017.322]

[3]

Courbariaux M., 2016, ARXIV

[4]

Courbariaux M, 2015, ADV NEUR IN, V28

[5] PERMDNN: Efficient Compressed DNN Architecture with Permuted Diagonal Matrices [J].

Deng, Chunhua ;

Liao, Siyu ;

Xie, Yi ;

Parhi, Keshab K. ;

Qian, Xuehai ;

Yuan, Bo .

2018 51ST ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2018, :189-202

[6]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[7]

Han S., 2016, arxiv preprint arXiv:1607.04381

[8]

Howard A.G., 2017, arXiv

[9]

Hu J., 2018, PROC IEEECVF C COMPU

[10]

Khosla A., 2011, P CVPR WORKSH FIN GR

← 1 2 3 →