An Efficient Sparse CNNs Accelerator on FPGA

被引：1

作者：

Zhang, Yonghua ^{[1
]}

Jiang, Hongxu ^{[1
]}

Li, Xiaobin ^{[1
]}

Wang, Haojie ^{[2
]}

Dong, Dong ^{[1
]}

Cao, Yongxiang ^{[1
]}

机构：

[1] Beihang Univ, Beijing, Peoples R China

[2] Tsinghua Univ, Beijing, Peoples R China

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2022) | 2022年

关键词：

Inference Acceleration; Sparse CNNs; Load Balancing; FPGA; Hash Mapping Conflict;

D O I：

10.1109/CLUSTER51413.2022.00063

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Convolutional Neural Networks (CNNs) have achieved remarkable performance at a huge computational cost. By improving the model sparsity, it can effectively reduce the complexity. However, with deepening of sparsity, the problems of unbalanced workloads, computing fragmentation and mapping access conflict caused by irregular sparsity have become more and more remarkable. These problems pose great challenges for efficient computation of sparse CNNs. In order to make full use of two side of sparsity introduced by activations and weights, and overcome the above problems, this paper proposes an efficient sparse CNNs accelerator on FPGA to achieve the inference acceleration. We designed and implemented the accelerator on the Zynq UltraScale+ MPSoC ZCU102 evaluation board. By running AlexNet, VGG16 and ResNet50 networks on the accelerator to evaluated the peeformance. Experimental results show that the method proposed in this paper can achieve more than 97% reduction in collision rate and 2.35x improvement in computing performance and 9.37x improvement in energy efficiency.

引用

页码：504 / 505

页数：2

共 4 条

[1] An Efficient Hardware Accelerator for Sparse Convolutional Neural Networks on FPGAs [J].

Lu, Liqiang ;

Xie, Jiaming ;

Huang, Ruirui ;

Zhang, Jiansong ;

Lin, Wei ;

Liang, Yun .

2019 27TH IEEE ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2019, :17-25

[2]

Ning Li, 2020, ICCAI '20: Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence, P534, DOI 10.1145/3404555.3404626

[3] SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks [J].

Parashar, Angshuman ;

Rhu, Minsoo ;

Mukkara, Anurag ;

Puglielli, Antonio ;

Venkatesan, Rangharajan ;

Khailany, Brucek ;

Emer, Joel ;

Keckler, Stephen W. ;

Dally, William J. .

44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017, :27-40

[4] An Efficient Hardware Accelerator for Structured Sparse Convolutional Neural Networks on FPGAs [J].

Zhu, Chaoyang ;

Huang, Kejie ;

Yang, Shuyuan ;

Zhu, Ziqi ;

Zhang, Hejia ;

Shen, Haibin .

IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2020, 28 (09) :1953-1965

← 1 →