Many convolutional neural network (CNN) accelerators are proposed to exploit the sparsity of the networks recently to enjoy the benefits of both computation and memory reduction. However, most accelerators cannot exploit the sparsity of both activations and weights. For those works that exploit both sparsity opportunities, they cannot achieve the stable load balance through a static scheduling (SS) strategy, which is vulnerable to the sparsity distribution. In this work, a balanced compressed sparse row format and a dynamic scheduling strategy are proposed to improve the load balance. A set-associate structure is also presented to tradeoff the load balance and hardware resource overhead. We propose SWM to accelerate the CNN inference, which supports both sparse convolution and sparse fully connected (FC) layers. SWM provides Winograd adaptability for large convolution kernels and supports both 16-bit and 8-bit quantized CNNs. Due to the activation sharing, 8-bit processing can achieve theoretically twice the performance of the 16-bit processing with the same sparsity. The architecture is evaluated with VGG16 and ResNet50, which achieves: at most 7.6 TOP/s for sparse-Winograd convolution and three TOP/s for sparse matrix multiplication with 16-bit quantization on Xilinx VCU1525 platform. SWM can process 310/725 images per second for VGG16/ResNet50 with 16-bit quantization. Compared with the state-of-the-art works, our design can achieve at least 1.53x speedup and 1.8x energy efficiency improvement.
机构:
Xi An Jiao Tong Univ, Sch Microelect, Xian 710049, Shaanxi, Peoples R ChinaXi An Jiao Tong Univ, Sch Microelect, Xian 710049, Shaanxi, Peoples R China
Meng, Yishuo
Yang, Chen
论文数: 0引用数: 0
h-index: 0
机构:
Xi An Jiao Tong Univ, Sch Microelect, Xian 710049, Shaanxi, Peoples R ChinaXi An Jiao Tong Univ, Sch Microelect, Xian 710049, Shaanxi, Peoples R China
Yang, Chen
Xiang, Siwei
论文数: 0引用数: 0
h-index: 0
机构:
Xi An Jiao Tong Univ, Sch Microelect, Xian 710049, Shaanxi, Peoples R ChinaXi An Jiao Tong Univ, Sch Microelect, Xian 710049, Shaanxi, Peoples R China
Xiang, Siwei
Wang, Jianfei
论文数: 0引用数: 0
h-index: 0
机构:
Xi An Jiao Tong Univ, Sch Microelect, Xian 710049, Shaanxi, Peoples R ChinaXi An Jiao Tong Univ, Sch Microelect, Xian 710049, Shaanxi, Peoples R China
Wang, Jianfei
Mei, Kuizhi
论文数: 0引用数: 0
h-index: 0
机构:
Xi An Jiao Tong Univ, Sch Microelect, Xian 710049, Shaanxi, Peoples R ChinaXi An Jiao Tong Univ, Sch Microelect, Xian 710049, Shaanxi, Peoples R China
Mei, Kuizhi
Geng, Li
论文数: 0引用数: 0
h-index: 0
机构:
Xi An Jiao Tong Univ, Sch Microelect, Xian 710049, Shaanxi, Peoples R ChinaXi An Jiao Tong Univ, Sch Microelect, Xian 710049, Shaanxi, Peoples R China
机构:
Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
Chinese Acad Sci, Inst Automat, Brain Inspired Cognit Intelligence Lab, Beijing 100190, Peoples R ChinaUniv Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
Li, Jindong
Shen, Guobin
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Automat, Brain Inspired Cognit Intelligence Lab, Beijing 100190, Peoples R China
Univ Chinese Acad Sci, Sch Future Technol, Beijing 100049, Peoples R ChinaUniv Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
Shen, Guobin
Zhao, Dongcheng
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Automat, Brain Inspired Cognit Intelligence Lab, Beijing 100190, Peoples R ChinaUniv Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
Zhao, Dongcheng
Zhang, Qian
论文数: 0引用数: 0
h-index: 0
机构:
Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
Chinese Acad Sci, Inst Automat, Brain Inspired Cognit Intelligence Lab, Beijing 100190, Peoples R ChinaUniv Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
Zhang, Qian
Zeng, Yi
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Automat, Brain Inspired Cognit Intelligence Lab, Beijing 100190, Peoples R China
Univ Chinese Acad Sci, Beijing 100049, Peoples R China
Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Shanghai 200031, Peoples R ChinaUniv Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
机构:
Zhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R ChinaZhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R China
Fang, Ruyi
Liang, Chu
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R ChinaZhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R China
Liang, Chu
Xia, Yang
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R ChinaZhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R China
Xia, Yang
Xiao, Zhen
论文数: 0引用数: 0
h-index: 0
机构:
China Jiliang Univ, Coll Mat Sci & Engn, Hangzhou 310018, Zhejiang, Peoples R ChinaZhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R China
Xiao, Zhen
Huang, Hui
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R ChinaZhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R China
Huang, Hui
Gan, Yongping
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R ChinaZhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R China
Gan, Yongping
Zhang, Jun
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R ChinaZhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R China
Zhang, Jun
Tao, Xinyong
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R ChinaZhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R China
Tao, Xinyong
Zhang, Wenkui
论文数: 0引用数: 0
h-index: 0
机构:
Zhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R ChinaZhejiang Univ Technol, Coll Mat Sci & Engn, Hangzhou 310014, Zhejiang, Peoples R China