Winograd Convolution for DNNs: Beyond Linear Polynomials

被引:5
作者
Barabasz, Barbara [1 ]
Gregg, David [1 ]
机构
[1] Trinity Coll Dublin, Sch Comp Sci & Stat, Dublin 2, Ireland
来源
ADVANCES IN ARTIFICIAL INTELLIGENCE, AI*IA 2019 | 2019年 / 11946卷
基金
爱尔兰科学基金会;
关键词
DNN; Convolution; Winograd convolution; Accuracy; Floating point; COMPLEXITY;
D O I
10.1007/978-3-030-35166-3_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Winograd convolution is widely used in deep neural networks (DNNs). Existing work for DNNs considers only the subset Winograd algorithms that are equivalent to Toom-Cook convolution. We investigate a wider range of Winograd algorithms for DNNs and show that these additional algorithms can significantly improve floating point (FP) accuracy in many cases. We present results for three FP formats: fp32, fp16 and bf16 (a truncated form of fp32) using 2000 inputs from the ImageNet dataset. We found that in fp16 this approach gives us up to 6.5 times better image recognition accuracy in one important case while maintaining the same number of elementwise multiplication operations in the innermost loop. In bf16 the convolution can be computed using 5% fewer innermost loop multiplications than with currently used Winograd algorithms while keeping the accuracy of image recognition the same as for direct convolution method.
引用
收藏
页码:307 / 320
页数:14
相关论文
共 13 条
[1]  
[Anonymous], 1997, Algorithms for Discrete Fourier Transforms and Convolution
[2]  
[Anonymous], 1980, ARITHMETIC COMPLEXIT, DOI 10.1137/1.9781611970364
[3]  
Barabasz B., 2018, ARXIV
[4]  
Biggs N., 2002, Discrete Mathematics
[5]  
Blahut RE, 2010, Fast algorithms for signal processing
[6]  
Cook S., 1966, Ph.D. thesis
[7]   Fast Algorithms for Convolutional Neural Networks [J].
Lavin, Andrew ;
Gray, Scott .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4013-4021
[8]  
Meng Lingchuan, 2019, ARXIV
[9]  
Selesnick I. W., 1994, 1994 IEEE International Symposium on Circuits and Systems (Cat. No.94CH3435-5), P449, DOI 10.1109/ISCAS.1994.408999
[10]  
Simonyan A., 2015, P INT C LEARN REPR A, P1