Fast Algorithms for Convolutional Neural Networks

被引:561
作者
Lavin, Andrew [1 ]
Gray, Scott [1 ]
机构
[1] Nervana Syst, San Diego, CA 92121 USA
来源
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016年
关键词
D O I
10.1109/CVPR.2016.435
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep convolutional neural networks take GPU-days of computation to train on large data sets. Pedestrian detection for self driving cars requires very low latency. Image recognition for mobile phones is constrained by limited processing resources. The success of convolutional neural networks in these situations is limited by how fast we can compute them. Conventional FFT based convolution is fast for large filters, but state of the art convolutional neural networks use small, 3 x 3 filters. We introduce a new class of fast algorithms for convolutional neural networks using Winograd's minimal filtering algorithms. The algorithms compute minimal complexity convolution over small tiles, which makes them fast with small filters and small batch sizes. We benchmark a GPU implementation of our algorithm with the VGG network and show state of the art throughput at batch sizes from 1 to 64.
引用
收藏
页码:4013 / 4021
页数:9
相关论文
共 50 条
[41]   Complex Convolutional Neural Networks for Fast Diverging Wave Imaging [J].
Lu, Jingfeng ;
Millioz, Fabien ;
Garcia, Damien ;
Salles, Sebastien ;
Ye, Dong ;
Friboulet, Denis .
PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2020,
[42]   Convolutional Neural Networks and Regression Algorithms Supporting Buildings Facility Management [J].
Matos, Raquel ;
Rodrigues, Hugo ;
Costa, Anibal ;
Rodrigues, Fernanda .
BUILDINGS, 2023, 13 (11)
[43]   Exploring Heterogeneous Algorithms for Accelerating Deep Convolutional Neural Networks on FPGAs [J].
Xiao, Qincheng ;
Liang, Yun ;
Lu, Liqiang ;
Yan, Shengen ;
Tai, Yu-Wing .
PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,
[44]   Hyperparameter Optimization for Convolutional Neural Networks with Genetic Algorithms and Bayesian Optimization [J].
Puentes G, David E. ;
Barrios H, Carlos J. ;
Navaux, Philippe O. A. .
2022 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2022, :131-135
[45]   Efficient training algorithms for a class of shunting inhibitory convolutional neural networks [J].
Tivive, FHC ;
Bouzerdoum, A .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (03) :541-556
[46]   Adaptive Integer Quantisation for Convolutional Neural Networks through Evolutionary Algorithms [J].
Wang, Ziwei ;
Trefzer, Martin A. ;
Bale, Simon J. ;
Tyrrell, Andy M. .
2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
[47]   Human action recognition using genetic algorithms and convolutional neural networks [J].
Ijjina, Earnest Paul ;
Chalavadi, Krishna Mohan .
PATTERN RECOGNITION, 2016, 59 :199-212
[48]   OPTIMAL FILTERING ALGORITHMS FOR FAST LEARNING IN FEEDFORWARD NEURAL NETWORKS [J].
SHAH, S ;
PALMIERI, F ;
DATUM, M .
NEURAL NETWORKS, 1992, 5 (05) :779-787
[49]   A fast magnitude estimation method based on deep convolutional neural networks [J].
Wang ZiFa ;
Liao JiAn ;
Wang YanWei ;
Wei DongLiang ;
Zhao DengKe .
CHINESE JOURNAL OF GEOPHYSICS-CHINESE EDITION, 2023, 66 (01) :272-288
[50]   Ridiculously Fast Shot Boundary Detection with Fully Convolutional Neural Networks [J].
Gygli, Michael .
2018 16TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2018,