BitFlow: Exploiting Vector Parallelism for Binary Neural Networks on CPU

被引:26
作者
Hu, Yuwei [1 ]
Zhai, Jidong [2 ]
Li, Dinghua [1 ]
Gong, Yifan [1 ]
Zhu, Yuhao [3 ]
Liu, Wei [1 ]
Su, Lei [1 ]
Jin, Jiangming [1 ]
机构
[1] TuSimple Inc, Beijing, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci, Beijing, Peoples R China
[3] Univ Rochester, Dept Comp Sci, Rochester, NY 14627 USA
来源
2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS) | 2018年
关键词
D O I
10.1109/IPDPS.2018.00034
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning has revolutionized computer vision and other fields since its big bang in 2012. However, it is challenging to deploy Deep Neural Networks (DNNs) into real-world applications due to their high computational complexity. Binary Neural Networks (BNNs) dramatically reduce computational complexity by replacing most arithmetic operations with bitwise operations. Existing implementations of BNNs have been focusing on GPU or FPGA, and using the conventional image-to-column method that doesn't perform well for binary convolution due to low arithmetic intensity and unfriendly pattern for bitwise operations. We propose BitFlow, a gemm-operator-network three-level optimization framework for fully exploiting the computing power of BNNs on CPU. BitFlow features a new class of algorithm named PressedConv for efficient binary convolution using locality-aware layout and vector parallelism. We evaluate BitFlow with the VGG network. On a single core of Intel Xeon Phi, BitFlow obtains 1.8x speedup over unoptimized BNN implementations, and 11.5x speedup over counterpart full-precision DNNs. Over 64 cores, BitFlow enables BNNs to run 1.1x faster than counterpart full-precision DNNs on GPU (GTX 1080).
引用
收藏
页码:244 / 253
页数:10
相关论文
共 33 条
[1]  
[Anonymous], ARXIV E PRINTS
[2]  
[Anonymous], 2016, BinaryNet: Training deep neural networks with weights and activa
[3]  
[Anonymous], 2014, ARXIV14062989
[4]  
[Anonymous], 2017, COMMUNICATION
[5]  
[Anonymous], 2015, Adv. Neural Inform. Process. Syst.
[6]   High Prevalence of Assisted Injection Among Street-Involved Youth in a Canadian Setting [J].
Cheng, Tessa ;
Kerr, Thomas ;
Small, Will ;
Dong, Huiru ;
Montaner, Julio ;
Wood, Evan ;
DeBeck, Kora .
AIDS AND BEHAVIOR, 2016, 20 (02) :377-384
[7]  
Cire\csan D.C., 2011, P 22 INT JOINT C ART, VTwo, P1237
[8]  
Courbariaux Matthieu, 2015, CoRR
[9]  
Courbariaux Matthieu., 2014, CoRR
[10]   Improving the arithmetic intensity of multigrid with the help of polynomial smoothers [J].
Ghysels, P. ;
Klosiewicz, P. ;
Vanroose, W. .
NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2012, 19 (02) :253-267