BALANCED BINARY NEURAL NETWORKS WITH GATED RESIDUAL

Cited by: 22
Authors
Shen, Mingzhu [1]
Liu, Xianglong [1]
Gong, Ruihao [1]
Han, Kai [2, 3]
Affiliations
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China
[2] Chinese Acad Sci, State Key Lab Comp Sci, Institute Software, Beijing, Peoples R China
[3] UCAS, Beijing, Peoples R China
Source
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020
Keywords
model compression; binary neural networks; energy-efficient models;
DOI
10.1109/icassp40776.2020.9054599
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
Binary neural networks have attracted considerable attention in recent years. However, preserving network accuracy remains a critical issue, mainly because of the information loss caused by biased binarization. In this paper, we attempt to maintain the information propagated in the forward process and propose Balanced Binary Neural Networks with Gated Residual (BBG for short). First, a weight-balanced binarization is introduced so that the informative binary weights can capture more of the information contained in the activations. Second, for binary activations, a gated residual is appended to compensate for their information loss during the forward process, with only slight overhead. Both techniques can be wrapped as a generic network module that supports various network architectures for different tasks, including classification and detection. Experimental results show that BBG-Net performs remarkably well across network architectures such as VGG, ResNet and SSD, outperforming state-of-the-art methods.
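The two ideas named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes that "balanced" means centering each filter's weights to zero mean before taking the sign, and that the gated residual is a learnable per-channel scale applied to the full-precision input and added to the binary layer's output. The function names `balanced_binarize` and `gated_residual` are hypothetical.

```python
import numpy as np


def balanced_binarize(w):
    """Binarize weights after centering each output filter (assumed scheme).

    Centering to zero mean pushes each filter toward an even split of
    +1 and -1 values instead of a sign pattern biased to one side.
    """
    flat = w.reshape(w.shape[0], -1)
    centered = flat - flat.mean(axis=1, keepdims=True)
    binary = np.where(centered >= 0, 1.0, -1.0)
    return binary.reshape(w.shape)


def gated_residual(x, binary_out, gate):
    """Add a channel-wise gated copy of the full-precision input (assumed form).

    x and binary_out have shape (N, C, H, W); gate has shape (C,).
    """
    return binary_out + gate.reshape(1, -1, 1, 1) * x
```

For example, a filter whose weights are all positive, such as `[0.5, 0.6, 0.9, 1.0]`, becomes all +1 under a plain sign function, while the centered version splits evenly into two -1 and two +1 values.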
Pages: 4197-4201
Page count: 5