BlockDrop: Dynamic Inference Paths in Residual Networks

被引:280
作者
Wu, Zuxuan [1 ]
Nagarajan, Tushar [2 ]
Kumar, Abhishek [3 ]
Rennie, Steven [4 ]
Davis, Larry S. [1 ]
Grauman, Kristen [2 ]
Feris, Rogerio [3 ]
机构
[1] UMD, College Pk, MD 20742 USA
[2] UT Austin, Austin, TX USA
[3] IBM Res, Yorktown Hts, NY USA
[4] Fusemachines Inc, New York, NY USA
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
D O I
10.1109/CVPR.2018.00919
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Very deep convolutional neural networks offer excellent recognition results, yet their computational expense limits their impact for many real-world applications. We introduce BlockDrop, an approach that learns to dynamically choose which layers of a deep network to execute during inference so as to best reduce total computation without degrading prediction accuracy. Exploiting the robustness of Residual Networks (ResNets) to layer dropping, our framework selects on-the-fly which residual blocks to evaluate for a given novel image. In particular, given a pretrained ResNet, we train a policy network in an associative reinforcement learning setting for the dual reward of utilizing a minimal number of blocks while preserving recognition accuracy. We conduct extensive experiments on CIFAR and ImageNet. The results provide strong quantitative and qualitative evidence that these learned policies not only accelerate inference but also encode meaningful visual information. Built upon a ResNet-101 model, our method achieves a speedup of 20% on average, going as high as 36% for some images, while maintaining the same 76.4% top-1 accuracy on ImageNet.
引用
收藏
页码:8817 / 8826
页数:10
相关论文
共 57 条
  • [1] [Anonymous], 2017, ARXIV170100299
  • [2] [Anonymous], 2015, NIPS
  • [3] [Anonymous], 2016, NIPS
  • [4] [Anonymous], 2010, CVPR
  • [5] [Anonymous], 2016, BinaryNet: Training deep neural networks with weights and activa
  • [6] [Anonymous], SLCP
  • [7] [Anonymous], 2015, Very Deep Convolu- tional Networks for Large-Scale Image Recognition
  • [8] [Anonymous], 2017, ARXIV170603912
  • [9] [Anonymous], ICASSP
  • [10] [Anonymous], 2016, ICLR