Iterative neural networks for adaptive inference on resource-constrained devices

被引:0
|
作者
Sam Leroux
Tim Verbelen
Pieter Simoens
Bart Dhoedt
机构
[1] Ghent University,IDLab, Department of Information Technology
来源
关键词
Efficient deep neural networks; Inference on the edge; Adaptive computation; Resource-constrained deep learning;
D O I
暂无
中图分类号
学科分类号
摘要
The computational cost of evaluating a neural network usually only depends on design choices such as the number of layers or the number of units in each layer and not on the actual input. In this work, we build upon deep Residual Networks (ResNets) and use their properties to design a more efficient adaptive neural network building block. We propose a new architecture, which replaces the sequential layers with an iterative structure where weights are reused multiple times for a single input image, reducing the storage requirements drastically. In addition, we incorporate an adaptive computation module that allows the network to adjust its computational cost at run time for each input sample independently. We experimentally validate our models on image classification, object detection and semantic segmentation tasks and show that our models only use their full capacity for the hardest input samples and are more efficient on average.
引用
收藏
页码:10321 / 10336
页数:15
相关论文
共 50 条
  • [11] Secure Neural Network Inference as a Service with Resource-Constrained Clients
    de Vries, Rik
    Mann, Zoltan Adam
    16TH IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC 2023, 2023,
  • [12] Adaptive Sparse Deep Neural Network Inference on Resource-Constrained Cost-Efficient GPUs
    Dun, Ming
    Zhang, Xu
    Cao, Huawei
    Zhang, Yuan
    Huang, Junying
    Ye, Xiaochun
    2023 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE, HPEC, 2023,
  • [13] A Design Strategy for the Efficient Implementation of Random Basis Neural Networks on Resource-Constrained Devices
    Edoardo Ragusa
    Christian Gianoglio
    Rodolfo Zunino
    Paolo Gastaldo
    Neural Processing Letters, 2020, 51 : 1611 - 1629
  • [14] A Design Strategy for the Efficient Implementation of Random Basis Neural Networks on Resource-Constrained Devices
    Ragusa, Edoardo
    Gianoglio, Christian
    Zunino, Rodolfo
    Gastaldo, Paolo
    NEURAL PROCESSING LETTERS, 2020, 51 (02) : 1611 - 1629
  • [15] Fully Distributed Deep Learning Inference on Resource-Constrained Edge Devices
    Stahl, Rafael
    Zhao, Zhuoran
    Mueller-Gritschneder, Daniel
    Gerstlauer, Andreas
    Schlichtmann, Ulf
    EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, SAMOS 2019, 2019, 11733 : 77 - 90
  • [16] DeeperThings: Fully Distributed CNN Inference on Resource-Constrained Edge Devices
    Stahl, Rafael
    Hoffman, Alexander
    Mueller-Gritschneder, Daniel
    Gerstlauer, Andreas
    Schlichtmann, Ulf
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2021, 49 (04) : 600 - 624
  • [17] DeeperThings: Fully Distributed CNN Inference on Resource-Constrained Edge Devices
    Rafael Stahl
    Alexander Hoffman
    Daniel Mueller-Gritschneder
    Andreas Gerstlauer
    Ulf Schlichtmann
    International Journal of Parallel Programming, 2021, 49 : 600 - 624
  • [18] Neural Architecture Search for Resource-Constrained Internet of Things Devices
    Cardoso-Pereira, Isadora
    Lobo-Pappa, Gisele
    Ramos, Heitor S.
    26TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2021), 2021,
  • [19] Adaptive ResNet Architecture for Distributed Inference in Resource-Constrained IoT Systems
    Khan, Fazeela Mazhar
    Baccour, Emna
    Erbad, Aiman
    Hamdi, Mounir
    2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2023, : 1543 - 1549
  • [20] Breathing-Based Authentication on Resource-Constrained IoT Devices using Recurrent Neural Networks
    Chauhan, Jagmohan
    Seneviratne, Suranga
    Hu, Yining
    Misra, Archan
    Seneviratne, Aruna
    Lee, Youngki
    COMPUTER, 2018, 51 (05) : 60 - 67