Iterative neural networks for adaptive inference on resource-constrained devices

被引:5
|
作者
Leroux, Sam [1 ]
Verbelen, Tim [1 ]
Simoens, Pieter [1 ]
Dhoedt, Bart [1 ]
机构
[1] Univ Ghent, Dept Informat Technol, IDLab, Technol Pk Zwijnaarde 126, B-9052 Ghent, Belgium
来源
NEURAL COMPUTING & APPLICATIONS | 2022年 / 34卷 / 13期
关键词
Efficient deep neural networks; Inference on the edge; Adaptive computation; Resource-constrained deep learning; INTERNET;
D O I
10.1007/s00521-022-06910-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The computational cost of evaluating a neural network usually only depends on design choices such as the number of layers or the number of units in each layer and not on the actual input. In this work, we build upon deep Residual Networks (ResNets) and use their properties to design a more efficient adaptive neural network building block. We propose a new architecture, which replaces the sequential layers with an iterative structure where weights are reused multiple times for a single input image, reducing the storage requirements drastically. In addition, we incorporate an adaptive computation module that allows the network to adjust its computational cost at run time for each input sample independently. We experimentally validate our models on image classification, object detection and semantic segmentation tasks and show that our models only use their full capacity for the hardest input samples and are more efficient on average.
引用
收藏
页码:10321 / 10336
页数:16
相关论文
共 50 条
  • [41] Improving training datasets for resource-constrained speaker recognition neural networks
    Bousquet, Pierre-Michel
    Rouvier, Mickael
    INTERSPEECH 2023, 2023, : 3167 - 3171
  • [42] Squeezing Accumulators in Binary Neural Networks for Extremely Resource-Constrained Applications
    Azamat, Azat
    Park, Jaewoo
    Lee, Jongeun
    2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2022,
  • [43] DeepThings: Distributed Adaptive Deep Learning Inference on Resource-Constrained IoT Edge Clusters
    Zhao, Zhuoran
    Barijough, Kamyar Mirzazad
    Gerstlauer, Andreas
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (11) : 2348 - 2359
  • [44] One-Shot Sparse Neural Architecture Search for Resource-Constrained Devices
    Song, Shenghui
    Zaech, Jan-Nico
    Heo, Seonyeong
    2024 IEEE 30TH INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS, RTCSA 2024, 2024, : 132 - 133
  • [45] Spatially Invariant Convolutional Spiking Neural Network For Resource-Constrained IoT Devices
    Yadav, Chetali
    Reniwal, Bhupendra Singh
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025, : 3005 - 3026
  • [46] LiteNet: Lightweight Neural Network for Detecting Arrhythmias at Resource-Constrained Mobile Devices
    He, Ziyang
    Zhang, Xiaoqing
    Cao, Yangjie
    Liu, Zhi
    Zhang, Bo
    Wang, Xiaoyan
    SENSORS, 2018, 18 (04)
  • [47] FedDCT: Federated Learning of Large Convolutional Neural Networks on Resource-Constrained Devices Using Divide and Collaborative Training
    Nguyen, Quan
    Pham, Hieu H.
    Wong, Kok-Seng
    Nguyen, Phi Le
    Nguyen, Truong Thao
    Do, Minh N.
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (01): : 418 - 436
  • [48] Resource-adaptive and OOD-robust inference of deep neural networks on IoT devices
    Robertson, Cailen
    Tong, Ngoc Anh
    Nguyen, Thanh Toan
    Nguyen, Quoc Viet Hung
    Jo, Jun
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2025, 10 (01) : 115 - 133
  • [49] Encoding semantic awareness in resource-constrained devices
    Preuveneers, Davy
    Berbers, Yolande
    IEEE INTELLIGENT SYSTEMS, 2008, 23 (02) : 26 - 33
  • [50] SmartDedup: Optimizing Deduplication for Resource-constrained Devices
    Yang, Qirui
    Jin, Runyu
    Zhao, Ming
    PROCEEDINGS OF THE 2019 USENIX ANNUAL TECHNICAL CONFERENCE, 2019, : 633 - 646