Adaptive ResNet Architecture for Distributed Inference in Resource-Constrained IoT Systems

被引:4
作者
Khan, Fazeela Mazhar [1 ]
Baccour, Emna [1 ]
Erbad, Aiman [1 ]
Hamdi, Mounir [1 ]
机构
[1] Hamad Bin Khalifa Univ, Qatar Fdn, Coll Sci & Engn, Doha, Qatar
来源
2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC | 2023年
关键词
optimization; distributed inference; neural networks; resilience; ResNet;
D O I
10.1109/IWCMC58020.2023.10182881
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As deep neural networks continue to expand and become more complex, most edge devices are unable to handle their extensive processing requirements. Therefore, the concept of distributed inference is essential to distribute the neural network among a cluster of nodes. However, distribution may lead to additional energy consumption and dependency among devices that suffer from unstable transmission rates. Unstable transmission rates harm real-time performance of IoT devices causing low latency, high energy usage, and potential failures. Hence, for dynamic systems, it is necessary to have a resilient DNN with an adaptive architecture that can downsize as per the available resources. This paper presents an empirical study that identifies the connections in ResNet that can be dropped without significantly impacting the model's performance to enable distribution in case of resource shortage. Based on the results, a multi-objective optimization problem is formulated to minimize latency and maximize accuracy as per available resources. Our experiments demonstrate that an adaptive ResNet architecture can reduce shared data, energy consumption, and latency throughout the distribution while maintaining high accuracy.
引用
收藏
页码:1543 / 1549
页数:7
相关论文
共 12 条
  • [1] Pervasive AI for IoT Applications: A Survey on Resource-Efficient Distributed Artificial Intelligence
    Baccour, Emna
    Mhaisen, Naram
    Abdellatif, Alaa Awad
    Erbad, Aiman
    Mohamed, Amr
    Hamdi, Mounir
    Guizani, Mohsen
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2022, 24 (04): : 2366 - 2418
  • [2] RL-DistPrivacy: Privacy-Aware Distributed Deep Inference for Low Latency IoT Systems
    Baccour, Emna
    Erbad, Aiman
    Mohamed, Amr
    Hamdi, Mounir
    Guizani, Mohsen
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (04): : 2066 - 2083
  • [3] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [4] Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
    Jacob, Benoit
    Kligys, Skirmantas
    Chen, Bo
    Zhu, Menglong
    Tang, Matthew
    Howard, Andrew
    Adam, Hartwig
    Kalenichenko, Dmitry
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2704 - 2713
  • [5] Distributed CNN Inference on Resource-Constrained UAVs for Surveillance Systems: Design and Optimization
    Jouhari, Mohammed
    Al-Ali, Abdulla Khalid
    Baccour, Emna
    Mohamed, Amr
    Erbad, Aiman
    Guizani, Mohsen
    Hamdi, Mounir
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (02) : 1227 - 1242
  • [6] LeCun Yann, 2018, 2018 IEEECVF C COMPU, P2810
  • [7] Edge Intelligence: On-Demand Deep Learning Model Co-Inference with Device-Edge Synergy
    Li, En
    Zhou, Zhi
    Chen, Xu
    [J]. MECOMM'18: PROCEEDINGS OF THE 2018 WORKSHOP ON MOBILE EDGE COMMUNICATIONS, 2018, : 31 - 36
  • [8] EdgeEye - An Edge Service Framework for Real-time Intelligent Video Analytics
    Liu, Peng
    Qi, Bozhao
    Banerjee, Suman
    [J]. EDGESYS'18: PROCEEDINGS OF THE FIRST ACM INTERNATIONAL WORKSHOP ON EDGE SYSTEMS, ANALYTICS AND NETWORKING, 2018, : 1 - 6
  • [9] Molchanov P, 2017, Arxiv, DOI arXiv:1611.06440
  • [10] Park J, 2019, Arxiv, DOI arXiv:1812.02858