Towards Resource-aware DNN Partitioning for Edge Devices with Heterogeneous Resources

被引:4
|
作者
Zawish, Muhammad [1 ]
Abraham, Lizy [1 ]
Dev, Kapal [2 ]
Davy, Steven [1 ]
机构
[1] South East Technol Univ, Walton Inst Informat & Commun Syst Sci, Carlow, Ireland
[2] Munster Technol Univ, Dept Comp Sci, Cork, Ireland
基金
爱尔兰科学基金会;
关键词
Edge intelligence; Collaborative DNN inference; Resource efficiency; Internet of things;
D O I
10.1109/GLOBECOM48099.2022.10000839
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Collaborative deep neural network (DNN) inference over edge and cloud is emerging as an effective approach for enabling several Internet of Things (IoT) applications. Edge devices are mainly resource-constrained and hence can not afford the computational complexity manifested by DNNs. Thereby, researchers have resorted to a collaborative computing approach, where a DNN is partitioned between edge and cloud. Recent art on DNN partitioning has either focused on bandwidth-specific partitioning or relied on offline benchmarking of DNN layers. However, edge devices are inherently heterogeneous and possess inconsistent levels and types of resources. Therefore, in this work, we propose a resource-aware partitioning of DNNs for accelerating collaborative inference over edge-cloud. The proposed approach provides the flexibility of partitioning a DNN with respect to the available nature and scale of resources for a certain edge device. Unlike state-of-the-art, we exploit different types of DNN complexities for partitioning them on heterogeneous edge devices. For example, in a bandwidth-constrained scenario, our approach gained 40% efficiency as compared to the offline benchmarking approach. Therefore, given the different nature of edge devices' computational, storage, and energy requirements, this approach provides a suitable configuration for edge-cloud synergetic inference.
引用
收藏
页码:5649 / 5655
页数:7
相关论文
共 50 条
  • [1] Resource-Aware DNN Partitioning for Privacy-Sensitive Edge-Cloud Systems
    Ding, Aolin
    Hass, Amin
    Chan, Matthew
    Sehatbakhsh, Nader
    Zonouz, Saman
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT V, 2024, 14451 : 188 - 201
  • [2] On-the-fly Resource-Aware Model Aggregation for Federated Learning in Heterogeneous Edge
    Nguyen, Hung T.
    Morabito, Roberto
    Kim, Kwang Taik
    Chiang, Mung
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [3] Resource-Aware Workload Orchestration for Edge Computing
    Babirye, Susan
    Serugunda, Jonathan
    Okello, Dorothy
    Mwanje, Stephen
    2020 28TH TELECOMMUNICATIONS FORUM (TELFOR), 2020, : 117 - 120
  • [4] Resource-aware scientific computation on a heterogeneous cluster
    Teresco, JD
    Faik, J
    Flaherty, JE
    COMPUTING IN SCIENCE & ENGINEERING, 2005, 7 (02) : 40 - 50
  • [5] Multi-resource aware partitioning algorithms for FPGAs with heterogeneous resources
    Selvakkumaran, N
    Ranjan, A
    Raje, S
    Karypis, G
    41ST DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2004, 2004, : 741 - 746
  • [6] Multilayer Resource-aware Partitioning for Fog Application Placement
    Samani, Zahra Najafabadi
    Saurabh, Nishant
    Prodan, Radu
    5TH IEEE INTERNATIONAL CONFERENCE ON FOG AND EDGE COMPUTING (ICFEC 2021), 2021, : 9 - 18
  • [7] CoEdge: Cooperative DNN Inference With Adaptive Workload Partitioning Over Heterogeneous Edge Devices
    Zeng, Liekang
    Chen, Xu
    Zhou, Zhi
    Yang, Lei
    Zhang, Junshan
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2021, 29 (02) : 595 - 608
  • [8] Towards Resource-Aware Hybrid Camera Systems
    Schranz, Melanie
    Andre, Torsten
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERAS (ICDSC'18), 2018,
  • [9] Resource-Aware Feature Extraction in Mobile Edge Computing
    Ding, Chuntao
    Zhou, Ao
    Liu, Xiulong
    Ma, Xiao
    Wang, Shangguang
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (01) : 321 - 331
  • [10] Resource-Aware Edge-Based Stream Analytics
    Petri, Ioan
    Chirila, Ioan
    Gomes, Heitor Murilo
    Bifet, Albert
    Rana, Omer F.
    IEEE INTERNET COMPUTING, 2022, 26 (04) : 79 - 88