Cloud-Edge Collaborative Inference with Network Pruning

Cited by: 1
Authors
Li, Mingran [1]
Zhang, Xuejun [1, 2, 3]
Guo, Jiasheng [1]
Li, Feng [1]
Affiliations
[1] Guangxi Univ, Sch Comp & Elect & Informat, Nanning 530004, Peoples R China
[2] Guangxi Key Lab Multimedia Commun & Network Techno, Nanning 530004, Peoples R China
[3] Guangxi Big White & Little Black Robots Co Ltd, Nanning 530007, Peoples R China
Keywords
collaborative intelligence; network pruning; edge computing; cloud-edge collaborative computing
DOI
10.3390/electronics12173598
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
As parameter counts have grown, deep neural networks (DNNs) have achieved remarkable performance in computer vision, but larger models are difficult to deploy on resource-constrained edge devices. Cloud-edge collaborative inference based on network pruning offers one solution. However, the pruning methods adopted by existing frameworks are only locally effective, and the compressed models are over-sparse. In this paper, we design a cloud-edge collaborative inference framework based on network pruning that makes full use of the limited computing resources on edge devices. Within this framework, we propose a sparsity-aware feature bias minimization pruning method that reduces the feature bias introduced during pruning and prevents the pruned model from becoming over-sparse. To further reduce inference latency, we account for the difference in computing resources between edge devices and the cloud and design a task-oriented asymmetric feature coding scheme that reduces the communication overhead of transmitting intermediate data. In comprehensive experiments, our framework reduces end-to-end latency by 82% to 84% with less than 1% accuracy loss compared with a cloud-edge collaborative inference framework built on traditional methods, and it achieves the lowest end-to-end latency and accuracy loss among the frameworks compared.
Pages: 20