Cloud-Edge Collaborative Inference with Network Pruning

被引:1
|
作者
Li, Mingran [1 ]
Zhang, Xuejun [1 ,2 ,3 ]
Guo, Jiasheng [1 ]
Li, Feng [1 ]
机构
[1] Guangxi Univ, Sch Comp & Elect & Informat, Nanning 530004, Peoples R China
[2] Guangxi Key Lab Multimedia Commun & Network Techno, Nanning 530004, Peoples R China
[3] Guangxi Big White & Little Black Robots Co Ltd, Nanning 530007, Peoples R China
关键词
collaborative intelligence; network pruning; edge computing; cloud-edge collaborative computing;
D O I
10.3390/electronics12173598
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the increase in model parameters, deep neural networks (DNNs) have achieved remarkable performance in computer vision, but larger DNNs create a bottleneck for deploying DNNs on resource-constrained edge devices. The cloud-edge collaborative inference based on network pruning provides a solution for the deployment of DNNs on edge devices. However, the pruning methods adopted by existing frameworks are locally effective, and the compressed models are over-sparse. In this paper, we design a cloud-edge collaborative inference framework based on network pruning to make full use of the limited computing resources on edge devices. In our framework, we propose a sparsity-aware feature bias minimization pruning method to reduce the feature bias that happens during network pruning and prevent the pruned model from being over-sparse. To further reduce the inference latency, we consider the difference in computing resources between edge devices and the cloud, then design a task-oriented asymmetric feature coding to reduce the communication overhead of transmitting intermediate data. With comprehensive experiments, our framework can reduce end-to-end latency by 82% to 84% with less than 1% accuracy loss, compared to the cloud-edge collaborative inference framework with traditional methods, and our framework has the lowest end-to-end latency and accuracy loss compared to other frameworks.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Determine When and How to Perform Edge Rejuvenation Effectively for Cloud-Edge Collaborative System
    Liu, Zhuanzhuan
    Liu, Yiming
    Tan, Xueyong
    Liu, Jing
    2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024, 2024, : 1382 - 1387
  • [32] Vehicle edge server deployment based on reinforcement learning in cloud-edge collaborative environment
    Guo, Feiyan
    Tang, Bing
    Wang, Ying
    Luo, Xiaoqing
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (10): : 14539 - 14556
  • [33] Fine-grained resource adjustment of edge server in cloud-edge collaborative environment
    Peng, Yu
    Hao, Jia
    Chen, Yang
    Gan, Jianhou
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (06): : 7581 - 7598
  • [34] A Cloud-Edge Collaborative Framework for Adaptive Quality Prediction Modeling in IIoT
    Yuan, Xiaofeng
    Wang, Yichen
    Wang, Kai
    Ye, Lingjian
    Shen, Feifan
    Wang, Yalin
    Yang, Chunhua
    Gui, Weihua
    IEEE SENSORS JOURNAL, 2024, 24 (20) : 33656 - 33668
  • [35] DHP: CLOUD-EDGE COLLABORATIVE INTERNET FRAMEWORK FOR NUCLEAR POWER INDUSTRY
    Cheng, Minmin
    Jing, Yingang
    Xu, Kui
    Liu, Xianying
    PROCEEDINGS OF 2024 31ST INTERNATIONAL CONFERENCE ON NUCLEAR ENGINEERING, VOL 1, ICONE31 2024, 2024,
  • [36] Joint Communication and Computation Resource Allocation for Cloud-Edge Collaborative System
    Ren, Jinke
    He, Yinghui
    Yu, Guanding
    Li, Geoffrey Ye
    2019 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2019,
  • [37] Dynamic Load Combined Prediction Framework with Collaborative Cloud-Edge for Microgrid
    Hou, Wenjing
    Wen, Hong
    Zhang, Ning
    Lei, Wenxin
    Lin, Haojie
    IEEE INFOCOM 2022 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2022,
  • [38] Construction and Research on Cloud-edge Collaborative Power Measurement and Security Model
    Huang J.
    Sun Y.
    Jiang X.
    Huang Y.
    Zhou D.
    EAI Endorsed Transactions on Energy Web, 2024, 11 : 1 - 8
  • [39] An Efficient Algorithm for Microservice Placement in Cloud-Edge Collaborative Computing Environment
    He, Xiang
    Xu, Hanchuan
    Xu, Xiaofei
    Chen, Yin
    Wang, Zhongjie
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (05) : 1983 - 1997
  • [40] A cloud-edge collaborative task scheduling method based on model segmentation
    Chuanfu Zhang
    Jing Chen
    Wen Li
    Hao Sun
    Yudong Geng
    Tianxiang Zhang
    Mingchao Ji
    Tonglin Fu
    Journal of Cloud Computing, 13