Cloud-Edge Collaborative Inference with Network Pruning

Cited by: 1
Authors
Li, Mingran [1]
Zhang, Xuejun [1, 2, 3]
Guo, Jiasheng [1]
Li, Feng [1]
Affiliations
[1] Guangxi Univ, Sch Comp & Elect & Informat, Nanning 530004, Peoples R China
[2] Guangxi Key Lab Multimedia Commun & Network Techno, Nanning 530004, Peoples R China
[3] Guangxi Big White & Little Black Robots Co Ltd, Nanning 530007, Peoples R China
Keywords
collaborative intelligence; network pruning; edge computing; cloud-edge collaborative computing
DOI
10.3390/electronics12173598
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
As their parameter counts have grown, deep neural networks (DNNs) have achieved remarkable performance in computer vision, but larger DNNs are a bottleneck for deployment on resource-constrained edge devices. Cloud-edge collaborative inference based on network pruning offers a way to deploy DNNs on edge devices. However, the pruning methods adopted by existing frameworks are only locally effective, and the compressed models are over-sparse. In this paper, we design a cloud-edge collaborative inference framework based on network pruning that makes full use of the limited computing resources on edge devices. Within the framework, we propose a sparsity-aware feature bias minimization pruning method that reduces the feature bias introduced during network pruning and prevents the pruned model from becoming over-sparse. To further reduce inference latency, we account for the difference in computing resources between edge devices and the cloud and design a task-oriented asymmetric feature coding scheme that reduces the communication overhead of transmitting intermediate data. In comprehensive experiments, our framework reduces end-to-end latency by 82% to 84% with less than 1% accuracy loss compared with a cloud-edge collaborative inference framework built on traditional methods, and it achieves the lowest end-to-end latency and accuracy loss among the frameworks compared.
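To make the latency argument in the abstract concrete, the following is a minimal sketch of how end-to-end latency in a split (cloud-edge) inference pipeline can be modeled as edge compute time plus transmission time plus cloud compute time. It is not the authors' method: the `SplitPlan` type, the function names, and every numeric value are hypothetical, chosen only to show why pruning the edge-side model and compressing the intermediate feature both shorten the total.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SplitPlan:
    """Hypothetical description of one cloud-edge split of a DNN."""
    edge_flops: float     # FLOPs executed on the edge device
    cloud_flops: float    # FLOPs executed in the cloud
    feature_bytes: float  # size of the intermediate feature sent uplink


def end_to_end_latency(plan, edge_flops_per_s, cloud_flops_per_s,
                       uplink_bytes_per_s):
    """Latency in seconds: edge compute + transmission + cloud compute."""
    edge_time = plan.edge_flops / edge_flops_per_s
    transmit_time = plan.feature_bytes / uplink_bytes_per_s
    cloud_time = plan.cloud_flops / cloud_flops_per_s
    return edge_time + transmit_time + cloud_time


def prune_and_encode(plan, keep_ratio, compression_ratio):
    """Return a new plan after an idealized pruning and feature-coding step.

    keep_ratio: fraction of edge-side FLOPs remaining after pruning (0 < r <= 1).
    compression_ratio: factor by which coding shrinks the feature (>= 1).
    Both are assumed inputs; real pruning and coding also affect accuracy,
    which this sketch does not model.
    """
    if not 0.0 < keep_ratio <= 1.0:
        raise ValueError("keep_ratio must be in (0, 1]")
    if compression_ratio < 1.0:
        raise ValueError("compression_ratio must be >= 1")
    return SplitPlan(
        edge_flops=plan.edge_flops * keep_ratio,
        cloud_flops=plan.cloud_flops,
        feature_bytes=plan.feature_bytes / compression_ratio,
    )


if __name__ == "__main__":
    # Illustrative numbers only, not measurements from the paper.
    baseline = SplitPlan(edge_flops=2e9, cloud_flops=6e9, feature_bytes=1e6)
    optimized = prune_and_encode(baseline, keep_ratio=0.5,
                                 compression_ratio=4.0)
    for name, plan in (("baseline", baseline), ("optimized", optimized)):
        seconds = end_to_end_latency(plan, edge_flops_per_s=1e10,
                                     cloud_flops_per_s=1e12,
                                     uplink_bytes_per_s=1e6)
        print(f"{name}: {seconds:.3f} s")
```

In this toy model the cloud term is small because the cloud is assumed to be much faster than the edge device, so most of the savings come from the edge compute and uplink terms, which is the part of the pipeline that pruning and feature coding target.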
Pages: 20