Cloud-Edge Collaborative Inference with Network Pruning

Cited by: 1
Authors
Li, Mingran [1]
Zhang, Xuejun [1, 2, 3]
Guo, Jiasheng [1]
Li, Feng [1]
Affiliations
[1] Guangxi Univ, Sch Comp & Elect & Informat, Nanning 530004, Peoples R China
[2] Guangxi Key Lab Multimedia Commun & Network Techno, Nanning 530004, Peoples R China
[3] Guangxi Big White & Little Black Robots Co Ltd, Nanning 530007, Peoples R China
Keywords
collaborative intelligence; network pruning; edge computing; cloud-edge collaborative computing
DOI
10.3390/electronics12173598
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
As the number of model parameters grows, deep neural networks (DNNs) have achieved remarkable performance in computer vision, but larger DNNs are a bottleneck for deployment on resource-constrained edge devices. Cloud-edge collaborative inference based on network pruning offers a way to deploy DNNs on edge devices. However, the pruning methods adopted by existing frameworks are only locally effective, and the compressed models are over-sparse. In this paper, we design a cloud-edge collaborative inference framework based on network pruning that makes full use of the limited computing resources on edge devices. Within the framework, we propose a sparsity-aware feature bias minimization pruning method that reduces the feature bias introduced during network pruning and prevents the pruned model from becoming over-sparse. To further reduce inference latency, we account for the difference in computing resources between edge devices and the cloud and design a task-oriented asymmetric feature coding scheme that reduces the communication overhead of transmitting intermediate data. Comprehensive experiments show that our framework reduces end-to-end latency by 82% to 84% with less than 1% accuracy loss compared with a cloud-edge collaborative inference framework using traditional methods, and that it achieves the lowest end-to-end latency and accuracy loss among the compared frameworks.
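The latency argument in the abstract can be illustrated with a back-of-the-envelope model. The sketch below is not the paper's method: it assumes end-to-end latency is simply edge compute time plus uplink transfer time plus cloud compute time, that pruning only shrinks the edge-side workload, and that feature coding only shrinks the transmitted bytes. `SplitPlan`, `end_to_end_latency`, `apply_compression`, and every number used are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class SplitPlan:
    """Hypothetical description of one cloud-edge split (illustrative only)."""
    edge_flops: float     # FLOPs executed on the edge device
    cloud_flops: float    # FLOPs executed in the cloud
    feature_bytes: float  # size of the intermediate feature sent to the cloud


def end_to_end_latency(plan, edge_flops_per_s, cloud_flops_per_s, uplink_bytes_per_s):
    """Assumed model: edge compute + transmission + cloud compute, in seconds."""
    edge_time = plan.edge_flops / edge_flops_per_s
    transfer_time = plan.feature_bytes / uplink_bytes_per_s
    cloud_time = plan.cloud_flops / cloud_flops_per_s
    return edge_time + transfer_time + cloud_time


def apply_compression(plan, keep_ratio, coding_ratio):
    """Assumption: pruning keeps `keep_ratio` of the edge FLOPs and
    feature coding keeps `coding_ratio` of the transmitted bytes."""
    return SplitPlan(
        edge_flops=plan.edge_flops * keep_ratio,
        cloud_flops=plan.cloud_flops,
        feature_bytes=plan.feature_bytes * coding_ratio,
    )


if __name__ == "__main__":
    # Made-up workload and hardware figures, chosen only to make the arithmetic visible.
    baseline = SplitPlan(edge_flops=2e9, cloud_flops=6e9, feature_bytes=1e6)
    compressed = apply_compression(baseline, keep_ratio=0.5, coding_ratio=0.1)
    for name, plan in (("baseline", baseline), ("compressed", compressed)):
        seconds = end_to_end_latency(plan, 1e10, 1e12, 1e6)
        print(f"{name}: {seconds:.3f} s")
```

With these invented figures the transfer term dominates the baseline, so shrinking the intermediate feature accounts for most of the reduction; on a faster uplink the edge compute term would matter more.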
Pages: 20
Related papers
  • [1] Sniper: Cloud-Edge Collaborative Inference Scheduling with Neural Network Similarity Modeling
    Liu, Weihong
    Geng, Jiawei
    Zhu, Zongwei
    Cao, Jing
    Lian, Zirui
    PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022: 505-510
  • [2] An Efficiency Evaluation Method for Cloud-Edge Collaborative Network
    Jin, Shen
    Qu, Qinghai
    Feng, Yuqing
    Zhang, Ningchi
    Cong, Lin
    Wang, Ying
    Yu, Peng
    IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021: 51-56
  • [3] Reliable Function Computation Offloading in Cloud-Edge Collaborative Network
    Li, Shaonan
    Xie, Yongqiang
    Li, Zhongbo
    Qi, Jin
    Tian, Yumeng
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT II, 2024, 14488: 433-451
  • [4] A collaborative cloud-edge computing framework in distributed neural network
    Xu, Shihao
    Zhang, Zhenjiang
    Kadoch, Michel
    Cheriet, Mohamed
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2020, 2020 (01)
  • [5] A collaborative cloud-edge computing framework in distributed neural network
    Shihao Xu
    Zhenjiang Zhang
    Michel Kadoch
    Mohamed Cheriet
    EURASIP Journal on Wireless Communications and Networking, 2020
  • [6] Cloud-Edge Collaborative Optimization Based on Distributed UAV Network
    Yang, Jian
    Tao, Jinyu
    Wang, Cheng
    Yang, Qinghai
    ELECTRONICS, 2024, 13 (18)
  • [7] From cloud manufacturing to cloud-edge collaborative manufacturing
    Guo, Liang
    He, Yunlong
    Wan, Changcheng
    Li, Yuantong
    Luo, Longkun
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2024, 90
  • [8] Network Resource Optimization with Latency Sensitivity in Collaborative Cloud-Edge Computing Networks
    Liu, Ling
    Ma, Weike
    Chen, Bowen
    Gao, Mingyi
    Chen, Hong
    Wu, Jinbing
    2020 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP) AND INTERNATIONAL CONFERENCE ON INFORMATION PHOTONICS AND OPTICAL COMMUNICATIONS (IPOC), 2020
  • [9] A Collaborative Cloud-Edge Approach for Robust Edge Workload Forecasting
    Li, Yanan
    Zhao, Penghong
    Ma, Xiao
    Yuan, Haitao
    Fu, Zhe
    Xu, Mengwei
    Wang, Shangguang
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (04): 2861-2875
  • [10] SGX Based Cloud-Edge Collaborative Secure Deduplication
    Wu, Jian
    Fu, Yinjin
    53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024: 112-113