BBNet: A Novel Convolutional Neural Network Structure in Edge-Cloud Collaborative Inference

被引:11
|
作者
Zhou, Hongbo [1 ,2 ]
Zhang, Weiwei [1 ,2 ]
Wang, Chengwei [1 ]
Ma, Xin [1 ,2 ]
Yu, Haoran [1 ,2 ]
机构
[1] Huaqiao Univ, Coll Engn, Quanzhou 362021, Peoples R China
[2] Fujian Prov Acad Engn Res Ctr Ind Intellectual Te, Quanzhou 362021, Peoples R China
关键词
collaborative intelligence; deep learning; model compression; feature compression; cloud computing; INTELLIGENCE;
D O I
10.3390/s21134494
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Edge-cloud collaborative inference can significantly reduce the delay of a deep neural network (DNN) by dividing the network between mobile edge and cloud. However, the in-layer data size of DNN is usually larger than the original data, so the communication time to send intermediate data to the cloud will also increase end-to-end latency. To cope with these challenges, this paper proposes a novel convolutional neural network structure-BBNet-that accelerates collaborative inference from two levels: (1) through channel-pruning: reducing the number of calculations and parameters of the original network; (2) through compressing the feature map at the split point to further reduce the size of the data transmitted. In addition, This paper implemented the BBNet structure based on NVIDIA Nano and the server. Compared with the original network, BBNet's FLOPs and parameter achieve up to 5.67x and 11.57x on the compression rate, respectively. In the best case, the feature compression layer can reach a bit-compression rate of 512x. Compared with the better bandwidth conditions, BBNet has a more obvious inference delay when the network conditions are poor. For example, when the upload bandwidth is only 20 kb/s, the end-to-end latency of BBNet is increased by 38.89x compared with the cloud-only approach.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Adaptive Distributed Convolutional Neural Network Inference at the Network Edge with ADCNN
    Zhang, Sai Qian
    Lin, Jieyu
    Zhang, Qi
    PROCEEDINGS OF THE 49TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2020, 2020,
  • [22] Collaborative DNNs Inference with Joint Model Partition and Compression in Mobile Edge-Cloud Computing Networks
    Tang, Yaxin
    Li, Xiuhua
    Li, Hui
    Yang, Zhengyi
    Wang, Xiaofei
    Leung, Victor C. M.
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [23] EINS: Edge-Cloud Deep Model Inference with Network-Efficiency Schedule in Serverless
    Peng, Shijie
    Lin, Yanying
    Chen, Wenyan
    Tang, Yingfei
    Duan, Xu
    Ye, Kejiang
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 1376 - 1381
  • [24] Efficient Computation Offloading for Edge-cloud Collaborative Networks
    Yu, Bocheng
    Zhang, Xingjun
    Wang, Juzhen
    Lei, Ming
    2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,
  • [25] Edge-Cloud Collaborative Computation Offloading for Mixed Traffic
    Li, Qirui
    Guo, Mian
    Peng, Zhiping
    Cui, Delong
    He, Jieguang
    IEEE SYSTEMS JOURNAL, 2023, 17 (03): : 5023 - 5034
  • [26] Edge-cloud Collaborative Learning with Federated and Centralized Features
    Li, Zexi
    Li, Qunwei
    Zhou, Yi
    Zhong, Wenliang
    Zhang, Guannan
    Wu, Chao
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 1949 - 1953
  • [27] A SLAM Algorithm Based on Edge-Cloud Collaborative Computing
    Lv, Taizhi
    Zhang, Juan
    Chen, Yong
    JOURNAL OF SENSORS, 2022, 2022
  • [28] Neural quantile optimization for edge-cloud networking☆ ☆
    Du, Bin
    Zhang, He
    Cheng, Xiangle
    Zhang, Lei
    COMPUTER NETWORKS, 2024, 253
  • [29] Network Security Constrained Distributed Smart Grid Edge-Cloud Collaborative Optimization Scheduling
    Pan, Xi'an
    Ai, Xin
    Hu, Junjie
    Wang, Kunyu
    Wang, Haoyang
    Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 2024, 39 (19): : 6104 - 6118
  • [30] Collaborative Learning-Based Scheduling for Kubernetes-Oriented Edge-Cloud Network
    Shen, Shihao
    Han, Yiwen
    Wang, Xiaofei
    Wang, Shiqiang
    Leung, Victor C. M.
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (06) : 2950 - 2964