BBNet: A Novel Convolutional Neural Network Structure in Edge-Cloud Collaborative Inference

被引:11
|
作者
Zhou, Hongbo [1 ,2 ]
Zhang, Weiwei [1 ,2 ]
Wang, Chengwei [1 ]
Ma, Xin [1 ,2 ]
Yu, Haoran [1 ,2 ]
机构
[1] Huaqiao Univ, Coll Engn, Quanzhou 362021, Peoples R China
[2] Fujian Prov Acad Engn Res Ctr Ind Intellectual Te, Quanzhou 362021, Peoples R China
关键词
collaborative intelligence; deep learning; model compression; feature compression; cloud computing; INTELLIGENCE;
D O I
10.3390/s21134494
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Edge-cloud collaborative inference can significantly reduce the delay of a deep neural network (DNN) by dividing the network between mobile edge and cloud. However, the in-layer data size of DNN is usually larger than the original data, so the communication time to send intermediate data to the cloud will also increase end-to-end latency. To cope with these challenges, this paper proposes a novel convolutional neural network structure-BBNet-that accelerates collaborative inference from two levels: (1) through channel-pruning: reducing the number of calculations and parameters of the original network; (2) through compressing the feature map at the split point to further reduce the size of the data transmitted. In addition, This paper implemented the BBNet structure based on NVIDIA Nano and the server. Compared with the original network, BBNet's FLOPs and parameter achieve up to 5.67x and 11.57x on the compression rate, respectively. In the best case, the feature compression layer can reach a bit-compression rate of 512x. Compared with the better bandwidth conditions, BBNet has a more obvious inference delay when the network conditions are poor. For example, when the upload bandwidth is only 20 kb/s, the end-to-end latency of BBNet is increased by 38.89x compared with the cloud-only approach.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Optimizing Face Recognition Inference with a Collaborative Edge-Cloud Network
    Oroceo, Paul P.
    Kim, Jeong-In
    Caliwag, Ej Miguel Francisco
    Kim, Sang-Ho
    Lim, Wansu
    SENSORS, 2022, 22 (21)
  • [2] DeepSplit: Dynamic Splitting of Collaborative Edge-Cloud Convolutional Neural Networks
    Mehta, Rishabh
    Shorey, Rajeev
    2020 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2020,
  • [3] Hybrid SLM and LLM for Edge-Cloud Collaborative Inference
    Hao, Zixu
    Jiang, Huiqiang
    Jiang, Shiqi
    Ren, Ju
    Cao, Ting
    PROCEEDINGS OF THE 2024 WORKSHOP ON EDGE AND MOBILE FOUNDATION MODELS, EDGEFM 2024, 2024, : 36 - 41
  • [4] Adaptive joint configuration optimization for collaborative inference in edge-cloud systems
    Zheming YANG
    Wen JI
    Zhi WANG
    Science China(Information Sciences), 2024, 67 (04) : 335 - 336
  • [5] Adaptive joint configuration optimization for collaborative inference in edge-cloud systems
    Yang, Zheming
    Ji, Wen
    Wang, Zhi
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (04)
  • [6] Attacking and Protecting Data Privacy in Edge-Cloud Collaborative Inference Systems
    He, Zecheng
    Zhang, Tianwei
    Lee, Ruby B.
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (12) : 9706 - 9716
  • [7] An Adaptive Task Migration Scheduling Approach for Edge-Cloud Collaborative Inference
    Zhang, Boyin
    Li, Yinggang
    Zhang, Shigeng
    Zhang, Yue
    Zhu, Bing
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [8] Deploying Deep Neural Network on Edge-Cloud environment
    Kum, Seungwoo
    Kim, Youngkee
    Moon, Jaewon
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 242 - 244
  • [9] A convolutional neural network based online teaching method using edge-cloud computing platform
    Zhong, Liu
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2023, 12 (01):
  • [10] A convolutional neural network based online teaching method using edge-cloud computing platform
    Liu Zhong
    Journal of Cloud Computing, 12