BBNet: A Novel Convolutional Neural Network Structure in Edge-Cloud Collaborative Inference

被引:11
|
作者
Zhou, Hongbo [1 ,2 ]
Zhang, Weiwei [1 ,2 ]
Wang, Chengwei [1 ]
Ma, Xin [1 ,2 ]
Yu, Haoran [1 ,2 ]
机构
[1] Huaqiao Univ, Coll Engn, Quanzhou 362021, Peoples R China
[2] Fujian Prov Acad Engn Res Ctr Ind Intellectual Te, Quanzhou 362021, Peoples R China
关键词
collaborative intelligence; deep learning; model compression; feature compression; cloud computing; INTELLIGENCE;
D O I
10.3390/s21134494
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Edge-cloud collaborative inference can significantly reduce the delay of a deep neural network (DNN) by dividing the network between mobile edge and cloud. However, the in-layer data size of DNN is usually larger than the original data, so the communication time to send intermediate data to the cloud will also increase end-to-end latency. To cope with these challenges, this paper proposes a novel convolutional neural network structure-BBNet-that accelerates collaborative inference from two levels: (1) through channel-pruning: reducing the number of calculations and parameters of the original network; (2) through compressing the feature map at the split point to further reduce the size of the data transmitted. In addition, This paper implemented the BBNet structure based on NVIDIA Nano and the server. Compared with the original network, BBNet's FLOPs and parameter achieve up to 5.67x and 11.57x on the compression rate, respectively. In the best case, the feature compression layer can reach a bit-compression rate of 512x. Compared with the better bandwidth conditions, BBNet has a more obvious inference delay when the network conditions are poor. For example, when the upload bandwidth is only 20 kb/s, the end-to-end latency of BBNet is increased by 38.89x compared with the cloud-only approach.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Nebula: An Edge-Cloud Collaborative Learning Framework for Dynamic Edge Environments
    Zhuang, Yan
    Zheng, Zhenzhe
    Shao, Yunfeng
    Li, Bingshuai
    Wu, Fan
    Chen, Guihai
    53RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2024, 2024, : 782 - 791
  • [32] Edge-Cloud Collaborative UAV Object Detection: Edge-Embedded Lightweight Algorithm Design and Task Offloading Using Fuzzy Neural Network
    Yuan, Yazhou
    Gao, Shicong
    Zhang, Ziteng
    Wang, Wenye
    Xu, Zhezhuang
    Liu, Zhixin
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2024, 12 (01) : 306 - 318
  • [33] Accelerated Inference of Face Detection under Edge-Cloud Collaboration
    Zhang, Weiwei
    Zhou, Hongbo
    Mo, Jian
    Zhen, Chenghui
    Ji, Ming
    APPLIED SCIENCES-BASEL, 2022, 12 (17):
  • [34] Collaborative Optimization of Edge-Cloud Computation Offloading in Internet of Vehicles
    Li, Yureng
    Xu, Shouzhi
    30TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN 2021), 2021,
  • [35] Using Collaborative Edge-Cloud Cache for Search in Internet of Things
    Tang, Jine
    Zhou, Zhangbing
    Xue, Xiao
    Wang, Gongwen
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (02) : 922 - 936
  • [36] MPCSM: Microservice Placement for Edge-Cloud Collaborative Smart Manufacturing
    Wang, Yimeng
    Zhao, Cong
    Yang, Shusen
    Ren, Xuebin
    Wang, Luhui
    Zhao, Peng
    Yang, Xinyu
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (09) : 5898 - 5908
  • [37] Task Offloading and Resource Allocation for Edge-Cloud Collaborative Computing
    Wang, Yaxing
    Hao, Jia
    Xu, Gang
    Huang, Baoqi
    Zhang, Feng
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT V, 2024, 14491 : 361 - 372
  • [38] Shoggoth: Towards Efficient Edge-Cloud Collaborative Real-Time Video Inference via Adaptive Online
    Wang, Liang
    Lu, Kai
    Zhang, Nan
    Qu, Xiaoyang
    Wang, Jianzong
    Wan, Jiguang
    Li, Guokuan
    Xiao, Jing
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [39] A Splittable DNN-Based Object Detector for Edge-Cloud Collaborative Real-Time Video Inference
    Lee, Joo Chan
    Kim, Yongwoo
    Moon, SungTae
    Ko, Jong Hwan
    2021 17TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2021), 2021,
  • [40] Task offloading optimization mechanism based on deep neural network in edge-cloud environment
    Meng, Lingkang
    Wang, Yingjie
    Wang, Haipeng
    Tong, Xiangrong
    Sun, Zice
    Cai, Zhipeng
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2023, 12 (01):