Reliable adaptive edge-cloud collaborative DNN inference acceleration scheme combining computing and communication resources in optical networks

被引:1
|
作者
Yin, Shan [1 ]
Jiao, Yurong [1 ]
You, Chenyu [1 ]
Cai, Mengru [1 ]
Jin, Tianyu [1 ]
Huang, Shanguo [1 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Informat Photon & Opt Commun, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Servers; Collaboration; Optical fiber networks; Computational modeling; Reliability; Cloud computing; ALLOCATION; MULTIUSER; SPECTRUM; INTERNET; THINGS;
D O I
10.1364/JOCN.495765
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the continuous development of the Artificial Intelligence of Things, deep neural network (DNN) models require a larger amount of computing capacity. The emerging edge-cloud collaboration architecture in optical networks is proposed as an effective solution, which combines edge computing with cloud computing to provide faster response and reduce the cloud load for compute-intensive tasks. The multi-layered DNN model can be divided into subtasks that are offloaded to edge and cloud servers for computation in this architecture. In addition, as bearer networks for computing capacity, once a server or link in optical networks fails, a large amount of data can be lost, so the robust reliability of the edge-cloud collaborative optical networks is very important. To solve the above problems, we design a reliable adaptive edge-cloud collaborative DNN inference acceleration scheme (RACAI) combining computing and communication resources. We formulate the RACAI into a mixed integer linear programming model and develop a multi-agent deep reinforcement learning algorithm (MADRL-RACIA) to jointly optimize DNN task partitioning, offloading, and protection. The simulation results show that compared with the benchmark schemes, the proposed MADRL-RACIA can provide a guarantee of reliability for more tasks under latency constraints and reduce the blocking probability.
引用
收藏
页码:750 / 764
页数:15
相关论文
共 26 条
  • [1] An adaptive DNN inference acceleration framework with end-edge-cloud collaborative computing
    Liu, Guozhi
    Dai, Fei
    Xu, Xiaolong
    Fu, Xiaodong
    Dou, Wanchun
    Kumar, Neeraj
    Bilal, Muhammad
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 140 : 422 - 435
  • [2] EosDNN: An Efficient Offloading Scheme for DNN Inference Acceleration in Local-Edge-Cloud Collaborative Environments
    Xue, Min
    Wu, Huaming
    Li, Ruidong
    Xu, Minxian
    Jiao, Pengfei
    IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2022, 6 (01): : 248 - 264
  • [3] ADDA: Adaptive Distributed DNN Inference Acceleration in Edge Computing Environment
    Wang, Huitian
    Cai, Guangxing
    Huang, Zhaowu
    Dong, Fang
    2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 438 - 445
  • [4] Collaborative DNNs Inference with Joint Model Partition and Compression in Mobile Edge-Cloud Computing Networks
    Tang, Yaxin
    Li, Xiuhua
    Li, Hui
    Yang, Zhengyi
    Wang, Xiaofei
    Leung, Victor C. M.
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [5] DNN Real-Time Collaborative Inference Acceleration with Mobile Edge Computing
    Yang, Run
    Li, Yan
    He, Hui
    Zhang, Weizhe
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [6] Adaptive joint configuration optimization for collaborative inference in edge-cloud systems
    Zheming YANG
    Wen JI
    Zhi WANG
    Science China(Information Sciences), 2024, 67 (04) : 335 - 336
  • [7] Adaptive joint configuration optimization for collaborative inference in edge-cloud systems
    Yang, Zheming
    Ji, Wen
    Wang, Zhi
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (04)
  • [8] An Adaptive Task Migration Scheduling Approach for Edge-Cloud Collaborative Inference
    Zhang, Boyin
    Li, Yinggang
    Zhang, Shigeng
    Zhang, Yue
    Zhu, Bing
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [9] Efficient Resource Management and Expansion Scheme for Collaborative Edge-Cloud Computing
    Wang, Wei
    Zhang, Yongmin
    Huang, Rui
    Ren, Ju
    Lyu, Feng
    Zhang, Yaoxue
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (04) : 2731 - 2747
  • [10] An Adaptive Neural Architecture Search Design for Collaborative Edge-Cloud Computing
    Lu, Haodong
    Du, Miao
    He, Xiaoming
    Qian, Kai
    Chen, Jianli
    Sun, Yanfei
    Wang, Kun
    IEEE NETWORK, 2021, 35 (05): : 83 - 89