Collaborative Inference via Ensembles on the Edge

Cited by: 18
Authors
Shlezinger, Nir [1]
Farhan, Erez [2]
Morgenstern, Hai [2]
Eldar, Yonina C. [3]
Affiliations
[1] Ben Gurion Univ Negev, Sch ECE, Beer Sheva, Israel
[2] BeyondMinds, Tel Aviv, Israel
[3] Weizmann Inst Sci, Fac Math & CS, Rehovot, Israel
Source
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021
Funding
Israel Science Foundation;
Keywords
Edge computing; deep ensembles; neural networks;
DOI
10.1109/ICASSP39728.2021.9414740
Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
The success of deep neural networks (DNNs) as an enabler of artificial intelligence (AI) is heavily dependent on high computational resources. The increasing demands for accessible and personalized AI give rise to the need to operate DNNs on edge devices such as smartphones, sensors, and autonomous cars, whose computational powers are limited. Here we propose a framework for facilitating the application of DNNs on the edge in a manner which allows multiple users to collaborate during inference in order to improve their prediction accuracy. Our mechanism, referred to as edge ensembles, is based on having diverse predictors at each device, which can form a deep ensemble during inference. We analyze the latency induced in this collaborative inference approach, showing that the ability to improve performance via collaboration comes at the cost of a minor additional delay. Our experimental results demonstrate that collaborative inference via edge ensembles equipped with compact DNNs substantially improves the accuracy over having each user infer locally, and can outperform using a single centralized DNN larger than all the networks in the ensemble together.
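The collaboration step described in the abstract can be illustrated with a minimal sketch. It assumes each device returns a vector of class probabilities and that the ensemble combines them by simple averaging, which is a common deep-ensemble rule; this record does not state the aggregation rule the authors use. The function names and the example probabilities are hypothetical.

```python
from typing import List, Sequence


def ensemble_predict(device_probs: Sequence[Sequence[float]]) -> List[float]:
    """Average the per-class probabilities reported by each participating device.

    Assumption: plain averaging of soft outputs; the source record does not
    specify how the predictions are combined.
    """
    if not device_probs:
        raise ValueError("at least one device prediction is required")
    num_classes = len(device_probs[0])
    if any(len(p) != num_classes for p in device_probs):
        raise ValueError("all devices must report the same number of classes")
    n = len(device_probs)
    return [sum(p[c] for p in device_probs) / n for c in range(num_classes)]


def argmax(values: Sequence[float]) -> int:
    """Return the index of the largest value."""
    return max(range(len(values)), key=lambda i: values[i])


# Hypothetical soft outputs from three compact models on three devices.
local = [0.45, 0.40, 0.15]                      # the querying user's own model
neighbours = [[0.20, 0.70, 0.10], [0.30, 0.60, 0.10]]

combined = ensemble_predict([local] + neighbours)
local_choice = argmax(local)          # decision when inferring locally
ensemble_choice = argmax(combined)    # decision after collaborating
```

In this made-up example the local model narrowly prefers class 0, while the averaged prediction prefers class 1, which is the kind of change in decision that collaboration can produce.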
Pages: 8478-8482
Page count: 5
Related papers
50 in total
  • [41] STI: Turbocharge NLP Inference at the Edge via Elastic Pipelining
    Guo, Liwei
    Choe, Wonkyo
    Lin, Felix Xiaozhu
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, VOL 2, ASPLOS 2023, 2023, : 791 - 803
  • [42] Inference on the prediction of ensembles of infinite size
    Hernandez-Lobato, Daniel
    Martinez-Munoz, Gonzalo
    Suarez, Alberto
    PATTERN RECOGNITION, 2011, 44 (07) : 1426 - 1434
  • [43] Deriving Probabilistic Databases with Inference Ensembles
    Stoyanovich, Julia
    Davidson, Susan
    Milo, Tova
    Tannen, Val
    IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 303 - 314
  • [44] Reinforcement Learning Based Energy-Efficient Collaborative Inference for Mobile Edge Computing
    Xiao, Yilin
    Xiao, Liang
    Wan, Kunpeng
    Yang, Helin
    Zhang, Yi
    Wu, Yi
    Zhang, Yanyong
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (02) : 864 - 876
  • [45] An adaptive DNN inference acceleration framework with end-edge-cloud collaborative computing
    Liu, Guozhi
    Dai, Fei
    Xu, Xiaolong
    Fu, Xiaodong
    Dou, Wanchun
    Kumar, Neeraj
    Bilal, Muhammad
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 140 : 422 - 435
  • [46] AppealNet: An Efficient and Highly-Accurate Edge/Cloud Collaborative Architecture for DNN Inference
    Li, Min
    Li, Yu
    Tian, Ye
    Jiang, Li
    Xu, Qiang
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 409 - 414
  • [47] Sniper: Cloud-Edge Collaborative Inference Scheduling with Neural Network Similarity Modeling
    Liu, Weihong
    Geng, Jiawei
    Zhu, Zongwei
    Cao, Jing
    Lian, Zirui
    PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 505 - 510
  • [48] Collaborative Inference Acceleration Integrating DNN Partitioning and Task Offloading in Mobile Edge Computing
    Xu, Wenxiu
    Yin, Yin
    Chen, Ningjiang
    Tu, Huan
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2023, 33 (11N12) : 1835 - 1863
  • [49] Adaptive Workload Distribution for Accuracy-aware DNN Inference on Collaborative Edge Platforms
    Taufique, Zain
    Miele, Antonio
    Liljeberg, Pasi
    Kanduri, Anil
    29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 109 - 114
  • [50] BBNet: A Novel Convolutional Neural Network Structure in Edge-Cloud Collaborative Inference
    Zhou, Hongbo
    Zhang, Weiwei
    Wang, Chengwei
    Ma, Xin
    Yu, Haoran
    SENSORS, 2021, 21 (13)