Flexible Deployment of Machine Learning Inference Pipelines in the Cloud-Edge-IoT Continuum

被引:2
作者
Bogacka, Karolina [1 ,2 ]
Sowinski, Piotr [1 ,2 ]
Danilenka, Anastasiya [1 ,2 ]
Biot, Francisco Mahedero [3 ]
Wasielewska-Michniewska, Katarzyna [1 ]
Ganzha, Maria [1 ,2 ]
Paprzycki, Marcin [1 ]
Palau, Carlos E. [3 ]
机构
[1] Polish Acad Sci, Syst Res Inst, Ul Newelska 6, PL-01447 Warsaw, Poland
[2] Warsaw Univ Technol, Fac Math & Informat Sci, Ul Koszykowa 75, PL-00662 Warsaw, Poland
[3] Univ Politecn Valencia, Commun Dept, Cami Vera S N, Valencia 46022, Spain
关键词
machine learning; edge computing; IoT; cloud-edge-IoT; inference; gRPC; inference server; INTERNET;
D O I
10.3390/electronics13101888
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Currently, deploying machine learning workloads in the Cloud-Edge-IoT continuum is challenging due to the wide variety of available hardware platforms, stringent performance requirements, and the heterogeneity of the workloads themselves. To alleviate this, a novel, flexible approach for machine learning inference is introduced, which is suitable for deployment in diverse environments-including edge devices. The proposed solution has a modular design and is compatible with a wide range of user-defined machine learning pipelines. To improve energy efficiency and scalability, a high-performance communication protocol for inference is propounded, along with a scale-out mechanism based on a load balancer. The inference service plugs into the ASSIST-IoT reference architecture, thus taking advantage of its other components. The solution was evaluated in two scenarios closely emulating real-life use cases, with demanding workloads and requirements constituting several different deployment scenarios. The results from the evaluation show that the proposed software meets the high throughput and low latency of inference requirements of the use cases while effectively adapting to the available hardware. The code and documentation, in addition to the data used in the evaluation, were open-sourced to foster adoption of the solution.
引用
收藏
页数:31
相关论文
共 50 条
  • [1] Towards a Model-Based Serverless Platform for the Cloud-Edge-IoT Continuum
    Ferry, Nicolas
    Dautov, Rustem
    Song, Hui
    2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022, : 851 - 858
  • [2] Leveraging the serverless paradigm for realizing machine learning pipelines across the edge-cloud continuum
    Paraskevoulakou, Efterpi
    Kyriazis, Dimosthenis
    2021 24TH CONFERENCE ON INNOVATION IN CLOUDS, INTERNET AND NETWORKS AND WORKSHOPS (ICIN), 2021,
  • [3] Model-based fleet deployment in the IoT–edge–cloud continuum
    Hui Song
    Rustem Dautov
    Nicolas Ferry
    Arnor Solberg
    Franck Fleurey
    Software and Systems Modeling, 2022, 21 : 1931 - 1956
  • [4] A Review of Application Layer Communication Protocols for the IoT Edge Cloud Continuum
    Kampars, Janis
    Tropins, Dainis
    Matisons, Ralfs
    2021 62ND INTERNATIONAL SCIENTIFIC CONFERENCE ON INFORMATION TECHNOLOGY AND MANAGEMENT SCIENCE OF RIGA TECHNICAL UNIVERSITY (ITMS), 2021,
  • [5] Power Efficient Machine Learning Models Deployment on Edge IoT Devices
    Fanariotis, Anastasios
    Orphanoudakis, Theofanis
    Kotrotsios, Konstantinos
    Fotopoulos, Vassilis
    Keramidas, George
    Karkazis, Panagiotis
    SENSORS, 2023, 23 (03)
  • [6] Model-based fleet deployment in the IoT-edge-cloud continuum
    Song, Hui
    Dautov, Rustem
    Ferry, Nicolas
    Solberg, Arnor
    Fleurey, Franck
    SOFTWARE AND SYSTEMS MODELING, 2022, 21 (05) : 1931 - 1956
  • [7] Lightning Talk: Efficient Embedded Machine Learning Deployment on Edge and IoT Devices
    Pasricha, Sudeep
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [8] EdgeInsight: Characterizing and Modeling the Performance of Machine Learning Inference on the Edge and Cloud
    Ross, Philipp
    Luckow, Andre
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 1897 - 1906
  • [9] Expanding the cloud-to-edge continuum to the IoT in serverless federated learning
    Loconte, Davide
    Ieva, Saverio
    Pinto, Agnese
    Loseto, Giuseppe
    Scioscia, Floriano
    Ruta, Michele
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 155 : 447 - 462
  • [10] Polaris Scheduler: Edge Sensitive and SLO Aware Workload Scheduling in Cloud-Edge-IoT Clusters
    Nastic, Stefan
    Pusztai, Thomas
    Morichetta, Andrea
    Pujol, Victor Casamayor
    Dustdar, Schahram
    Vij, Deepak
    Xiong, Ying
    2021 IEEE 14TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2021), 2021, : 206 - 216