Flexible Deployment of Machine Learning Inference Pipelines in the Cloud-Edge-IoT Continuum

Cited by: 2
Authors
Bogacka, Karolina [1,2]
Sowinski, Piotr [1,2]
Danilenka, Anastasiya [1,2]
Biot, Francisco Mahedero [3]
Wasielewska-Michniewska, Katarzyna [1]
Ganzha, Maria [1,2]
Paprzycki, Marcin [1]
Palau, Carlos E. [3]
Affiliations
[1] Polish Acad Sci, Syst Res Inst, Ul Newelska 6, PL-01447 Warsaw, Poland
[2] Warsaw Univ Technol, Fac Math & Informat Sci, Ul Koszykowa 75, PL-00662 Warsaw, Poland
[3] Univ Politecn Valencia, Commun Dept, Cami Vera S N, Valencia 46022, Spain
Keywords
machine learning; edge computing; IoT; cloud-edge-IoT; inference; gRPC; inference server; INTERNET
DOI
10.3390/electronics13101888
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Deploying machine learning workloads in the Cloud-Edge-IoT continuum is currently challenging due to the wide variety of available hardware platforms, stringent performance requirements, and the heterogeneity of the workloads themselves. To alleviate this, a novel, flexible approach to machine learning inference is introduced, which is suitable for deployment in diverse environments, including edge devices. The proposed solution has a modular design and is compatible with a wide range of user-defined machine learning pipelines. To improve energy efficiency and scalability, a high-performance communication protocol for inference is proposed, along with a scale-out mechanism based on a load balancer. The inference service plugs into the ASSIST-IoT reference architecture, thus taking advantage of its other components. The solution was evaluated in two scenarios closely emulating real-life use cases, with demanding workloads and requirements that cover several different deployment scenarios. The evaluation results show that the proposed software meets the high-throughput and low-latency inference requirements of the use cases while effectively adapting to the available hardware. The code and documentation, along with the data used in the evaluation, were open-sourced to foster adoption of the solution.
Pages: 31
Related papers
50 in total
  • [21] Flexible and Efficient Deployment of Data Processing Pipelines on Wireless IoT Systems
    Polychronis, Giorgos
    Koutsoubelias, Manos
    Pournaropoulos, Foivos
    Lalis, Spyros
    Georgiadis, Lefteris
    Pazios, Thomas
    Tsatsaronis, Stratos
    Vrakidis, Isaias
    2024 IEEE SENSORS APPLICATIONS SYMPOSIUM, SAS 2024, 2024,
  • [22] Enhancing Support for Machine Learning and Edge Computing on an IoT Data Marketplace
    Sajan, Kurian Karyakulam
    Ramachandran, Gowri Sankar
    Krishnamachari, Bhaskar
    PROCEEDINGS OF THE 2019 INTERNATIONAL WORKSHOP ON CHALLENGES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR INTERNET OF THINGS (AICHALLENGEIOT '19), 2019, : 19 - 24
  • [23] An energy efficient IoT data compression approach for edge machine learning
    Azar, Joseph
    Makhoul, Abdallah
    Barhamgi, Mahmoud
    Couturier, Raphael
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 96 : 168 - 175
  • [24] Machine Learning for Security at the IoT Edge - A Feasibility Study
    Wang, Han
    Barriga, Luis
    Vahidi, Arash
    Raza, Shahid
    2019 IEEE 16TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SENSOR SYSTEMS WORKSHOPS (MASSW 2019), 2019, : 7 - 12
  • [25] Machine Learning plus Distributed IoT = Edge Intelligence
    Wolf, Marilyn
    2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 1715 - 1719
  • [26] A Systematic Review on Federated Learning in Edge-Cloud Continuum
    Sambit Kumar Mishra
    Subham Kumar Sahoo
    Chinmaya Kumar Swain
    SN Computer Science, 5 (7)
  • [27] ClusterSlice: A Zero-touch Deployment Platform for the Edge Cloud Continuum
    Mamatas, Lefteris
    Skaperas, Sotiris
    Sakellariou, Ilias
    PROCEEDINGS OF THE 27TH CONFERENCE ON INNOVATION IN CLOUDS, INTERNET AND NETWORKS, ICIN, 2024, : 100 - 102
  • [28] Mobile IoT-Edge-Cloud Continuum Based and DevOps Enabled Software Framework
    Judvaitis, Janis
    Balass, Rihards
    Greitans, Modris
    JOURNAL OF SENSOR AND ACTUATOR NETWORKS, 2021, 10 (04)
  • [29] Machine-Learning-Based IoT-Edge Computing Healthcare Solutions
    Alnaim, Abdulrahman K.
    Alwakeel, Ahmed M.
    ELECTRONICS, 2023, 12 (04)
  • [30] ECBA-MLI: Edge Computing Benchmark Architecture for Machine Learning Inference
    Schneider, Mathias
    Prokscha, Ruben
    Saadani, Seifeddine
    Hoess, Alfred
    2022 IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING & COMMUNICATIONS (IEEE EDGE 2022), 2022, : 23 - 32