Flexible Deployment of Machine Learning Inference Pipelines in the Cloud-Edge-IoT Continuum

被引:2
作者
Bogacka, Karolina [1 ,2 ]
Sowinski, Piotr [1 ,2 ]
Danilenka, Anastasiya [1 ,2 ]
Biot, Francisco Mahedero [3 ]
Wasielewska-Michniewska, Katarzyna [1 ]
Ganzha, Maria [1 ,2 ]
Paprzycki, Marcin [1 ]
Palau, Carlos E. [3 ]
机构
[1] Polish Acad Sci, Syst Res Inst, Ul Newelska 6, PL-01447 Warsaw, Poland
[2] Warsaw Univ Technol, Fac Math & Informat Sci, Ul Koszykowa 75, PL-00662 Warsaw, Poland
[3] Univ Politecn Valencia, Commun Dept, Cami Vera S N, Valencia 46022, Spain
关键词
machine learning; edge computing; IoT; cloud-edge-IoT; inference; gRPC; inference server; INTERNET;
D O I
10.3390/electronics13101888
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Currently, deploying machine learning workloads in the Cloud-Edge-IoT continuum is challenging due to the wide variety of available hardware platforms, stringent performance requirements, and the heterogeneity of the workloads themselves. To alleviate this, a novel, flexible approach for machine learning inference is introduced, which is suitable for deployment in diverse environments-including edge devices. The proposed solution has a modular design and is compatible with a wide range of user-defined machine learning pipelines. To improve energy efficiency and scalability, a high-performance communication protocol for inference is propounded, along with a scale-out mechanism based on a load balancer. The inference service plugs into the ASSIST-IoT reference architecture, thus taking advantage of its other components. The solution was evaluated in two scenarios closely emulating real-life use cases, with demanding workloads and requirements constituting several different deployment scenarios. The results from the evaluation show that the proposed software meets the high throughput and low latency of inference requirements of the use cases while effectively adapting to the available hardware. The code and documentation, in addition to the data used in the evaluation, were open-sourced to foster adoption of the solution.
引用
收藏
页数:31
相关论文
共 50 条
  • [31] Fall detection in older adults with mobile IoT devices and machine learning in the cloud and on the edge
    Mrozek, Dariusz
    Koczur, Anna
    Malysiak-Mrozek, Bozena
    INFORMATION SCIENCES, 2020, 537 : 132 - 147
  • [32] Mobility-Aware IoT Application Placement in the Cloud - Edge Continuum
    Kimovski, Dragi
    Mehran, Narges
    Kerth, Christopher Emanuel
    Prodan, Radu
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (06) : 3358 - 3371
  • [33] Robustness via Elasticity Accelerators for the IoT-Edge-Cloud Continuum
    Hong-Linh Truong
    Magoutis, Kostas
    2022 IEEE/ACM 15TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC, 2022, : 291 - 296
  • [34] Decentralized Serverless IoT Dataflow Architecture for the Cloud-to-Edge Continuum
    Lopez Escobar, Juan Jose
    Gil-Castineira, Felipe
    Diaz Redondo, Rebeca P.
    2023 26TH CONFERENCE ON INNOVATION IN CLOUDS, INTERNET AND NETWORKS AND WORKSHOPS, ICIN, 2023,
  • [35] Machine Learning for Cloud and IoT-Based Smart Agriculture
    Et-taibi, Bouali
    Abid, Mohamed Riduan
    Boufounas, El-Mahjoub
    Bourhnane, Safae
    Benhaddou, Driss
    ADVANCES IN CONTROL POWER SYSTEMS AND EMERGING TECHNOLOGIES, VOL 2, ICESA 2023, 2024, : 181 - 187
  • [36] Offloading Using Traditional Optimization and Machine Learning in Federated Cloud-Edge-Fog Systems: A Survey
    Kar, Binayak
    Yahya, Widhi
    Lin, Ying-Dar
    Ali, Asad
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2023, 25 (02): : 1199 - 1226
  • [37] The Cloud-to-Edge-to-IoT Continuum as an Enabler for Search and Rescue Operations
    Militano, Leonardo
    Arteaga, Adriana
    Toffetti, Giovanni
    Mitton, Nathalie
    FUTURE INTERNET, 2023, 15 (02):
  • [38] Latency-Aware Deployment of IoT Services in a Cloud-Edge Environment
    Zhang, Shouli
    Liu, Chen
    Wang, Jianwu
    Yang, Zhongguo
    Han, Yanbo
    Li, Xiaohong
    SERVICE-ORIENTED COMPUTING (ICSOC 2019), 2019, 11895 : 231 - 236
  • [39] Machine learning for cloud, fog, edge and serverless computing environments: comparisons, performance evaluation benchmark and future directions
    Singh, Parminder
    Kaur, Avinash
    Gill, Sukhpal Singh
    INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2022, 13 (04) : 447 - 457
  • [40] A Set of Tools and Data Management Framework for the IoT-Edge-Cloud Continuum
    Judvaitis, Janis
    Blumbergs, Eduards
    Arzovs, Audris
    Mackus, Andris Ivars
    Balass, Rihards
    Selavo, Leo
    APPLIED SYSTEM INNOVATION, 2024, 7 (06)