Flexible Deployment of Machine Learning Inference Pipelines in the Cloud-Edge-IoT Continuum

被引:2
作者
Bogacka, Karolina [1 ,2 ]
Sowinski, Piotr [1 ,2 ]
Danilenka, Anastasiya [1 ,2 ]
Biot, Francisco Mahedero [3 ]
Wasielewska-Michniewska, Katarzyna [1 ]
Ganzha, Maria [1 ,2 ]
Paprzycki, Marcin [1 ]
Palau, Carlos E. [3 ]
机构
[1] Polish Acad Sci, Syst Res Inst, Ul Newelska 6, PL-01447 Warsaw, Poland
[2] Warsaw Univ Technol, Fac Math & Informat Sci, Ul Koszykowa 75, PL-00662 Warsaw, Poland
[3] Univ Politecn Valencia, Commun Dept, Cami Vera S N, Valencia 46022, Spain
关键词
machine learning; edge computing; IoT; cloud-edge-IoT; inference; gRPC; inference server; INTERNET;
D O I
10.3390/electronics13101888
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Currently, deploying machine learning workloads in the Cloud-Edge-IoT continuum is challenging due to the wide variety of available hardware platforms, stringent performance requirements, and the heterogeneity of the workloads themselves. To alleviate this, a novel, flexible approach for machine learning inference is introduced, which is suitable for deployment in diverse environments-including edge devices. The proposed solution has a modular design and is compatible with a wide range of user-defined machine learning pipelines. To improve energy efficiency and scalability, a high-performance communication protocol for inference is propounded, along with a scale-out mechanism based on a load balancer. The inference service plugs into the ASSIST-IoT reference architecture, thus taking advantage of its other components. The solution was evaluated in two scenarios closely emulating real-life use cases, with demanding workloads and requirements constituting several different deployment scenarios. The results from the evaluation show that the proposed software meets the high throughput and low latency of inference requirements of the use cases while effectively adapting to the available hardware. The code and documentation, in addition to the data used in the evaluation, were open-sourced to foster adoption of the solution.
引用
收藏
页数:31
相关论文
共 50 条
  • [41] A Discussion on Context-Awareness to Better Support the IoT Cloud/Edge Continuum
    Da Silva, Daniel Maniglia Amancio
    Sofia, Rute C.
    IEEE ACCESS, 2020, 8 (08): : 193686 - 193694
  • [42] Elder Care System using IoT and Machine Learning in AWS Cloud
    Srinivasan, Aparajith
    Natarajan, Nithya
    Karunakaran, Raj Vignesh
    Elangovan, Ramya
    Shankar, Abirami
    Padmanaaban M, Sabharish
    Sreeja, B. S.
    Radha, S.
    2020 IEEE 17TH INTERNATIONAL CONFERENCE ON SMART COMMUNITIES: IMPROVING QUALITY OF LIFE USING ICT, IOT AND AI (IEEEHONET 2020), 2020, : 92 - 98
  • [43] Design of Platform-Independent IoT Applications in the Edge-Cloud Continuum
    Marozzo, Fabrizio
    Vinci, Andrea
    2024 20TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SMART SYSTEMS AND THE INTERNET OF THINGS, DCOSS-IOT 2024, 2024, : 589 - 594
  • [44] Machine Learning for Edge-Aware Resource Orchestration for IoT Applications
    Jammal, Manar
    AbuSharkh, Mohamed
    2021 IEEE GLOBAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INTERNET OF THINGS (GCAIOT), 2021, : 37 - 44
  • [45] Resource Allocation With Edge Computing in IoT Networks via Machine Learning
    Liu, Xiaolan
    Yu, Jiadong
    Wang, Jian
    Gao, Yue
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (04) : 3415 - 3426
  • [46] Scalable Edge Computing for IoT and Multimedia Applications Using Machine Learning
    Babar, Mohammad
    Khan, Muhammad Sohail
    Habib, Usman
    Shah, Babar
    Ali, Farman
    Song, Dongho
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2021, 11
  • [47] Machine Learning and Enhanced Encryption for Edge Computing in IoT and Wireless Networks
    Hardas, Bhalchandra M.
    Raut, Vaishali
    Palsodkar, Prasanna
    Aush, Mithun G.
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (01) : 200 - 210
  • [48] Cyber Security on the Edge: Efficient Enabling of Machine Learning on IoT Devices
    Kumari, Swati
    Tulshyan, Vatsal
    Tewari, Hitesh
    INFORMATION, 2024, 15 (03)
  • [49] Edge Machine Learning for AI-Enabled IoT Devices: A Review
    Merenda, Massimo
    Porcaro, Carlo
    Iero, Demetrio
    SENSORS, 2020, 20 (09)