The orchestration of Machine Learning frameworks with data streams and GPU acceleration in Kafka-ML: A deep-learning performance comparative

被引:7
作者
Chaves, Antonio Jesus [1 ]
Martin, Cristian [1 ]
Diaz, Manuel [1 ]
机构
[1] Univ Malaga, ITIS Software Inst, Malaga, Spain
关键词
artificial intelligence; data streams; Kafka-ML; machine learning; PyTorch; TensorFlow;
D O I
10.1111/exsy.13287
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine Learning (ML) applications need large volumes of data to train their models so that they can make high-quality predictions. Given digital revolution enablers such as the Internet of Things (IoT) and the Industry 4.0, this information is generated in large quantities in terms of continuous data streams and not in terms of static datasets as it is the case with most AI (Artificial Intelligence) frameworks. Kafka-ML is a novel open-source framework that allows the complete management of ML/AI pipelines through data streams. In this article, we present new features for the Kafka-ML framework, such as the support for the well-known ML/AI framework PyTorch, as well as for GPU acceleration at different points along the pipeline. This pipeline will be described by taking a real Industry 4.0 use case in the Petrochemical Industry. Finally, a comprehensive evaluation with state-of-the-art deep learning models will be carried out to demonstrate the feasibility of the platform.
引用
收藏
页数:20
相关论文
共 31 条
  • [1] algorithmia.com, ABOUT US
  • [2] .amazon.com, WHAT IS AMAZON SAGEM
  • [3] ClearML, 2019, CLEARML YOUR ENT MLO
  • [4] cnvrg.io Official Webpage, US
  • [5] Reveal training performance mystery between TensorFlow and PyTorch in the single GPU environment
    Dai, Hulin
    Peng, Xuan
    Shi, Xuanhua
    He, Ligang
    Xiong, Qian
    Jin, Hai
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (01)
  • [6] State-of-the-art, challenges, and open issues in the integration of Internet of things and cloud computing
    Diaz, Manuel
    Martin, Cristian
    Rubio, Bartolome
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2016, 67 : 99 - 117
  • [7] Predictive analytics in education: a comparison of deep learning frameworks
    Doleck, Tenzin
    Lemay, David John
    Basnet, Ram B.
    Bazelais, Paul
    [J]. EDUCATION AND INFORMATION TECHNOLOGIES, 2020, 25 (03) : 1951 - 1963
  • [8] DLBench: a comprehensive experimental evaluation of deep learning frameworks
    Elshawi, Radwa
    Wahab, Abdul
    Barnawi, Ahmed
    Sakr, Sherif
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (03): : 2017 - 2038
  • [9] Feast Official Webpage, US
  • [10] Florencio F., 2019, J COMPUT SCI-NETH, V15, P785, DOI [10.3844/jcssp.2019.785.799, DOI 10.3844/JCSSP.2019.785.799]