Data-Driven Workload Generation Based on Google Data Center Measurements

被引:1
|
作者
Yildiz, Mert [1 ]
Baiocchi, Andrea [1 ]
机构
[1] Univ Roma Sapienza, DIET, Rome, Italy
来源
2024 IEEE 25TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING, HPSR 2024 | 2024年
关键词
Workload modeling; data centers; traffic measurements; data fitting; large server clusters;
D O I
10.1109/HPSR62440.2024.10635925
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A large dataset of workload measurements has been released by Google. The wealth of disclosed data allows a deep dive into real workload patterns. With the aim of providing tools to generate realistic workloads in a simple way, we have extracted from Google's dataset job arrival times, number of tasks per job, required computation time, and memory of tasks. We define a statistical fitting of the relevant probability distribution, providing a simple tool to build artificial workload traces that mimic real traffic as represented by Google measurements. The workload generation algorithm is assessed by comparison of its mean response time on a test dispatching/scheduling system against the real traffic traces. In spite of being only a first-order generation model, it is shown that the proposed artificial generation can reproduce faithfully the performance of real workload in the case of large server clusters.
引用
收藏
页码:143 / 148
页数:6
相关论文
共 50 条
  • [1] Data-driven crack assessment based on surface measurements
    Schulz, Katrin
    Kreis, Stephan
    Trittenbach, Holger
    Boehm, Klemens
    ENGINEERING FRACTURE MECHANICS, 2019, 218
  • [2] Redefining faculty workload metrics: A data-driven approach
    Johnson, Heather L.
    Cruthirds, Danette F.
    Taylor, Laura A.
    Suszan, Lauren T.
    Owen, Regina P.
    Trautmann, Jennifer L.
    Beatty, Jonathan R.
    Seibert, Diane C.
    JOURNAL OF PROFESSIONAL NURSING, 2024, 55 : 112 - 118
  • [3] Driver cognitive workload estimation: A data-driven perspective
    Zhang, YL
    Owechko, Y
    Zhang, J
    ITSC 2004: 7TH INTERNATIONAL IEEE CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, PROCEEDINGS, 2004, : 642 - 647
  • [4] Workload-Independent Data-Driven Vertical Partitioning
    Bobrov, Nikita
    Chernishev, George
    Novikov, Boris
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2017, 2017, 767 : 275 - 284
  • [5] Diagnosis for PEMFC Based on Magnetic Measurements and Data-Driven Approach
    Li, Zhongliang
    Cadet, Catherine
    Outbib, Rachid
    IEEE TRANSACTIONS ON ENERGY CONVERSION, 2019, 34 (02) : 964 - 972
  • [6] Data-driven Adaptive Control of CRAC in Data Center Based on Online Incremental RVFL
    Huang, Jiangyang
    Zhangl, Zhengxuan
    Yang, Xu
    Tu, Rang
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 962 - 967
  • [7] A framework for data-driven adaptive GUI generation based on DICOM
    Gambino, Orazio
    Rundo, Leonardo
    Cannella, Vincenzo
    Vitabile, Salvatore
    Pirrone, Roberto
    JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 88 : 37 - 52
  • [8] Next-generation data center energy management: a data-driven decision-making framework
    Milic, Vlatko
    FRONTIERS IN ENERGY RESEARCH, 2024, 12
  • [9] Data-Driven Characterization and Modeling of Web Map System Workload
    Braga, Vinicius Goncalves
    Correa, Sand Luz
    Cardoso, Kleber Vieira
    Viana, Aline Carneiro
    IEEE ACCESS, 2021, 9 : 26983 - 27002
  • [10] Boosting Data-Driven Evolutionary Algorithm With Localized Data Generation
    Li, Jian-Yu
    Zhan, Zhi-Hui
    Wang, Chuan
    Jin, Hu
    Zhang, Jun
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2020, 24 (05) : 923 - 937