Real-Time Data Center's Telemetry Reduction and Reconstruction Using Markov Chain Models

被引:6
作者
Baig, Shuja-ur-Rehman [1 ,2 ]
Iqbal, Waheed [3 ]
Berral, Josep Lluis [1 ,2 ]
Erradi, Abdelkarim [4 ]
Carrera, David [1 ,2 ]
机构
[1] Barcelona Supercomp Ctr, Barcelona 08034, Spain
[2] Univ Politecn Cataluna, ES-08034 Barcelona, Spain
[3] Univ Punjab, Punjab Univ Coll Informat Technol, Lahore 54590, Pakistan
[4] Qatar Univ, Dept Comp Sci & Engn, Doha 2713, Qatar
来源
IEEE SYSTEMS JOURNAL | 2019年 / 13卷 / 04期
基金
欧洲研究理事会;
关键词
Data center monitoring; data reconstruction; data reduction; Markov chain models (MMs); polynomial regression (PR); real time; telemetry; DIMENSIONALITY REDUCTION;
D O I
10.1109/JSYST.2019.2918430
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large-scale data centers are composed of thousands of servers organized in interconnected racks to offer services to users. These data centers continuously generate large amounts of telemetry data streams (e.g., hardware utilization metrics) used for multiple purposes, including resource management, workload characterization, resource utilization prediction, capacity planning, and real-time analytics. These telemetry streams require costly bandwidth utilization and storage space, particularly at medium-long term for large data centers. This paper addresses this problem by proposing and evaluating a system to efficiently reduce bandwidth and storage for telemetry data through real-time modeling using Markov chain based methods. Our proposed solution was evaluated using real telemetry datasets and compared with polynomial regression methods for reducing and reconstructing data. Experimental results show that data can be lossy compressed up to 75% for bandwidth utilization and 95.33% for storage space, with reconstruction accuracy close to 92%.
引用
收藏
页码:4039 / 4050
页数:12
相关论文
共 47 条
[1]  
[Anonymous], 2014, A First Course in Stochastic Processes
[2]  
[Anonymous], 2007, Information retrieval for music and motion
[3]  
[Anonymous], 2016, 2016 CHINA INT C ELE
[4]  
[Anonymous], 1925, MATH PROC CAMBRIDGE
[5]  
[Anonymous], 1999, Proceedings of the Ijcai-99 Workshop on Neural, Symbolic and Reinforcement Learning Methods for Sequence Learning
[6]   Performance Characterization of Spark Workloads on Shared NUMA Systems [J].
Baig, Shuja-ur-Rehman ;
Amaral, Marcelo ;
Polo, Jorda ;
Carrera, David .
2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2018), 2018, :41-48
[7]  
Berndt D.J., 1994, Advances in Knowledge Discovery and Data Mining, P359
[8]  
BHUIYAN MZA, 2017, GLOBECOM 2017 2017 I, P1, DOI DOI 10.1109/GLOCOM.2017.8254997
[9]  
Birke R., 2012, ACM ICAC, P161
[10]   FPC: A High-Speed Compressor for Double-Precision Floating-Point Data [J].
Burtscher, Martin ;
Ratanaworabhan, Paruj .
IEEE TRANSACTIONS ON COMPUTERS, 2009, 58 (01) :18-31