Automatic Generation of Workload Profiles Using Unsupervised Learning Pipelines

被引:13
作者
Buchaca Prats, David [1 ,2 ]
Lluis Berral, Josep [1 ,2 ]
Carrera, David [1 ,2 ]
机构
[1] Univ Politecn Cataluna, Dept Arquitectura Comp, ES-08034 Barcelona, Spain
[2] Barcelona Supercomp Ctr, Data Centr Comp, Barcelona 08034, Spain
来源
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2018年 / 15卷 / 01期
基金
欧洲研究理事会;
关键词
Unsupervised learning; CRBM; deep learning; workload modeling; phase detection; MapReduce;
D O I
10.1109/TNSM.2017.2786047
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The complexity of resource usage and power consumption on cloud-based applications makes the understanding of application behavior through expert examination difficult. The difficulty increases when applications are seen as "black boxes," where only external monitoring can be retrieved. Furthermore, given the different amount of scenarios and applications, automation is required. Here, we examine and model application behavior by finding behavior phases. We use conditional restricted Boltzmann machines (CRBMs) to model time-series containing resources traces measurements like CPU, memory, and IO. CRBMs can be used to map a given historic window of trace behavior into a single vector. This low dimensional and time-aware vector can be passed through clustering methods, from simplistic ones like k-means to more complex ones like those based on hidden Markov models. We use these methods to find phases of similar behavior in the workloads. Our experimental evaluation shows that the proposed method is able to identify different phases of resource consumption across different workloads. We show that the distinct phases contain specific resource patterns that distinguish them.
引用
收藏
页码:142 / 155
页数:14
相关论文
共 38 条
[1]  
Amara Mustapha., 2017, Personal, Indoor, and Mobile Radio Communications (PIMRC), P1
[2]  
[Anonymous], 2017, HAD
[3]  
[Anonymous], 2017, NETFL ATL
[4]  
[Anonymous], 2014, 2014 IEEE INT C IC D
[5]  
[Anonymous], 2008, ADV NEURAL INFORM PR
[6]  
[Anonymous], 2012, P 4 USENIX C HOT TOP
[7]   Performance Characterization of In-Memory Data Analytics on a Modern Cloud Server [J].
Awan, Ahsan Javed ;
Brorsson, Mats ;
Vlassov, Vladimir ;
Ayguade, Eduard .
PROCEEDINGS 2015 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING BDCLOUD 2015, 2015, :1-8
[8]   ALOJA-ML: A Framework for Automating Characterization and Knowledge Discovery in Hadoop Deployments [J].
Berral, Josep Ll. ;
Poggi, Nicolas ;
Carrera, David ;
Call, Aaron ;
Reinauer, Rob ;
Green, Daron .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :1701-1710
[9]   A portable programming interface for performance evaluation on modern processors [J].
Browne, S ;
Dongarra, J ;
Garner, N ;
Ho, G ;
Mucci, P .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2000, 14 (03) :189-204
[10]   Forecasting High Dimensional Volatility Using Conditional Restricted Boltzmann Machine on GPU [J].
Cai, Xianggao ;
Lin, Xiaola .
2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, :1979-1986