BDWatchdog: Real-time monitoring and profiling of Big Data applications and frameworks

被引:10
作者
Enes, Jonatan [1 ]
Exposito, Roberto R. [1 ]
Tourino, Juan [1 ]
机构
[1] Univ A Coruna, Comp Architecture Grp, Campus A Coruna, La Coruna 15701, Spain
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2018年 / 87卷
关键词
Big data; Monitoring; Profiling; Time series; Flame graphs; Process-based analysis;
D O I
10.1016/j.future.2017.12.068
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Current Big Data applications are characterized by a heavy use of system resources (e.g., CPU, disk) generally distributed across a cluster. To effectively improve their performance there is a critical need for an accurate analysis of both Big Data workloads and frameworks. This means to fully understand how the system resources are being used in order to identify potential bottlenecks, from resource to code bottlenecks. This paper presents BDWatchdog, a novel framework that allows real-time and scalable analysis of Big Data applications by combining time series for resource monitorization and flame graphs for code profiling, focusing on the processes that make up the workload rather than the underlying instances on which they are executed. This shift from the traditional system-based monitorization to a process-based analysis is interesting for new paradigms such as software containers or serverless computing, where the focus is put on applications and not on instances. BDWatchdog has been evaluated on a Big Data cloud-based service deployed at the CESGA supercomputing center. The experimental results show that a process-based analysis allows for a more effective visualization and overall improves the understanding of Big Data workloads. BDWatchdog is publicly available at http://bdwatchdog.dec.udc.es. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:420 / 437
页数:18
相关论文
共 37 条
[21]  
Liu H., 2008, COMPUT TELECOMMUN, V4, P4, DOI DOI 10.1016/J.BREAST.2008.04.004]
[22]   The ganglia distributed monitoring system: design, implementation, and experience [J].
Massie, ML ;
Chun, BN ;
Culler, DE .
PARALLEL COMPUTING, 2004, 30 (07) :817-840
[23]  
Membrey Peter., 2010, The Definitive Guide to MongoDB: the noSQL Database for Cloud and Desktop Computing
[24]  
PILLET V, 1995, TRANSPUT OCCAM ENG S, V44, P17
[25]  
Poggi N, 2014, IEEE INT CONF BIG DA, P905, DOI 10.1109/BigData.2014.7004322
[26]   GOOGLE-WIDE PROFILING: A CONTINUOUS PROFILING INFRASTRUCTURE FOR DATA CENTERS [J].
Ren, Gang ;
Tune, Eric ;
Moseley, Tipp ;
Shi, Yixin ;
Rus, Silvius ;
Hundt, Robert .
IEEE MICRO, 2010, 30 (04) :65-78
[27]  
Shvachko K., 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), P1
[28]  
Sigoure B., 2011, OREILLY OPEN SOURCE
[29]  
Tader P., 2010, LINUX J, P72
[30]  
The Apache Software Foundation, 2017, HBASE DISTR DAT LARG