A model to compare cloud and non-cloud storage of Big Data

被引:88
作者
Chang, Victor [1 ]
Wills, Gary [2 ]
机构
[1] Leeds Beckett Univ, Sch Comp Creat Technol & Engn, Leeds, W Yorkshire, England
[2] Univ Southampton, Sch Elect & Comp Sci, Southampton, Hants, England
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2016年 / 57卷
关键词
Organizational sustainability modeling (OSM); Comparison between Cloud and non-Cloud storage platforms; Real Cloud case studies; Data analysis and visualization;
D O I
10.1016/j.future.2015.10.003
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
When comparing Cloud and non-Cloud Storage it can be difficult to ensure that the comparison is fair. In this paper we examine the process of setting up such a comparison and the metric used. Performance comparisons on Cloud and non-Cloud systems, deployed for biomedical scientists, have been conducted to identify improvements of efficiency and performance. Prior to the experiments, network latency, file size and job failures were identified as factors which degrade performance and experiments were conducted to understand their impacts. Organizational Sustainability Modeling (OSM) is used before, during and after the experiments to ensure fair comparisons are achieved. OSM defines the actual and expected execution time, risk control rates and is used to understand key outputs related to both Cloud and non Cloud experiments. Forty experiments on both Cloud and non-Cloud systems were undertaken with two case studies. The first case study was focused on transferring and backing up 10,000 files of 1 GB each and the second case study was focused on transferring and backing up 1000 files 10 GB each. Results showed that first, the actual and expected execution time on the Cloud was lower than on the non-Cloud system. Second, there was more than 99% consistency between the actual and expected execution time on the Cloud while no comparable consistency was found on the non-Cloud system. Third, the improvement in efficiency was higher on the Cloud than the non-Cloud. OSM is the metric used to analyze the collected data and provided synthesis and insights to the data analysis and visualization of the two case studies. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:56 / 76
页数:21
相关论文
共 31 条
[1]  
Agresti Alan., 2010, Wiley Series in Probability and Statistics, V2nd, DOI DOI 10.1002/9780470594001
[2]  
[Anonymous], 2009, INFORM SECURITY MANA
[3]  
[Anonymous], PROPOSED MODEL ANAL
[4]  
[Anonymous], 2010, ACM SIGOPSOper. Syst. Rev., DOI DOI 10.1145/1842733.1842736
[5]  
[Anonymous], 2014, INT J BIG DATA INTEL, DOI DOI 10.1504/IJBDI.2014.066954
[6]  
Bowers KD, 2009, CCS'09: PROCEEDINGS OF THE 16TH ACM CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, P187
[7]   Comparative analysis of architectures for monitoring cloud computing infrastructures [J].
Calero, Jose M. Alcaraz ;
Gutierrez Aguado, Juan .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2015, 47 :16-30
[8]  
Calheiros R., 2014, IEEE T CLOUD COMPUT, VPP, P99
[9]  
CHANDY KM, 1978, COMPUT SURV, V10, P281, DOI 10.1145/356733.356737
[10]  
Chang V., 2015, BIG DATA SYSTEM DISA