A survey on data storage and placement methodologies for Cloud-Big Data ecosystem

Cited by: 60
Authors
Mazumdar, Somnath [1]
Seybold, Daniel [2]
Kritikos, Kyriakos [3]
Verginadis, Yiannis [4]
Affiliations
[1] Simula Res Lab, N-1325 Lysaker, Norway
[2] Ulm Univ, Ulm, Germany
[3] ICS FORTH, Iraklion, Crete, Greece
[4] Inst Commun & Comp Syst ICCS, 9 Iroon Polytech Str, Athens, Greece
Keywords
Big Data; Cloud; Data models; Data storage; Placement; DATA-INTENSIVE APPLICATIONS; AWARE DATA PLACEMENT; MANAGEMENT; ELASTICITY; SYSTEMS; TECHNOLOGIES; SCALABILITY; CHALLENGES; DATABASES; SERVICES;
DOI
10.1186/s40537-019-0178-3
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Currently, the data to be explored and exploited by computing systems is increasing at an exponential rate. This massive amount of data, so-called "Big Data", puts pressure on existing technologies to provide scalable, fast and efficient support. Recent applications and the current user support from multi-domain computing have assisted the migration from data-centric to knowledge-centric computing. However, it remains a challenge to optimally store and place or migrate such huge data sets across data centers (DCs). In particular, because application and DC behaviour (i.e., resources or latencies) changes frequently, data access or usage patterns need to be analyzed as well. The main objective is to find a better data storage location that improves the overall data placement cost as well as the application performance (such as throughput). In this survey paper, we provide a state-of-the-art overview of Cloud-centric Big Data placement together with the data storage methodologies. It is an attempt to highlight the actual correlation between these two in terms of better supporting Big Data management. Our focus is on management aspects, which are seen under the prism of non-functional properties. In the end, readers can appreciate the deep analysis of the respective technologies related to the management of Big Data and be guided towards their selection in the context of satisfying their non-functional application requirements. Furthermore, challenges are supplied that highlight the current gaps in Big Data management, marking down the way it needs to evolve in the near future.
Pages: 37