Data popularity measurements in distributed systems: Survey and design directions

被引:15
作者
Hamdeni, C. [1 ]
Hamrouni, T. [1 ]
Ben Charrada, F. [1 ]
机构
[1] Tunis El Manar Univ, Dept Comp Sci, Fac Sci Tunis, Univ Campus, Tunis, Tunisia
关键词
Distributed system; Replication strategy; Data popularity; Access pattern; Temporal locality; DATA REPLICATION STRATEGY; PLACEMENT STRATEGIES; DYNAMIC REPLICATION; AVAILABILITY; MANAGEMENT; IMPACT;
D O I
10.1016/j.jnca.2016.06.002
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed systems continue to be a promising area of research particularly in terms of providing efficient data access and maximum data availability for large-scale applications. For improving performances of distributed systems, several data replication strategies have been proposed to ensure reliability and data transfer speed as well as to offer the possibility to access the data efficiently from multiple locations. Data popularity is one of the most important parameters taken into consideration when designing data replication strategies. It assesses how much the data is requested by the sites of the system. In this paper, the importance of considering the data popularity parameter in replication management is highlighted. Different strategies are then identified and how they rely on the data popularity parameter is illustrated. Different calculation manners of data popularity are hence studied. This allows us to find out which factors are considered in order to assess data popularity. After classifying them into four categories, this work includes a critical discussion about each category. Some important directions for future work are then discussed towards possible solutions for a more effective data popularity assessment. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:150 / 161
页数:12
相关论文
共 66 条
  • [1] Abad C. L., 2012, 2012 IEEE International Symposium on Workload Characterization (IISWC 2012), P100, DOI 10.1109/IISWC.2012.6402909
  • [2] Aiqiang Gao, 2010, Proceedings 2010 5th International Conference on Pervasive Computing and Applications (ICPCA 2010), P250, DOI 10.1109/ICPCA.2010.5704107
  • [3] Al Mistarihi HHE, 2008, INT J COMPUT SCI NET, V8, P22
  • [4] [Anonymous], AVAILABILITY POPULAR
  • [5] [Anonymous], [No title captured]
  • [6] Barrefors B, 2015, THESIS
  • [7] OptorSim: A grid simulator for studying dynamic data replication strategies
    Bell, WH
    Cameron, DG
    Capozza, L
    Millar, AP
    Stockinger, K
    Zini, F
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2003, 17 (04) : 403 - 416
  • [8] Ben Charrada F, 2011, INT J GRID UTIL COMP, V2, P156, DOI 10.1504/IJGUC.2011.040603
  • [9] Bonacorsi D., 2015, P 21 INT C COMP HIGH, P1
  • [10] A threshold-based dynamic data replication strategy
    Bsoul, Mohammad
    Al-Khasawneh, Ahmad
    Kilani, Yousef
    Obeidat, Ibrahim
    [J]. JOURNAL OF SUPERCOMPUTING, 2012, 60 (03) : 301 - 310