Exploiting the Data Redundancy Locality to Improve the Performance of Deduplication-based Storage Systems

被引:0
作者
Wu, Suzhen [1 ]
Chen, Xiao [1 ]
Mao, Bo [2 ]
机构
[1] Xiamen Univ, Dept Comp Sci, Xiamen Shi, Fujian Sheng, Peoples R China
[2] Xiamen Univ, Software Sch, Xiamen Shi, Fujian Sheng, Peoples R China
来源
2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS) | 2016年
关键词
Storage Systems; Data Deduplication; Redundancy Locality; Performance Evaluation;
D O I
10.1109/ICPADS.2016.74
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The chunk-lookup disk bottleneck and the read amplification problems are two great challenges for deduplication-based storage systems and restrict the applicability of data deduplication for large-scale data volumes. Previous studies and our experimental evaluations have shown that the amount of redundant data shared among different types of applications is negligible. Based on the observations, we propose AA-Plus which effectively groups the hash index of the same application together and divides the whole hash index into different groups based on the application types. Moreover, it groups the data chunks of the same application together on the disks. The extensive trace-driven experiments conducted on our lightweight prototype implementation of AA-Plus show that compared with AA-Dedupe, AA-Plus significantly speeds up the write throughput by a factor of up to 6.9 and with an average of 3.1, and speeds up the read throughput by a factor of up to 3.3 and with an average of 1.9.
引用
收藏
页码:527 / 534
页数:8
相关论文
共 26 条
  • [1] [Anonymous], P 28 IEEE INT PAR DI
  • [2] [Anonymous], 2011, P FAST 2
  • [3] [Anonymous], 2011, P 9 USENIX C FIL STO
  • [4] [Anonymous], 2008, FAST
  • [5] [Anonymous], 2009, 7 USENIX C FIL STOR
  • [6] Bhagwat D, 2009, 2009 IEEE INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS & SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS), P237
  • [7] Chen Feng, 2011, P 9 USENIX C FIL STO
  • [8] Debnath Biplob K, 2010, P 2010 USENIX ANN TE, P1
  • [9] El-Shimi A., 2012, P 2012 USENIX ANN TE
  • [10] Reducing Fragmentation for In-line Deduplication Backup Storage via Exploiting Backup History and Cache Knowledge
    Fu, Min
    Feng, Dan
    Hua, Yu
    He, Xubin
    Chen, Zuoning
    Liu, Jingning
    Xia, Wen
    Huang, Fangting
    Liu, Qing
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (03) : 855 - 868