AA-Dedupe: An Application-Aware Source Deduplication Approach for Cloud Backup Services in the Personal Computing Environment

被引:51
作者
Fu, Yinjin [1 ]
Jian, Hong [2 ]
Xiao, Nong [1 ]
Tian, Lei [2 ]
Liu, Fang [1 ]
机构
[1] Natl Univ Def Technol, Sch Comp, Changsha 410073, Hunan, Peoples R China
[2] Univ Nebraska, Dept Comp Sci & Engn, Lincoln, NE 68588 USA
来源
2011 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER) | 2011年
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
D O I
10.1109/CLUSTER.2011.20
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The market for cloud backup services in the personal computing environment is growing due to large volumes of valuable personal and corporate data being stored on desktops, laptops and smartphones. Source deduplication has become a mainstay of cloud backup that saves network bandwidth and reduces storage space. However, there are two challenges facing deduplication for cloud backup service clients: (1) low deduplication efficiency due to a combination of the resource-intensive nature of deduplication and the limited system resources on the PC-based client site; and (2) low data transfer efficiency since post-deduplication data transfers from source to backup servers are typically very small but must often cross a WAN. In this paper, we present AA-Dedupe, an application-aware source deduplication scheme, to significantly reduce the computational overhead, increase the deduplication throughput and improve the data transfer efficiency. The AA-Dedupe approach is motivated by our key observations of the substantial differences among applications in data redundancy and deduplication characteristics, and thus is based on an application-aware index structure that effectively exploits this application awareness. Our experimental evaluations, based on an AA-Dedupe prototype implementation, show that our scheme can improve deduplication efficiency over the state-of-art source-deduplication methods by a factor of 2-7, resulting in shortened backup window, increased power-efficiency and reduced cost for cloud backup services.
引用
收藏
页码:112 / 120
页数:9
相关论文
共 24 条
[1]  
Aggarwal B., 2010, USENIX SYMPOSIM NETW, P419
[2]  
Agrawal N, 2007, USENIX ASSOCIATION PROCEEDINGS OF THE 5TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES ( FAST '07), P31
[3]  
[Anonymous], 2009, CLOUD STOR CLOUD COM
[4]  
[Anonymous], 2009, CLOUDS BERKELEY VIEW
[5]  
[Anonymous], 2010, 24 LARGE INSTALLATIO
[6]  
[Anonymous], EMC AV
[7]  
[Anonymous], 2009, 7 USENIX C FIL STOR
[8]  
[Anonymous], 2009, INFORM MANAGEMENT J, V43/5, P20
[9]  
[Anonymous], DATA LOSS STAT
[10]  
[Anonymous], 2005, HPL200530R1