Selective Data Replication for Online Social Networks with Distributed Datacenters

被引:29
作者
Liu, Guoxin [1 ]
Shen, Haiying [1 ]
Chandler, Harrison [2 ]
机构
[1] Clemson Univ, Dept Elect & Comp Engn, Clemson, SC 29634 USA
[2] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
Social networks; datacenter; scalability; data replication; locality;
D O I
10.1109/TPDS.2015.2485266
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Though the new OSN model, which deploys datacenters globally, helps reduce service latency, it causes higher inter-datacenter communication load. In Facebook, each datacenter has a full copy of all data, and the master datacenter updates all other datacenters, generating tremendous load in this new model. Distributed data storage, which only stores a user's data to his/her geographically closest datacenters mitigates the problem. However, frequent interactions between distant users lead to frequent inter-datacenter communication and hence long service latencies. In this paper, we aim to reduce inter-datacenter communications while still achieving low service latency. We first verify the benefits of the new model and present OSN typical properties that underlie the basis of our design. We then propose Selective Data replication mechanism in Distributed Datacenters (SD3). Since replicas need inter-datacenter data updates, datacenters in SD3 jointly consider update rates and visit rates to select user data for replication; furthermore, SD3 atomizes users' different types of data (e.g., status update, friend post, music) for replication, ensuring that a replica always reduces inter-datacenter communication. SD3 also incorporates three strategies to further enhance its performance: locality-aware multicast update tree, replica deactivation, and datacenter congestion control. The results of trace-driven experiments on the real-world PlanetLab testbed demonstrate the higher efficiency and effectiveness of SD3 in comparison to other replication methods and the effectiveness of its three schemes.
引用
收藏
页码:2377 / 2393
页数:17
相关论文
共 47 条
[1]  
Ahn YY, 2007, WWW '07: Proceedings of the 16th international conference on World Wide Web, P835
[2]  
[Anonymous], P 4 ANN S CLOUD COMP
[3]  
[Anonymous], 2010, P 7 USENIX C NETW SY
[4]  
[Anonymous], 2013, ANN TECHN C ATC USEN
[5]  
[Anonymous], 2010, P 3 WORKSH SOC NETW
[6]  
[Anonymous], 2013, 10 USENIX S NETW SYS
[7]  
[Anonymous], LULEA DATA CTR IS FA
[8]  
[Anonymous], MAPREDUCE TUTORIAL
[9]  
[Anonymous], P INF
[10]  
[Anonymous], 2011, SOCIAL NETWORKINGSIT