A Collision-Mitigation Cuckoo Hashing Scheme for Large-Scale Storage Systems

被引:9
作者
Sun, Yuanyuan [1 ]
Hua, Yu [1 ]
Feng, Dan [1 ]
Yang, Ling [1 ]
Zuo, Pengfei [1 ]
Cao, Shunde [1 ]
Guo, Yuncheng [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan Natl Lab Optoelectron, Wuhan 430074, Peoples R China
关键词
Cuckoo hashing; cloud storage; data insertion and query; CLOUD; RETRIEVAL; SEARCH; TIME;
D O I
10.1109/TPDS.2016.2594763
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the rapid growth of the amount of information, cloud computing servers need to process and analyze large amounts of high-dimensional and unstructured data timely and accurately. This usually requires many query operations. Due to simplicity and ease of use, cuckoo hashing schemes have been widely used in real-world cloud-related applications. However, due to the potential hash collisions, the cuckoo hashing suffers from endless loops and high insertion latency, even high risks of re-construction of entire hash table. In order to address these problems, we propose a cost-efficient cuckoo hashing scheme, called MinCounter. The idea behind MinCounter is to alleviate the occurrence of endless loops in the data insertion by selecting unbusy kicking-out routes. MinCounter selects the "cold" (infrequently accessed), rather than random, buckets to handle hash collisions. We further improve the concurrency of the MinCounter scheme to pursue higher performance and adapt to concurrent applications. MinCounter has the salient features of offering efficient insertion and query services and delivering high performance of cloud servers, as well as enhancing the experiences for cloud users. We have implemented MinCounter in a large-scale cloud testbed and examined the performance by using three real-world traces. Extensive experimental results demonstrate the efficacy and efficiency of MinCounter.
引用
收藏
页码:619 / 632
页数:14
相关论文
共 61 条
[1]  
[Anonymous], 2015, MATH PROBL ENG
[2]  
[Anonymous], 2014, BIORESOUR BIOPROCESS
[3]  
[Anonymous], 2013, NSDI
[4]  
[Anonymous], 2014, 1672 IDC
[5]   A View of Cloud Computing [J].
Armbrust, Michael ;
Fox, Armando ;
Griffith, Rean ;
Joseph, Anthony D. ;
Katz, Randy ;
Konwinski, Andy ;
Lee, Gunho ;
Patterson, David ;
Rabkin, Ariel ;
Stoica, Ion ;
Zaharia, Matei .
COMMUNICATIONS OF THE ACM, 2010, 53 (04) :50-58
[6]  
Baeza-Yates Ricardo A., 1991, HDB ALGORITHMS DATA
[7]  
Björkqvist M, 2011, IEEE INFOCOM SER, P1080, DOI 10.1109/INFCOM.2011.5934883
[8]   REDUCING RETRIEVAL TIME OF SCATTER STORAGE TECHNIQUES [J].
BRENT, RP .
COMMUNICATIONS OF THE ACM, 1973, 16 (02) :105-109
[9]   Continuous Cloud-Scale Query Optimization and Processing [J].
Bruno, Nicolas ;
Jain, Sapna ;
Zhou, Jingren .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (11) :961-972
[10]  
Bykov S., 2011, P 2 ACM S CLOUD COMP