High-speed data deduplication using parallelized cuckoo hashing

被引:1
|
作者
Jeyaraj, Jane Rubel Angelina [1 ]
Kambaraj, Sundarakantham [1 ]
Dharmarajan, Velmurugan [1 ]
机构
[1] Thiagarajar Coll Engn, Dept Comp Sci & Engn, Madurai, Tamil Nadu, India
关键词
Deduplication; parallelized cuckoo; backup;
D O I
10.3906/elk-1708-336
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data deduplication is a capacity optimization technology used in backup systems for identifying and storing the nonredundant data blocks. The CPU intensive tasks involved in a hash-based deduplication system remain as challenges in improving the performance of the system. In this paper, we propose a parallel variant of the standard cuckoo hashing that enables the hashing technique to be performed in parallel. The CPU intensive tasks of fingerprint insertion and lookup operations are performed in parallel and distributed among the nodes of the deduplication cluster. Furthermore, the uniform handling of the blocks by the cluster nodes involved in the process of duplicate identification provides good load balance. Experimental evaluations using real-world backup and Linux kernel data sets reveal that the proposed deduplication system achieves up to 100% higher backup speed, up to 28% reduced lookup latency, and up to 24% reduced backup time than the other deduplication systems.
引用
收藏
页码:1417 / 1429
页数:13
相关论文
共 50 条
  • [1] High-Speed Data Transfer Using PLC
    Misurec, Jiri
    Orgon, Milos
    2018 25TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 2018,
  • [2] Data De-duplication Using Cuckoo Hashing in Cloud Storage
    Sridharan, J.
    Valliyammai, C.
    Karthika, R. N.
    Kulasekaran, L. Nihil
    SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 707 - 715
  • [3] XOR Hashing Algorithms to Measured Flows at the High-Speed Link
    Guang, Cheng
    Wei, Zhao
    Jian, Gong
    FGCN: PROCEEDINGS OF THE 2008 SECOND INTERNATIONAL CONFERENCE ON FUTURE GENERATION COMMUNICATION AND NETWORKING, VOLS 1 AND 2, 2008, : 150 - +
  • [4] High-speed Data Processing through Ultra-high-speed Data Management Using InfiniBand
    Yamamoto, Shoji
    Yamada, Toshiaki
    Shimabayashi, Daisuke
    Sarashina, Hideo
    FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 2015, 51 (02): : 48 - 54
  • [5] High-speed data acquisition
    Lockhart, RW
    MEASUREMENTS & CONTROL, 1999, (194): : 103 - 106
  • [6] HIGH-SPEED DATA RECORDING
    CLARK, G
    PHOTOGRAPHIC SCIENCE AND ENGINEERING, 1963, 7 (02): : 140 - 140
  • [7] HIGH-SPEED DATA CABLES
    HUNTER, S
    ENGINEERING, 1986, 226 (02): : R1 - R3
  • [8] HIGH-SPEED DATA WITH A TWIST
    SEAMAN, J
    COMPUTER DECISIONS, 1985, 17 (16): : 50 - &
  • [9] High-speed data transfer
    Research & Development (Barrington, Illinois), 2000, 42 (02):
  • [10] Efficient Hashing technique based on Bloom filter for High-Speed Network
    He, Gang
    Du, Yanzhe
    Yu, Dechen
    2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 1, 2016, : 58 - 63