A cluster-based data deduplication technology

Cited by: 1
Authors
Tseng, Chuan-Mu [1]
Ciou, Jheng-Rong [2]
Liu, Tzong-Jye [2]
Affiliations
[1] Jeh Teh Jr Coll Med Nursing & Management, Dept Appl Digital Media, Miaoli, Taiwan
[2] Feng Chia Univ, Dept Informat Engn & Comp Sci, Taichung, Taiwan
Source
2014 SECOND INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR) | 2014
Keywords
Bloom filter; cluster; data deduplication
DOI
10.1109/CANDAR.2014.22
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Data deduplication technology usually relies on a Bloom filter to identify redundant data quickly and correctly. A Bloom filter can determine whether a chunk may already be stored, but it can return false positives. To rule out a false positive, a new chunk must be compared with the chunks that have already been stored. To reduce the time spent ruling out Bloom filter false positives, current research uses many small index tables to store chunk IDs. However, the target chunk ID is stored in only one index table, so searching the other index tables wastes a great deal of time. In this paper, we cluster the stored chunks to reduce the time needed to rule out the false positives induced by the Bloom filter.
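The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how chunks are clustered, so the rule in `_cluster_of` (first byte of the SHA-1 fingerprint modulo the number of clusters) is a placeholder assumption, and the names `BloomFilter` and `ClusteredDedupStore` are hypothetical. The sketch only shows that, after a Bloom filter hit, a single per-cluster index table is consulted to confirm or reject the match.

```python
import hashlib


class BloomFilter:
    """Fixed-size Bloom filter built from salted SHA-256 hashes."""

    def __init__(self, num_bits=1 << 16, num_hashes=4):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits // 8 + 1)

    def _positions(self, key):
        # Derive num_hashes bit positions by prefixing the key with a salt byte.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(bytes([i]) + key).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key):
        # False means "definitely absent"; True may be a false positive.
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(key))


class ClusteredDedupStore:
    """Chunk store with one index table per cluster (illustrative only)."""

    def __init__(self, num_clusters=8):
        self.bloom = BloomFilter()
        self.clusters = [dict() for _ in range(num_clusters)]

    def _cluster_of(self, chunk_id):
        # Assumption: placeholder clustering rule, not the paper's method.
        return chunk_id[0] % len(self.clusters)

    def store(self, chunk):
        """Return True if the chunk was new and stored, False if duplicate."""
        chunk_id = hashlib.sha1(chunk).digest()
        table = self.clusters[self._cluster_of(chunk_id)]
        # On a Bloom hit, only this one table is searched to rule out
        # a false positive; the other tables are never touched.
        if self.bloom.might_contain(chunk_id) and chunk_id in table:
            return False
        self.bloom.add(chunk_id)
        table[chunk_id] = chunk
        return True


store = ClusteredDedupStore()
first = store.store(b"hello")    # new chunk
second = store.store(b"hello")   # duplicate of the first
third = store.store(b"world")    # another new chunk
```

Because each chunk ID maps to exactly one cluster, the cost of rejecting a false positive is one table lookup rather than a scan over every index table.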
Pages: 226-230
Page count: 5