Probabilistic Threshold Join over Distributed Uncertain Data

被引:0
|
作者
Deng, Lei [1 ]
Wang, Fei [1 ]
Huang, Benxiong [1 ]
机构
[1] Huazhong Univ Sci & Technol, Dept Elect & Informat Engn, Wuhan 430074, Peoples R China
来源
WEB-AGE INFORMATION MANAGEMENT | 2011年 / 6897卷
关键词
Distributed query processing; joins; uncertain data; Bloom filters;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large amount of uncertain data is collected by many emerging applications which contain multiple sources in a distributed manner. Previous efforts on querying uncertain data in distributed environment have only focus on ranking and skyline, join queries have not been addressed in earlier work despite their importance in databases. In this paper, we address distributed probabilistic threshold join query, which retrieves results satisfying the join condition with combining probabilities that meet the threshold requirement from distributed sites. We propose a new kind of bloom filters called Probability Bloom Filters (PBF) to represent set with probabilistic attribute and design a PBF based Bloomjoin algorithm for executing distributed probabilistic threshold join query with communication efficiency. Furthermore, we provide theoretical analysis of the network cost of our algorithm and demonstrate it by simulation. The experiment results show that our algorithm can save network cost efficiently by comparing to original Bloomjoin algorithm in most scenarios.
引用
收藏
页码:68 / 80
页数:13
相关论文
共 50 条
  • [31] Uncertain Data Clustering in Distributed Peer-to-Peer Networks
    Zhou, Jin
    Chen, Long
    Chen, C. L. Philip
    Wang, Yingxu
    Li, Han-Xiong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2392 - 2406
  • [32] Continuous monitoring of skylines over uncertain data streams
    Ding, Xiaofeng
    Lian, Xiang
    Chen, Lei
    Jin, Hai
    INFORMATION SCIENCES, 2012, 184 (01) : 196 - 214
  • [33] An EP-Topk Query over Uncertain Data
    Yang, Zhibang
    Zhou, Xu
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2014, : 577 - 580
  • [34] Extreme learning machine for classification over uncertain data
    Sun, Yongjiao
    Yuan, Ye
    Wang, Guoren
    NEUROCOMPUTING, 2014, 128 : 500 - 506
  • [35] Approximation to expected support of frequent itemsets in mining probabilistic sets of uncertain data
    Cuzzocrea, Alfredo
    Leung, Carson K.
    MacKinnon, Richard Kyle
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 19TH ANNUAL CONFERENCE, KES-2015, 2015, 60 : 613 - 622
  • [36] Probabilistic skylines on uncertain data: model and bounding-pruning-refining methods
    Bin Jiang
    Jian Pei
    Xuemin Lin
    Yidong Yuan
    Journal of Intelligent Information Systems, 2012, 38 : 1 - 39
  • [37] Skyline Query on Uncertain Data Based on Improved Probabilistic Constraint Space Algorithm
    Dong Liming
    Liu Qingbao
    Dai Changhua
    2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 929 - 932
  • [38] Probabilistic skylines on uncertain data: model and bounding-pruning-refining methods
    Jiang, Bin
    Pei, Jian
    Lin, Xuemin
    Yuan, Yidong
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2012, 38 (01) : 1 - 39
  • [39] Skip Search Approach for Mining Probabilistic Frequent Itemsets from Uncertain Data
    Shintani, Takahiko
    Ohmori, Tadashi
    Fujita, Hideyuki
    KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 174 - 180
  • [40] Efficiently Predicting Frequent Patterns over Uncertain Data Streams
    Liu, Chuan-Ming
    Liao, Kuan-Teng
    10TH INT CONF ON EMERGING UBIQUITOUS SYST AND PERVAS NETWORKS (EUSPN-2019) / THE 9TH INT CONF ON CURRENT AND FUTURE TRENDS OF INFORMAT AND COMMUN TECHNOLOGIES IN HEALTHCARE (ICTH-2019) / AFFILIATED WORKOPS, 2019, 160 : 15 - 22