On efficient conditioning of probabilistic relational databases

被引:5
|
作者
Zhu, Hong [1 ]
Zhang, Caicai [1 ]
Cao, Zhongsheng [1 ]
Tang, Ruiming [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, 1037 Luoyu Rd, Wuhan 430074, Peoples R China
[2] Huawei Noahs Ark Lab, Hong Kong, Hong Kong, Peoples R China
关键词
Probabilistic databases; Possible worlds; Conditioning; Functional dependency; Constraints; CLEANING UNCERTAIN DATA;
D O I
10.1016/j.knosys.2015.10.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A probabilistic relational database is a probability distribution over a set of deterministic relational databases (namely, possible worlds). Efficient updating information in probabilistic databases is required in several applications, such as sensor networking and data cleaning. As a way to update a probabilistic database, conditioning refines the probability distribution of the possible worlds based on general knowledge, such as functional dependencies. The existing methods for conditioning are exponential over the number of variables in the probabilistic database for an arbitrary constraint. In this paper, a constraint-based conditioning framework is proposed, which solves the conditioning problem by considering only the variables in the given constraint. Then, we prove the correctness of our proposed approach and provide efficient algorithms for each step of the approach. Afterward, a pruning strategy that can significantly improve the efficiency of the constraint-based approach is proposed for the functional dependency constraints. Furthermore, for functional dependency constraints, a variable-elimination strategy that minimizes the number of generated variables can benefit the subsequent query processing. The experimental study shows that the constraint based approach is more efficient than other approaches described in the literature. The effectiveness of the two optimization strategies for functional dependency constraints is also demonstrated in the experiment. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:112 / 126
页数:15
相关论文
共 50 条
  • [41] Efficient execution of aggregation queries over encrypted relational databases
    Hacigümüs, H
    Iyer, B
    Mehrotra, S
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2004, 2973 : 125 - 136
  • [42] A SECURE AND EFFICIENT ORDER PRESERVING ENCRYPTION SCHEME FOR RELATIONAL DATABASES
    Kadhem, Hasan
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    KMIS 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE MANAGEMENT AND INFORMATION SHARING, 2010, : 25 - 35
  • [43] Progressive ranking for efficient keyword search over relational databases
    Li, Guoliang
    Feng, Jianhua
    Lin, Feng
    Zhou, Lizhu
    SHARING DATA, INFORMATION AND KNOWLEDGE, PROCEEDINGS, 2008, 5071 : 193 - 197
  • [44] OBJECT DATABASES AS GENERALIZATIONS OF RELATIONAL DATABASES
    BEECH, D
    OZBUTUN, C
    COMPUTER STANDARDS & INTERFACES, 1991, 13 (1-3) : 221 - 230
  • [45] Efficient Queries Evaluation on Block Independent Disjoint Probabilistic Databases
    Qin, Biao
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2015, PT II, 2015, 9050 : 74 - 88
  • [46] Ranking Query Answers in Probabilistic Databases: Complexity and Efficient Algorithms
    Olteanu, Dan
    Wen, Hongkai
    2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 282 - 293
  • [47] Efficient weighted probabilistic frequent itemset mining in uncertain databases
    Li, Zhiyang
    Chen, Fengjuan
    Wu, Junfeng
    Liu, Zhaobin
    Liu, Weijiang
    EXPERT SYSTEMS, 2021, 38 (05)
  • [48] Efficient Subgraph Similarity Search on Large Probabilistic Graph Databases
    Yuan, Ye
    Wang, Guoren
    Chent, Lei
    Wang, Haixun
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (09): : 800 - 811
  • [49] Using Probabilistic Relational Models to Generate Synthetic Spatial or Non-spatial Databases
    Chulyadyo, Rajani
    Leray, Philippe
    2018 12TH INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN INFORMATION SCIENCE (RCIS), 2018,
  • [50] Efficient Multiple Query Answering in Switched Probabilistic Relational Models
    Gehrke, Marcel
    Braun, Tanya
    Moeller, Ralf
    AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11919 : 104 - 116