On efficient conditioning of probabilistic relational databases

被引:5
|
作者
Zhu, Hong [1 ]
Zhang, Caicai [1 ]
Cao, Zhongsheng [1 ]
Tang, Ruiming [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, 1037 Luoyu Rd, Wuhan 430074, Peoples R China
[2] Huawei Noahs Ark Lab, Hong Kong, Hong Kong, Peoples R China
关键词
Probabilistic databases; Possible worlds; Conditioning; Functional dependency; Constraints; CLEANING UNCERTAIN DATA;
D O I
10.1016/j.knosys.2015.10.017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A probabilistic relational database is a probability distribution over a set of deterministic relational databases (namely, possible worlds). Efficient updating information in probabilistic databases is required in several applications, such as sensor networking and data cleaning. As a way to update a probabilistic database, conditioning refines the probability distribution of the possible worlds based on general knowledge, such as functional dependencies. The existing methods for conditioning are exponential over the number of variables in the probabilistic database for an arbitrary constraint. In this paper, a constraint-based conditioning framework is proposed, which solves the conditioning problem by considering only the variables in the given constraint. Then, we prove the correctness of our proposed approach and provide efficient algorithms for each step of the approach. Afterward, a pruning strategy that can significantly improve the efficiency of the constraint-based approach is proposed for the functional dependency constraints. Furthermore, for functional dependency constraints, a variable-elimination strategy that minimizes the number of generated variables can benefit the subsequent query processing. The experimental study shows that the constraint based approach is more efficient than other approaches described in the literature. The effectiveness of the two optimization strategies for functional dependency constraints is also demonstrated in the experiment. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:112 / 126
页数:15
相关论文
共 50 条
  • [1] An Efficient Conditioning Method for Probabilistic Relational Databases
    Zhu, Hong
    Zhang, Caicai
    Cao, Zhongsheng
    Tang, Ruiming
    Yang, Mengyuan
    WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 225 - 236
  • [2] An Efficient Initialization Method for Probabilistic Relational Databases
    Zhu, Hong
    Zhang, Caicai
    Cao, Zhongsheng
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2016, PT II, 2016, 9828 : 454 - 462
  • [3] Conditioning Probabilistic Databases
    Koch, Christoph
    Olteanu, Dan
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01): : 313 - 325
  • [4] Probabilistic ontologies and relational databases
    Udrea, O
    Yu, D
    Hung, E
    Subrahmanian, VS
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2005: COOPIS, DOA, AND ODBASE, PT 1, PROCEEDINGS, 2005, 3760 : 1 - 17
  • [5] Query with Assumptions for Probabilistic Relational Databases
    Zhang, Caicai
    Mei, Zhuolin
    Wu, Bin
    Zhao, Zhiqiang
    Yu, Jing
    Wang, Qingqing
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2020, 27 (03): : 923 - 932
  • [6] Query evaluation in probabilistic relational databases
    Zimanyi, E
    THEORETICAL COMPUTER SCIENCE, 1997, 171 (1-2) : 179 - 219
  • [7] Query Optimization Strategies in Probabilistic Relational Databases
    Zhang, Caicai
    Cao, Zhongsheng
    Zhu, Hong
    THEORETICAL COMPUTER SCIENCE, NCTCS 2017, 2017, 768 : 208 - 220
  • [8] On the Division Operator for Probabilistic and Possibilistic Relational Databases
    Bosc, Patrick
    Pivert, Olivier
    2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 110 - 116
  • [9] Conditioning Probabilistic Relational Data with Referential Constraints
    Tang, Ruiming
    Shao, Dongxu
    Ba, M. Lamine
    Wu, Huayu
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014, 2014, 8505 : 413 - 427
  • [10] An efficient join for nested relational databases
    Liu, HC
    Chirathamjaree, C
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, 1996, 1134 : 289 - 301