Query evaluation over probabilistic XML

被引:21
|
作者
Kimelfeld, Benny [1 ]
Kosharovsky, Yuri [2 ]
Sagiv, Yehoshua [2 ]
机构
[1] IBM Almaden Res Ctr, San Jose, CA USA
[2] Hebrew Univ Jerusalem, Jerusalem, Israel
来源
VLDB JOURNAL | 2009年 / 18卷 / 05期
基金
以色列科学基金会;
关键词
Probabilistic databases; Probabilistic XML; Query processing; Query optimization; Approximate query evaluation; INFORMATION; ALGORITHMS; COMPLEXITY; ALGEBRA; MODEL; HARD;
D O I
10.1007/s00778-009-0150-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Query evaluation over probabilistic XML is explored. The queries are twig patterns with projection, and the data is represented in terms of three models of probabilistic XML (that extend existing ones in the literature). The first model makes an assumption of independence among the probabilistic junctions, whereas the second model can encode probabilistic dependencies. The third model combines the first two and, hence, is the most general. An efficient algorithm (under data complexity) is given for query evaluation in the first model. In addition, various optimizations are proposed, and their effectiveness is shown both analytically and experimentally. For the other two models, it is shown that every query is either intractable or trivial. Nonetheless, efficient (additive and multiplicative) approximation algorithms are given for these two models. Finally, Boolean queries are enriched by allowing disjunctions and negations of branches. The above algorithm for the first model is extended to handle these queries. For the other two models, there is an efficient additive approximation, and a multiplicative one also exists if there is no negation; in addition, it is shown that if the query is non-monotonic, then no efficient multiplicative approximation exists unless NP = RP.
引用
收藏
页码:1117 / 1140
页数:24
相关论文
共 50 条
  • [1] Query evaluation over probabilistic XML
    Benny Kimelfeld
    Yuri Kosharovsky
    Yehoshua Sagiv
    The VLDB Journal, 2009, 18 : 1117 - 1140
  • [2] Incorporating Constraints in Probabilistic XML
    Cohen, Sara
    Kimelfeld, Benny
    Sagiv, Yehoshua
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2009, 34 (03):
  • [3] Quasi-SLCA Based Keyword Query Processing over Probabilistic XML Data
    Li, Jianxin
    Liu, Chengfei
    Zhou, Rui
    Yu, Jeffrey Xu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (04) : 957 - 969
  • [4] The complexity of XPath query evaluation and XML typing
    Gottlob, G
    Koch, C
    Pichler, R
    JOURNAL OF THE ACM, 2005, 52 (02) : 284 - 335
  • [5] Probabilistic query answering over inconsistent databases
    Greco, Sergio
    Molinaro, Cristian
    ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2012, 64 (2-3) : 185 - 207
  • [6] Efficient probabilistic XML query processing using an extended labeling scheme and a lightweight index
    Yun, Jung-Hee
    Chung, Chin-Wan
    INFORMATION PROCESSING & MANAGEMENT, 2012, 48 (06) : 1181 - 1202
  • [7] On the expressiveness of probabilistic XML models
    Abiteboul, Serge
    Kimelfeld, Benny
    Sagiv, Yehoshua
    Senellart, Pierre
    VLDB JOURNAL, 2009, 18 (05): : 1041 - 1064
  • [8] Query answering over inconsistent knowledge bases: A probabilistic approach
    Calautti, Marco
    Greco, Sergio
    Molinaro, Cristian
    Trubitsyna, Irina
    THEORETICAL COMPUTER SCIENCE, 2022, 935 : 144 - 173
  • [9] Query Evaluation on Probabilistic RDF Databases
    Huang, Hai
    Liu, Chengfei
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2009, PROCEEDINGS, 2009, 5802 : 307 - 320
  • [10] Query Optimization Strategies in Probabilistic Relational Databases
    Zhang, Caicai
    Cao, Zhongsheng
    Zhu, Hong
    THEORETICAL COMPUTER SCIENCE, NCTCS 2017, 2017, 768 : 208 - 220