Query evaluation over probabilistic XML

被引:21
|
作者
Kimelfeld, Benny [1 ]
Kosharovsky, Yuri [2 ]
Sagiv, Yehoshua [2 ]
机构
[1] IBM Almaden Res Ctr, San Jose, CA USA
[2] Hebrew Univ Jerusalem, Jerusalem, Israel
来源
VLDB JOURNAL | 2009年 / 18卷 / 05期
基金
以色列科学基金会;
关键词
Probabilistic databases; Probabilistic XML; Query processing; Query optimization; Approximate query evaluation; INFORMATION; ALGORITHMS; COMPLEXITY; ALGEBRA; MODEL; HARD;
D O I
10.1007/s00778-009-0150-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Query evaluation over probabilistic XML is explored. The queries are twig patterns with projection, and the data is represented in terms of three models of probabilistic XML (that extend existing ones in the literature). The first model makes an assumption of independence among the probabilistic junctions, whereas the second model can encode probabilistic dependencies. The third model combines the first two and, hence, is the most general. An efficient algorithm (under data complexity) is given for query evaluation in the first model. In addition, various optimizations are proposed, and their effectiveness is shown both analytically and experimentally. For the other two models, it is shown that every query is either intractable or trivial. Nonetheless, efficient (additive and multiplicative) approximation algorithms are given for these two models. Finally, Boolean queries are enriched by allowing disjunctions and negations of branches. The above algorithm for the first model is extended to handle these queries. For the other two models, there is an efficient additive approximation, and a multiplicative one also exists if there is no negation; in addition, it is shown that if the query is non-monotonic, then no efficient multiplicative approximation exists unless NP = RP.
引用
收藏
页码:1117 / 1140
页数:24
相关论文
共 50 条
  • [31] Efficient processing of top-k twig queries over probabilistic XML data
    Bo Ning
    Chengfei Liu
    Jeffrey Xu Yu
    World Wide Web, 2013, 16 : 299 - 323
  • [32] A Query Engine for Probabilistic Preferences
    Cohen, Uzi
    Kenig, Batya
    Ping, Haoyue
    Kimelfeld, Benny
    Stoyanovich, Julia
    SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 1509 - 1524
  • [33] Efficient processing of top-k twig queries over probabilistic XML data
    Ning, Bo
    Liu, Chengfei
    Yu, Jeffrey Xu
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2013, 16 (03): : 299 - 323
  • [34] An Effective Distributed Framework for XML Query Processing
    Subramaniam, Samini
    Haw, Su-Cheng
    Soon, Lay-Ki
    ADVANCED SCIENCE LETTERS, 2018, 24 (02) : 1240 - 1243
  • [35] An Optimization Method Based on XML Query Algebra
    Zhang, Qiuyu
    Wang, Min
    SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING: THEORY AND PRACTICE, VOL 1, 2012, 114 : 199 - 206
  • [36] Web/XML data management and query processing
    Zhou, AY
    Zheng, SH
    Qian, WN
    WORLD WIDE WEB TECHNOLOGIES IN CHINA: RESEARCH, DEVELOPMENT, AND APPLICATIONS, 2002, : 95 - 115
  • [37] A Deterministic Approach to XML Query Processing with Efficient Support for Pure and Negated Containments
    Che, Dunren
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY AND WEB ENGINEERING, 2006, 1 (04) : 49 - 67
  • [38] Minimum Tree Edit distance between XML and Probabilistic XML Documents
    Ma, Haitao
    Xu, Changming
    Fang, Miao
    Yu, Changyong
    2014 IEEE WORKSHOP ON ELECTRONICS, COMPUTER AND APPLICATIONS, 2014, : 391 - 394
  • [39] Explaining Query Answers in Probabilistic Databases
    Debbi, Hichem
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023, 8 (04): : 140 - 152
  • [40] Query with Assumptions for Probabilistic Relational Databases
    Zhang, Caicai
    Mei, Zhuolin
    Wu, Bin
    Zhao, Zhiqiang
    Yu, Jing
    Wang, Qingqing
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2020, 27 (03): : 923 - 932