Exact Model Counting of Query Expressions: Limitations of Propositional Methods

被引:9
作者
Beame, Paul [1 ]
Li, Jerry [2 ]
Roy, Sudeepa [3 ]
Suciu, Dan [1 ]
机构
[1] Univ Washington, Dept Comp Sci & Engn, Box 352350, Seattle, WA 98195 USA
[2] MIT, Dept Elect Engn & Comp Sci, 32 Vassar St, Cambridge, MA 02141 USA
[3] Duke Univ, Dept Comp Sci, Campus Box 90129,308 Res Dr, Durham, NC 27708 USA
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2017年 / 42卷 / 01期
基金
美国国家科学基金会;
关键词
COMPLEXITY;
D O I
10.1145/2984632
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We prove exponential lower bounds on the running time of the state-of-the-art exact model counting algorithms-algorithms for exactly computing the number of satisfying assignments, or the satisfying probability, of Boolean formulas. These algorithms can be seen, either directly or indirectly, as building Decision-Decomposable Negation Normal Form (decision-DNNF) representations of the input Boolean formulas. Decision-DNNFs are a special case of d-DNNFs where d stands for deterministic. We show that any knowledge compilation representations from a class (called DLDDs in this article) that contain decisionDNNFs can be converted into equivalent Free Binary Decision Diagrams (FBDDs), also known as Read-Once Branching Programs, with only a quasi-polynomial increase in representation size. Leveraging known exponential lower bounds for FBDDs, we then obtain similar exponential lower bounds for decision-DNNFs, which imply exponential lower bounds for model-counting algorithms. We also separate the power of decisionDNNFs from d-DNNFs and a generalization of decision-DNNFs known as AND-FBDDs. We then prove new lower bounds for FBDDs that yield exponential lower bounds on the running time of these exact model counters when applied to the problem of query evaluation in tuple-independent probabilistic databases-computing the probability of an answer to a query given independent probabilities of the individual tuples in a database instance. This approach to the query evaluation problem, in which one first obtains the lineage for the query and database instance as a Boolean formula and then performs weighted model counting on the lineage, is known as grounded inference. A second approach, known as lifted inference or extensional query evaluation, exploits the high-level structure of the query as a first-order formula. Although it has been widely believed that lifted inference is strictly more powerful than grounded inference on the lineage alone, no formal separation has previously been shown for query evaluation. In this article, we show such a formal separation for the first time. In particular, we exhibit a family of database queries for which polynomial-time extensional query evaluation techniques were previously known but for which query evaluation via grounded inference using the state-of-the-art exact model counters requires exponential time.
引用
收藏
页数:46
相关论文
共 41 条
  • [1] AKERS SB, 1978, IEEE T COMPUT, V27, P509, DOI 10.1109/TC.1978.1675141
  • [2] [Anonymous], 2014, SDD PACKAGE VERSION
  • [3] [Anonymous], 2011, ser. Synthesis Lectures on Data Management
  • [4] [Anonymous], 2004, SAT
  • [5] [Anonymous], 2000, SIAM MONOG DISCR MAT, DOI 10.1137/1.9780898719789
  • [6] [Anonymous], KR
  • [7] [Anonymous], 1997, Enumerative Combinatorics
  • [8] Algorithms and complexity results for #SAT and Bayesian inference
    Bacchus, F
    Dalmao, S
    Pitassi, T
    [J]. 44TH ANNUAL IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 2003, : 340 - 351
  • [9] Bayardo RJ, 2000, SEVENTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-2001) / TWELFTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-2000), P157
  • [10] Towards understanding and harnessing the potential of clause learning
    Beame, P
    Kautz, H
    Sabharwal, A
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2004, 22 : 319 - 351