Information Inequality Problem over Set Functions

被引:0
作者
Hannula, Miika [1 ]
机构
[1] Univ Helsinki, Helsinki, Finland
来源
27TH INTERNATIONAL CONFERENCE ON DATABASE THEORY, ICDT 2024 | 2024年 / 290卷
基金
欧洲研究理事会;
关键词
entropy; information theory; worst-case output size; computational complexity; RELATIONAL DATABASES; THEORETIC ANALYSIS;
D O I
10.4230/LIPIcs.ICDT.2024.19
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Information inequalities appear in many database applications such as query output size bounds, query containment, and implication between data dependencies. Recently Khamis et al. [14] proposed to study the algorithmic aspects of information inequalities, including the information inequality problem: decide whether a linear inequality over entropies of random variables is valid. While the decidability of this problem is a major open question, applications often involve only inequalities that adhere to specific syntactic forms linked to useful semantic invariance properties. This paper studies the information inequality problem in different syntactic and semantic scenarios that arise from database applications. Focusing on the boundary between tractability and intractability, we show that the information inequality problem is coNP-complete if restricted to normal polymatroids, and in polynomial time if relaxed to monotone functions. We also examine syntactic restrictions related to query output size bounds, and provide an alternative proof, through monotone functions, for the polynomial-time computability of the entropic bound over simple sets of degree constraints.
引用
收藏
页数:20
相关论文
共 28 条
[1]   An information-theoretic approach to normal forms for relational and XML data [J].
Arenas, M ;
Libkin, L .
JOURNAL OF THE ACM, 2005, 52 (02) :246-283
[2]   SIZE BOUNDS AND QUERY PLANS FOR RELATIONAL JOINS [J].
Atserias, Albert ;
Grohe, Martin ;
Marx, Daniel .
SIAM JOURNAL ON COMPUTING, 2013, 42 (04) :1737-1767
[3]  
Beeri C., 1977, P 1977 ACM SIGMOD IN, P47, DOI DOI 10.1145/509404.509414
[4]  
Dougherty R, 2011, Arxiv, DOI arXiv:1104.3602
[5]   COMPLEXITY OF AUTOMATON IDENTIFICATION FROM GIVEN DATA [J].
GOLD, EM .
INFORMATION AND CONTROL, 1978, 37 (03) :302-320
[6]   Size and Treewidth Bounds for Conjunctive Queries [J].
Gottlob, Georg ;
Lee, Stephanie Tien ;
Valiant, Gregory ;
Valiant, Paul .
JOURNAL OF THE ACM, 2012, 59 (03)
[7]   Constraint Solving via Fractional Edge Covers [J].
Grohe, Martin ;
Marx, Daniel .
ACM TRANSACTIONS ON ALGORITHMS, 2014, 11 (01)
[8]  
Gurpinar E, 2019, IEEE INT SYMP INFO, P1377, DOI [10.1109/isit.2019.8849309, 10.1109/ISIT.2019.8849309]
[9]  
Hannula Miika, 2023, arXiv, DOI [10.48550/arXiv.2309.11818, DOI 10.48550/ARXIV.2309.11818]
[10]   ON THE UNDECIDABILITY OF IMPLICATIONS BETWEEN EMBEDDED MULTIVALUED DATABASE DEPENDENCIES [J].
HERRMANN, C .
INFORMATION AND COMPUTATION, 1995, 122 (02) :221-235