Limit Datalog: A Declarative Query Language for Data Analysis

被引:0
|
作者
Grau, Bernardo Cuenca [1 ]
Horrocks, Ian [1 ]
Kaminski, Mark [1 ]
Kostylev, Egor, V [1 ]
Motik, Boris [1 ]
机构
[1] Univ Oxford, Oxford, England
基金
英国工程与自然科学研究理事会;
关键词
AGGREGATION; COMPLEXITY; SEMANTICS; POWER;
D O I
10.1145/3385658.3385660
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Motivated by applications in declarative data analysis, we study Datalog(Z)-an extension of Datalog with stratified negation and arithmetics over integers. Reasoning in this language is undecidable, so we present a fragment, called limit Datalog(Z), that is powerful enough to naturally capture many important data analysis tasks. In limit Datalog(Z), all intensional predicates with a numeric argument are limit predicates that keep only the maximal or minimal bounds on numeric values. Reasoning in limit Datalog(Z) is decidable if multiplication is used in a way that satisfies our linearity condition. Moreover, fact entailment in limit-linear Datalog(Z) is Delta(EXP)(2)-complete in combined and Delta(P)(2)-complete in data complexity, and it drops to coNEXP and coNP, respectively, if only (semi-)positive programs are considered. We also propose an additional stability requirement, for which the complexity drops to EXP and P, matching the bounds for usual Datalog. Limit Datalog(Z) thus provides us with a unified logical framework for declarative data analysis and can be used as a basis for understanding the expressive power of the key data analysis constructs.
引用
收藏
页码:6 / 17
页数:12
相关论文
共 50 条
  • [21] Causes for query answers from databases: Datalog abduction, view-updates, and integrity constraints
    Bertossi, Leopoldo
    Salimi, Babak
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2017, 90 : 226 - 252
  • [22] Datalog plus /-: A Family of Logical Knowledge Representation and Query Languages for New Applications Keynote Lecture
    Cali, Andrea
    Gottlob, Georg
    Lukasiewicz, Thomas
    Marnette, Bruno
    Pieris, Andreas
    25TH ANNUAL IEEE SYMPOSIUM ON LOGIC IN COMPUTER SCIENCE (LICS 2010), 2010, : 228 - 242
  • [23] NP Datalog: A logic language for expressing NP search and optimization problems
    Greco, Sergio
    Molinaro, Cristian
    Trubitsyna, Irina
    Zumpano, Ester
    THEORY AND PRACTICE OF LOGIC PROGRAMMING, 2010, 10 : 125 - 166
  • [24] SociaLite: Datalog Extensions for Efficient Social Network Analysis
    Seo, Jiwon
    Guo, Stephen
    Lam, Monica S.
    2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 278 - 289
  • [25] Demonstration of LogicLib: An Expressive Multi-Language Interface over Scalable Datalog System
    Li, Mingda
    Wang, Jin
    Xiao, Guorui
    Li, Youfu
    Zaniolo, Carlo
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4917 - 4920
  • [26] RQL: A Query Language for Rule Discovery in Databases
    Chardin, Brice
    Coquery, Emmanuel
    Pailloux, Marie
    Petit, Jean-Marc
    THEORETICAL COMPUTER SCIENCE, 2017, 658 : 357 - 374
  • [27] Translational Semantics for a Conceptual Level Query Language
    $Hock C. Chan(Department of Information Systems and Computer Science
    JournalofComputerScienceandTechnology, 1995, (02) : 175 - 187
  • [28] A Query Language Based on Term Matching and Rewriting
    Zielinski, Bartosz
    FUNDAMENTA INFORMATICAE, 2019, 169 (03) : 237 - 274
  • [29] A model and query language for temporal graph databases
    Debrouvier, Ariel
    Parodi, Eliseo
    Perazzo, Matias
    Soliani, Valeria
    Vaisman, Alejandro
    VLDB JOURNAL, 2021, 30 (05) : 825 - 858
  • [30] A Language-Independent Framework for Reasoning About Preferences for Declarative Problem Solving
    Ensan, Alireza
    Ternovska, Eugenia
    FRONTIERS OF COMBINING SYSTEMS (FROCOS 2019), 2019, 11715 : 57 - 73