Bash Datalog: Answering Datalog Queries with Unix Shell Commands

Cited by: 1
Authors
Rebele, Thomas [1 ]
Tanon, Thomas Pellissier [1 ]
Suchanek, Fabian [1 ]
Affiliation
[1] Telecom ParisTech, Paris, France
Source
SEMANTIC WEB - ISWC 2018, PT I | 2018 / Vol. 11136
Keywords
SYSTEM;
DOI
10.1007/978-3-030-00671-6_33
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Dealing with large tabular datasets often requires extensive preprocessing. This preprocessing happens only once, so loading and indexing the data in a database or triple store may be overkill. In this paper, we present an approach that allows preprocessing large tabular data in Datalog, without indexing the data. The Datalog query is translated to Unix Bash and can be executed in a shell. Our experiments show that, for the use case of data preprocessing, our approach is competitive with state-of-the-art systems in terms of scalability and speed, while at the same time requiring only a Bash shell on a Unix system.
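To give a rough feel for what answering a Datalog rule with shell commands can look like, the sketch below evaluates a single non-recursive rule over a tab-separated file using only `sort` and `join`. It is an illustration written for this record, not the translation produced by the authors' system; the rule, the file names, and the sample facts are all hypothetical.

```shell
#!/usr/bin/env bash
set -euo pipefail
export LC_ALL=C  # byte-order collation, so sort and join agree on ordering

workdir=$(mktemp -d)
trap 'rm -rf "$workdir"' EXIT

# Hypothetical facts parent(X, Y), one tab-separated fact per line.
printf 'alice\tbob\nbob\tcarol\nbob\tdave\n' > "$workdir/parent.tsv"

# Hypothetical rule: grandparent(X, Z) :- parent(X, Y), parent(Y, Z).
# The shared variable Y is column 2 of the first atom and column 1 of the
# second, so each copy is sorted on the column it will be joined on.
sort -t $'\t' -k2,2 "$workdir/parent.tsv" > "$workdir/by_child.tsv"
sort -t $'\t' -k1,1 "$workdir/parent.tsv" > "$workdir/by_parent.tsv"

# Join on Y, project (X, Z), and deduplicate to get set semantics.
grandparent=$(join -t $'\t' -1 2 -2 1 -o 1.1,2.2 \
  "$workdir/by_child.tsv" "$workdir/by_parent.tsv" | sort -u)

printf '%s\n' "$grandparent"
```

Here the input file is scanned and sorted on the fly, with no index built beforehand, which is the property the abstract emphasizes. Recursive rules would need an iteration to a fixpoint, which this sketch does not attempt.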
Pages: 566-582
Page count: 17