Schemas and Types for JSON']JSON Data: from Theory to Practice

被引:14
作者
Baazizi, Mohamed-Amine [1 ]
Colazzo, Dario [2 ]
Ghelli, Giorgio [3 ]
Sartiani, Carlo [4 ]
机构
[1] Sorbonne Univ, LIP6 UMR 7606, Paris, France
[2] Univ Paris 09, PSL Res Univ, Paris, France
[3] Univ Pisa, Dipartimento Informat, Pisa, Italy
[4] Univ Basilicata, DIMIE, Potenza, Italy
来源
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2019年
关键词
!text type='JSON']JSON[!/text; schemas; schema inference; parsing; schema libraries;
D O I
10.1145/3299869.3314032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The last few years have seen the fast and ubiquitous diffusion of JSON as one of the most widely used formats for publishing and interchanging data, as it combines the flexibility of semistructured data models with well-known data structures like records and arrays. The user willing to effectively manage JSON data collections can rely on several schema languages, like JSON Schema, JSound, and Joi, as well as on the type abstractions offered by modern programming and scripting languages like Swift or TypeScript. The main aim of this tutorial is to provide the audience (both researchers and practitioners) with the basic notions for enjoying all the benefits that schema and types can offer while processing and manipulating JSON data. This tutorial focuses on four main aspects of the relation between JSON and schemas: (1) we survey existing schema language proposals and discuss their prominent features; (2) we analyze tools that can infer schemas from data, or that exploit schema information for improving data parsing and management; and (3) we discuss some open research challenges and opportunities related to JSON data.
引用
收藏
页码:2060 / 2063
页数:4
相关论文
共 11 条
[1]   Parametric schema inference for massive JSON']JSON datasets [J].
Baazizi, Mohamed-Amine ;
Colazzo, Dario ;
Ghelli, Giorgio ;
Sartiani, Carlo .
VLDB JOURNAL, 2019, 28 (04) :497-521
[2]   Counting Types for Massive JSON']JSON Datasets [J].
Baazizi, Mohamed-Amine ;
Colazzo, Dario ;
Ghelli, Giorgio ;
Sartiani, Carlo .
PROCEEDINGS OF THE 16TH INTERNATIONAL SYMPOSIUM ON DATABASE PROGRAMMING LANGUAGES (DBPL 2017), 2017,
[3]  
Ben Lahmar Houssem, 2017, EDBT 17
[4]  
Beyer KS, 2011, PROC VLDB ENDOW, V4, P1272
[5]   FAD.js']js: Fast JSON']JSON Data Access Using JIT-based Speculative Optimizations [J].
Bonetta, Daniele ;
Brantner, Matthias .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (12) :1778-1789
[6]   JSON']JSON: Data model, Query languages and Schema specification [J].
Bourhis, Pierre ;
Reutter, Juan L. ;
Suarez, Fernando ;
Vrgoc, Domagoj .
PODS'17: PROCEEDINGS OF THE 36TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2017, :123-135
[7]   Schema profiling of document-oriented databases [J].
Gallinucci, Enrico ;
Golfarelli, Matteo ;
Rizzi, Stefano .
INFORMATION SYSTEMS, 2018, 75 :13-25
[8]   Goods: Organizing Google's Datasets [J].
Halevy, Alon ;
Korn, Flip ;
Noy, Natalya F. ;
Olston, Christopher ;
Polyzotis, Neoklis ;
Roy, Sudip ;
Whang, Steven Euijong .
SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, :795-806
[9]   Mison: A Fast JSON']JSON Parser for Data Analytics [J].
Li, Yinan ;
Katsipoulakis, Nikos R. ;
Chandramouli, Badrish ;
Goldstein, Jonathan ;
Kossmann, Donald .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (10) :1118-1129
[10]   Foundations of JSON']JSON Schema [J].
Pezoa, Felipe ;
Reutter, Juan L. ;
Suarez, Fernando ;
Ugarte, Martin ;
Vrgoc, Domagoj .
PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, :263-273