Schemas and Types for JSON Data: from Theory to Practice

Cited by: 13
Authors
Baazizi, Mohamed-Amine [1]
Colazzo, Dario [2]
Ghelli, Giorgio [3]
Sartiani, Carlo [4]
Affiliations
[1] Sorbonne Univ, LIP6 UMR 7606, Paris, France
[2] Univ Paris 09, PSL Res Univ, Paris, France
[3] Univ Pisa, Dipartimento Informat, Pisa, Italy
[4] Univ Basilicata, DIMIE, Potenza, Italy
Source
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2019
Keywords
JSON; schemas; schema inference; parsing; schema libraries
DOI
10.1145/3299869.3314032
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The last few years have seen the fast and ubiquitous diffusion of JSON as one of the most widely used formats for publishing and interchanging data, as it combines the flexibility of semistructured data models with well-known data structures like records and arrays. The user willing to effectively manage JSON data collections can rely on several schema languages, like JSON Schema, JSound, and Joi, as well as on the type abstractions offered by modern programming and scripting languages like Swift or TypeScript. The main aim of this tutorial is to provide the audience (both researchers and practitioners) with the basic notions for enjoying all the benefits that schemas and types can offer while processing and manipulating JSON data. This tutorial focuses on four main aspects of the relation between JSON and schemas: (1) we survey existing schema language proposals and discuss their prominent features; (2) we review how modern programming languages support JSON data as first-class citizens; (3) we analyze tools that can infer schemas from data, or that exploit schema information for improving data parsing and management; and (4) we discuss some open research challenges and opportunities related to JSON data.
Pages: 2060-2063
Page count: 4
Related papers
11 in total
  • [1] Parametric schema inference for massive JSON datasets
    Baazizi, Mohamed-Amine
    Colazzo, Dario
    Ghelli, Giorgio
    Sartiani, Carlo
    [J]. VLDB JOURNAL, 2019, 28 (04) : 497 - 521
  • [2] Counting Types for Massive JSON Datasets
    Baazizi, Mohamed-Amine
    Colazzo, Dario
    Ghelli, Giorgio
    Sartiani, Carlo
    [J]. PROCEEDINGS OF THE 16TH INTERNATIONAL SYMPOSIUM ON DATABASE PROGRAMMING LANGUAGES (DBPL 2017), 2017
  • [3] Ben Lahmar Houssem, 2017, EDBT 17
  • [4] Beyer KS, 2011, PROC VLDB ENDOW, V4, P1272
  • [5] FAD.js: Fast JSON Data Access Using JIT-based Speculative Optimizations
    Bonetta, Daniele
    Brantner, Matthias
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (12) : 1778 - 1789
  • [6] JSON: Data model, Query languages and Schema specification
    Bourhis, Pierre
    Reutter, Juan L.
    Suarez, Fernando
    Vrgoc, Domagoj
    [J]. PODS'17: PROCEEDINGS OF THE 36TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2017 : 123 - 135
  • [7] Schema profiling of document-oriented databases
    Gallinucci, Enrico
    Golfarelli, Matteo
    Rizzi, Stefano
    [J]. INFORMATION SYSTEMS, 2018, 75 : 13 - 25
  • [8] Goods: Organizing Google's Datasets
    Halevy, Alon
    Korn, Flip
    Noy, Natalya F.
    Olston, Christopher
    Polyzotis, Neoklis
    Roy, Sudip
    Whang, Steven Euijong
    [J]. SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016 : 795 - 806
  • [9] Mison: A Fast JSON Parser for Data Analytics
    Li, Yinan
    Katsipoulakis, Nikos R.
    Chandramouli, Badrish
    Goldstein, Jonathan
    Kossmann, Donald
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (10) : 1118 - 1129
  • [10] Foundations of JSON Schema
    Pezoa, Felipe
    Reutter, Juan L.
    Suarez, Fernando
    Ugarte, Martin
    Vrgoc, Domagoj
    [J]. PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016 : 263 - 273