An Approach for Schema Extraction of NoSQL Graph Databases

Cited by: 8
Authors
Frozza, Angelo Augusto [1]
Jacinto, Salomao Rodrigues [2]
Mello, Ronaldo dos Santos [2]
Affiliations
[1] Inst Fed Catarinense IFC, Blumenau, SC, Brazil
[2] Univ Fed Santa Catarina UFSC, Programa Posgrad Ciencia Comp PPGCC, Florianopolis, SC, Brazil
Source
2020 IEEE 21ST INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2020) | 2020
Keywords
DOI
10.1109/IRI49571.2020.00046
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Currently, a large volume of heterogeneous data is generated and consumed by several classes of applications, which has given rise to a new family of database models called NoSQL. NoSQL graph databases are members of this family. They provide high scalability and are schemaless, i.e., they do not require an explicit schema, as relational databases do. However, knowledge of how the data is structured may be of great importance for data integration or data analysis processes. Some works in the literature extract the schema from graph structures or graph-based data sources. Unlike them, this work proposes a comprehensive approach that considers all the common concepts of NoSQL graph database data models and generates a schema in the recent JSON Schema recommendation. Experimental evaluations show that our solution generates a suitable schema representation with linear complexity.
Pages: 271-278
Page count: 8