VOYAGE: A Large Collection of Vocabulary Usage in Open RDF Datasets

被引:0
作者
Shi, Qing [1 ]
Wang, Junrui [1 ]
Pan, Jeff Z. [2 ]
Cheng, Gong [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Univ Edinburgh, Sch Informat, Edinburgh, Scotland
来源
SEMANTIC WEB, ISWC 2023, PT II | 2023年 / 14266卷
关键词
Open RDF data; Vocabulary usage; Term co-occurrence; RELATEDNESS;
D O I
10.1007/978-3-031-47243-5_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Shared vocabularies facilitate data integration and application interoperability on the Semantic Web. An investigation of how vocabularies are practically used in open RDF data, particularly with the increasing number of RDF datasets registered in open data portals, is expected to provide a measurement for the adoption of shared vocabularies and an indicator of the state of the Semantic Web. To support this investigation, we constructed and published VOYAGE, a large collection of vocabulary usage in open RDF datasets. We built it by collecting 68,312 RDF datasets from 517 pay-level domains via 577 open data portals, and we extracted 50,976 vocabularies used in the data. We analyzed the extracted usage data and revealed the distributions of frequency and diversity in vocabulary usage. We particularly characterized the patterns of term co-occurrence, and leveraged them to cluster vocabularies and RDF datasets as a potential application of VOYAGE. Our data is available from Zenodo at https://zenodo.org/record/7902675. Our code is available from GitHub at https://github.com/nju-websoft/VOYAGE.
引用
收藏
页码:211 / 229
页数:19
相关论文
共 36 条
[1]   A survey of RDF stores & SPARQL engines for querying knowledge graphs [J].
Ali, Waqas ;
Saleem, Muhammad ;
Yao, Bin ;
Hogan, Aidan ;
Ngomo, Axel-Cyrille Ngonga .
VLDB JOURNAL, 2022, 31 (03) :603-628
[2]   Analysing the Use of Ontologies based on Usage Network [J].
Ashraf, Jamshaid ;
Hussain, Omar Khadeer .
2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, :540-544
[3]   Observing LOD Using Equivalent Set Graphs: It Is Mostly Flat and Sparsely Linked [J].
Asprino, Luigi ;
Beek, Wouter ;
Ciancarini, Paolo ;
van Harmelen, Frank ;
Presutti, Valentina .
SEMANTIC WEB - ISWC 2019, PT I, 2019, 11778 :57-74
[4]  
Bizer C, 2013, LECT NOTES COMPUT SC, V8219, P17, DOI 10.1007/978-3-642-41338-4_2
[5]  
Cheng G., 2013, Semantic Web and Web Science, P265, DOI [10.1007/978-1-4614-6880-623, DOI 10.1007/978-1-4614-6880-623]
[6]   Relatedness between vocabularies on the Web of data: A taxonomy and an empirical study [J].
Cheng, Gong ;
Qu, Yuzhong .
JOURNAL OF WEB SEMANTICS, 2013, 20 :1-17
[7]  
Cheng G, 2011, LECT NOTES COMPUT SC, V7031, P98, DOI 10.1007/978-3-642-25073-6_7
[8]  
Dividino R.Q., 2013, COLD 2013
[9]  
Gottron Thomas, 2013, Semantic Web: Semantics and Big Data. Proceedings of 10th International Conference (ESWC 2013): LNCS 7882, P228
[10]   Analysis of schema structures in the Linked Open Data graph based on unique subject URIs, pay-level domains, and vocabulary usage [J].
Gottron, Thomas ;
Knauf, Malte ;
Scherp, Ansgar .
DISTRIBUTED AND PARALLEL DATABASES, 2015, 33 (04) :515-553