Asynchronous Data Provenance for Research Data in a Distributed System

被引:0
作者
Heinrichs, Benedikt [1 ]
Politze, Marius [1 ]
机构
[1] Rhein Westfal TH Aachen, IT Ctr, Seffenter Weg 23, Aachen, Germany
来源
ICEIS: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 2 | 2021年
关键词
Research Data Management; Data Provenance; Distributed Systems;
D O I
10.5220/0010478003610367
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many provenance systems assume that the data flow is being directly orchestrated by them or logs are present which describe it. This works well until these assumptions do not hold anymore. The Coscine platform is a way for researchers to connect to different storage providers and annotate their stored data with discipline-specific metadata. These storage providers, however, do not inform the platform of externally induced changes for example by the user. Therefore, this paper focuses on the need of data provenance that is not directly produced and has to be deduced after the fact. An approach is proposed for dealing with and creating such asynchronous data provenance which makes use of change indicators that deduce if a data entity has been modified. A representation on how to describe such an asynchronous data provenance in the Resource Description Framework (RDF) is discussed. Finally, a prototypical implementation of the approach in the Coscine use-case is described and the future steps for the approach and prototype are detailed.
引用
收藏
页码:361 / 367
页数:7
相关论文
共 24 条
[1]  
[Anonymous], 2008, Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data
[2]  
Belhajjame K., 2012, PROV-O: The PROV Ontology
[3]  
Bensberg S., 2020, EFFICIENT SEMANTIC S
[4]  
Brickley D., 2014, FOAF Vocabulary Speci cation
[5]  
Cyganiak R., 2014, W3C Recommendation
[6]  
Davidson S., 2007, IEEE DATA ENG B, V32, P44, DOI DOI 10.1145/1376616.1376772
[7]  
European Organization for Nuclear Research OpenAIRE, 2013, ZENODO
[8]  
Foster E.D., 2017, Journal of the Medical Library Association: JMLA, V105, P203, DOI DOI 10.5195/JMLA.2017.88
[9]   Schema.org: Evolution of Structured Data on the Web [J].
Guha, R. V. ;
Brickley, Dan ;
Macbeth, Steve .
COMMUNICATIONS OF THE ACM, 2016, 59 (02) :44-51
[10]  
Heinrichs B., 2020, MOVING GEN METADATA