Provenance Analysis: Towards Quality Provenance

被引:0
作者
Cheah, You-Wei [1 ]
Plale, Beth [1 ]
机构
[1] Indiana Univ, Sch Informat & Comp, Bloomington, IN 47405 USA
来源
2012 IEEE 8TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE) | 2012年
关键词
Data Provenance; Provenance Quality; Scientific Workflows; Provenance Analysis;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Data provenance, a key piece of metadata that describes the lifecycle of a data product, is crucial in aiding scientists to better understand and facilitate reproducibility and reuse of scientific results. Provenance collection systems often capture provenance on the fly and the protocol between application and provenance tool may not be reliable. As a result, data provenance can become ambiguous or simply inaccurate. In this paper, we identify likely quality issues in data provenance. We also establish crucial quality dimensions that are especially critical for the evaluation of provenance quality. We analyze synthetic and real-world provenance based on these quality dimensions and summarize our contributions to provenance quality.
引用
收藏
页数:8
相关论文
共 25 条
[1]  
Anand M. K., 2010, EDBT
[2]  
Anand M. K., 2009, EDBT
[3]  
Bruce T.R., 2004, METADATA PRACTICE
[4]  
Chapman A., 2008, P ACM SIGMOD PODS C
[5]  
Cheah Y.-W., 2011, 2 INT WORKSH TRAC CO
[6]  
CURBERA F, 2008, 16 INT C COOP INF SY, V5331, P100
[7]  
Dai CY, 2008, LECT NOTES COMPUT SC, V5159, P82
[8]  
Davidson S. B., 2008, SIGMOD C
[9]  
Hartig O., 2009, 1 INT WORKSH ROL SEM
[10]  
Heinis T., 2008, SIGMOD