Interactive Provenance Summaries for Reproducible Science

被引:0
作者
Li, Xiang [1 ]
Xu, Xiaoyang [1 ]
Malik, Tanu [2 ]
机构
[1] Univ Chicago, Computat Inst, Chicago, IL 60637 USA
[2] Depaul Univ, Sch Comp, Chicago, IL 60604 USA
来源
PROCEEDINGS OF THE 2016 IEEE 12TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE) | 2016年
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Recorded provenance facilitates reproducible science. Provenance metadata can help determine how data were possibly transformed, processed, and derived from original sources. While provenance is crucial for verification and validation, there remains the issue of the granularity-detail at which provenance data must be provided to a user, especially for conducting reproducible science. When data are reproduced successfully the need for detailed provenance is minimal and an essence of the recorded provenance suffices. However, when data are not reproduced correctly users want to quickly drill down into fine-grained provenance to understand causes for failure. In this paper, we describe a drill-up/drill-down method for exploring provenance traces. The drill-up method summarizes the trace by grouping nodes and edges of the trace that have same derivation histories. The method preserves provenance data flow semantics. The drill-down method compares summary groups and ranks groups that may have information about the errors. Both the methods are implemented in an efficient manner using light-weight data structures so as to be suitable for reproducible science. We conduct a thorough experimental analysis to show how the operators perform in compressing and expanding real provenance graphs.
引用
收藏
页码:355 / 360
页数:6
相关论文
共 16 条
[1]   Provenance Browser: Displaying and Querying Scientific Workflow Provenance Graphs [J].
Anand, Manish Kumar ;
Bowers, Shawn ;
Ludaescher, Bertram .
26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, :1201-1204
[2]  
[Anonymous], 2016, DAGSTUHL SEMINAR
[3]   Addressing the provenance challenge using ZOOM [J].
Cohen-Boulakia, Sarah ;
Biton, Olivier ;
Cohen, Shirley ;
Davidson, Susan .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2008, 20 (05) :497-506
[4]  
Freire J., 2012, SIGMOD
[5]   Provenance management in Swift [J].
Gadelha, Luiz M. R., Jr. ;
Clifford, Ben ;
Mattoso, Marta ;
Wilde, Michael ;
Foster, Ian .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2011, 27 (06) :775-780
[6]  
Huo D., 2016, SMART CONTAINERS ONT
[7]   Surface soil moisture parameterization of the VIC-2L model: Evaluation and modification [J].
Liang, X ;
Wood, EF ;
Lettenmaier, DP .
GLOBAL AND PLANETARY CHANGE, 1996, 13 (1-4) :195-206
[8]   Local Clustering in Provenance Graphs [J].
Macko, Peter ;
Margo, Daniel ;
Seltzer, Margo .
PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, :835-840
[9]  
Malik T., 2010, INT C ESCIENCE
[10]  
Malik Tanu., 2014, SOLE: Towards Descriptive and Interactive Publications