Tracking provenance of earth science data

被引:15
|
作者
Tilmes, Curt [1 ]
Yesha, Yelena [2 ]
Halem, Milton [2 ]
机构
[1] NASA, Goddard Space Flight Ctr, Greenbelt, MD 20771 USA
[2] Univ Maryland, Baltimore, MD 21250 USA
基金
美国国家科学基金会;
关键词
Data processing; Provenance;
D O I
10.1007/s12145-010-0046-3
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Tremendous volumes of data have been captured, archived and analyzed. Sensors, algorithms and processing systems for transforming and analyzing the data are evolving over time. Web Portals and Services can create transient data sets on-demand. Data are transferred from organization to organization with additional transformations at every stage. Provenance in this context refers to the source of data and a record of the process that led to its current state. It encompasses the documentation of a variety of artifacts related to particular data. Provenance is important for understanding and using scientific datasets, and critical for independent confirmation of scientific results. Managing provenance throughout scientific data processing has gained interest lately and there are a variety of approaches. Large scale scientific datasets consisting of thousands to millions of individual data files and processes offer particular challenges. This paper uses the analogy of art history provenance to explore some of the concerns of applying provenance tracking to earth science data. It also illustrates some of the provenance issues with examples drawn from the Ozone Monitoring Instrument (OMI) Data Processing System (OMIDAPS) (Tilmes et al. 2004) run at NASA's Goddard Space Flight Center by the first author.
引用
收藏
页码:59 / 65
页数:7
相关论文
共 50 条
  • [41] Computational provenance in hydrologic science: a snow mapping example
    Dozier, Jeff
    Frew, James
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2009, 367 (1890): : 1021 - 1033
  • [42] The requirements of using provenance in e-science experiments
    Miles S.
    Groth P.
    Branco M.
    Moreau L.
    Journal of Grid Computing, 2007, 5 (1) : 1 - 25
  • [43] ProvVacT: A Provenance Based mHealth Application for Tracking Vaccine History
    Yilmazer Demirel, Dilek
    Can, Ozgu
    2021 IEEE 45TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2021), 2021, : 1837 - 1842
  • [44] SAF: A Provenance-Tracking Framework for Interoperable Semantic Applications
    Patton, Evan W.
    Difranzo, Dominic
    McGuinness, Deborah L.
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, 2010, 6378 : 73 - 77
  • [45] Investigating reproducibility and tracking provenance - A genomic workflow case study
    Kanwal, Sehrish
    Khan, Farah Zaib
    Lonie, Andrew
    Sinnott, Richard O.
    BMC BIOINFORMATICS, 2017, 18
  • [46] Investigating reproducibility and tracking provenance – A genomic workflow case study
    Sehrish Kanwal
    Farah Zaib Khan
    Andrew Lonie
    Richard O. Sinnott
    BMC Bioinformatics, 18
  • [47] A conceptual model for tracking the provenance of activities in knowledge organization systems
    Choi, Inkyung
    Cheng, Yi-Yun
    JOURNAL OF DOCUMENTATION, 2025, 81 (01) : 147 - 167
  • [48] National Tibetan Plateau Data Center Promoting Earth System Science on the Third Pole
    Pan, Xiaoduo
    Guo, Xuejun
    Li, Xin
    Niu, Xiaolei
    Yang, Xiaojuan
    Feng, Min
    Che, Tao
    Jin, Rui
    Ran, Youhua
    Guo, Jianwen
    Hu, Xiaoli
    Wu, Adan
    BULLETIN OF THE AMERICAN METEOROLOGICAL SOCIETY, 2021, 102 (11) : E2062 - E2078
  • [49] A Scientific Data Provenance API for Distributed Applications
    Raju, Bibi
    Elsethagen, Todd
    Stephan, Eric
    Van Dam, Kerstin Kleese
    2016 INTERNATIONAL CONFERENCE ON COLLABORATION TECHNOLOGIES AND SYSTEMS (CTS), 2016, : 104 - 111
  • [50] Rare earth element chemistry of zircon and its use as a provenance indicator
    Hoskin, PWO
    Ireland, TR
    GEOLOGY, 2000, 28 (07) : 627 - 630