Tracking provenance of earth science data

被引:15
|
作者
Tilmes, Curt [1 ]
Yesha, Yelena [2 ]
Halem, Milton [2 ]
机构
[1] NASA, Goddard Space Flight Ctr, Greenbelt, MD 20771 USA
[2] Univ Maryland, Baltimore, MD 21250 USA
基金
美国国家科学基金会;
关键词
Data processing; Provenance;
D O I
10.1007/s12145-010-0046-3
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Tremendous volumes of data have been captured, archived and analyzed. Sensors, algorithms and processing systems for transforming and analyzing the data are evolving over time. Web Portals and Services can create transient data sets on-demand. Data are transferred from organization to organization with additional transformations at every stage. Provenance in this context refers to the source of data and a record of the process that led to its current state. It encompasses the documentation of a variety of artifacts related to particular data. Provenance is important for understanding and using scientific datasets, and critical for independent confirmation of scientific results. Managing provenance throughout scientific data processing has gained interest lately and there are a variety of approaches. Large scale scientific datasets consisting of thousands to millions of individual data files and processes offer particular challenges. This paper uses the analogy of art history provenance to explore some of the concerns of applying provenance tracking to earth science data. It also illustrates some of the provenance issues with examples drawn from the Ozone Monitoring Instrument (OMI) Data Processing System (OMIDAPS) (Tilmes et al. 2004) run at NASA's Goddard Space Flight Center by the first author.
引用
收藏
页码:59 / 65
页数:7
相关论文
共 50 条
  • [41] TRACKING THE EARTH SATELLITE, AND DATA TRANSMISSION, BY RADIO
    MENGEL, JT
    PROCEEDINGS OF THE INSTITUTE OF RADIO ENGINEERS, 1956, 44 (06): : 755 - 760
  • [42] Inferred Interactive Controls Through Provenance Tracking of ROS Message Data
    Witte, Thomas
    Tichy, Matthias
    2021 IEEE/ACM 3RD INTERNATIONAL WORKSHOP ON ROBOTICS SOFTWARE ENGINEERING (ROSE 2021), 2021, : 67 - 74
  • [43] se1P: Selective Tracking and Presentation of Data Provenance
    Deutch, Daniel
    Gilad, Amir
    Moskovitch, Yuval
    2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 1484 - 1487
  • [44] A MODEL FOR EARTH SCIENCE DATA-STRUCTURES DESIGNED TO PROMOTE GENERALIZATION OF EARTH SCIENCE DATA-PROCESSING SYSTEMS
    HOLROYD, MT
    JOURNAL OF THE INTERNATIONAL ASSOCIATION FOR MATHEMATICAL GEOLOGY, 1983, 15 (01): : 229 - 230
  • [45] THE PROVENANCE OF NAVIGATIONAL SCIENCE
    WILLIAMS, JED
    JOURNAL OF NAVIGATION, 1988, 41 (01): : 1 - 24
  • [46] Bridging sustainability science, earth science, and data science through interdisciplinary education
    Deana Pennington
    Imme Ebert-Uphoff
    Natalie Freed
    Jo Martin
    Suzanne A. Pierce
    Sustainability Science, 2020, 15 : 647 - 661
  • [47] Bridging sustainability science, earth science, and data science through interdisciplinary education
    Pennington, Deana
    Ebert-Uphoff, Imme
    Freed, Natalie
    Martin, Jo
    Pierce, Suzanne A.
    SUSTAINABILITY SCIENCE, 2020, 15 (02) : 647 - 661
  • [48] Improving Reproducibility of Data Science Pipelines through Transparent Provenance Capture
    Rupprecht, Lukas
    Davis, James C.
    Arnold, Constantine
    Gur, Yaniv
    Bhagwat, Deepavali
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (12): : 3354 - 3368
  • [49] Capturing Provenance for Runtime Data Analysis in Computational Science and Engineering Applications
    Silva, Vitor
    Souza, Renan
    Camata, Jose
    de Oliveira, Daniel
    Valduriez, Patrick
    Coutinho, Alvaro L. G. A.
    Mattoso, Marta
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, IPAW 2018, 2018, 11017 : 183 - 187
  • [50] The contribution and reuse of LTER data in the Provenance Aware Synthesis Tracking Architecture (PASTA) data repository
    Servilla, Mark
    Brunt, James
    Costa, Duane
    McGann, Jeanine
    Waide, Robert
    ECOLOGICAL INFORMATICS, 2016, 36 : 247 - 258