Supporting Data-driven Workflows Enabled by Large Scale Observatories

被引:3
作者
Zamani, Ali Reza [1 ]
AbdelBaky, Moustafa [1 ]
Balouek-Thomert, Daniel [1 ]
Rodero, Ivan [1 ]
Parashar, Manish [1 ]
机构
[1] Rutgers State Univ, Rutgers Discovery Informat Inst RDI2, Piscataway, NJ 08854 USA
来源
2017 IEEE 13TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE) | 2017年
基金
美国国家科学基金会;
关键词
Large scale observatories; Data-driven workflows; Wide-area data analytics; Large-scale science;
D O I
10.1109/eScience.2017.95
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Large scale observatories are shared-use resources that provide open access to data from geographically distributed sensors and instruments. This data has the potential to accelerate scientific discovery. However, seamlessly integrating the data into scientific workflows remains a challenge. In this paper, we summarize our ongoing work in supporting data-driven and data-intensive workflows and outline our vision for how these observatories can improve large-scale science. Specifically, we present programming abstractions and runtime management services to enable the automatic integration of data in scientific workflows. Further, we show how approximation techniques can be used to address network and processing variations by studying constraint limitations and their associated latencies. We use the Ocean Observatories Initiative (OOI) as a driving use case for this work.
引用
收藏
页码:592 / 595
页数:4
相关论文
共 20 条
  • [1] Altintas I, 2004, P INT C SCI STAT DAT, V16, P423, DOI DOI 10.1109/SSDM.2004.1311241
  • [2] [Anonymous], 2015, CIDR
  • [3] High performance threaded data streaming for large scale simulations
    Bhat, V
    Klasky, S
    Atchley, S
    Beck, M
    McCune, D
    Parashar, M
    [J]. FIFTH IEEE/ACM INTERNATIONAL WORKSHOP ON GRID COMPUTING, PROCEEDINGS, 2004, : 243 - 250
  • [4] Bhat V., 2006, Proceedings. 3rd International Conference on Autonomic Computing (IEEE Cat. No. 06EX1303), P15
  • [5] Chun BG, 2011, EUROSYS 11: PROCEEDINGS OF THE EUROSYS 2011 CONFERENCE, P301
  • [6] Deelman E., 2005, Scientific Programming, V13, P219
  • [7] Deelman Ewa, 2017, INT J HIGH PERFORMAN
  • [8] Fahringer T, 2005, 2005 6TH INTERNATIONAL WORKSHOP ON GRID COMPUTING (GRID), P122
  • [9] Trading Timeliness and Accuracy in Geo-Distributed Streaming Analytics
    Heintz, Benjamin
    Chandra, Abhishek
    Sitaraman, Ramesh K.
    [J]. PROCEEDINGS OF THE SEVENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC 2016), 2016, : 361 - 373
  • [10] Jonathan Albert, 2017, IEEE T PARALLEL DIST