Extending the data model for data-centric metagenomics analysis using scientific workflows in CAMERA

被引:0
|
作者
Altintas I. [1 ]
Chen J. [2 ]
Sedova M. [1 ]
Gupta A. [1 ]
Sun S. [2 ]
Lin A.W. [3 ]
Gujral M. [1 ]
Anand M.K. [1 ]
Li W. [3 ]
Grethe J.S. [3 ]
Ellisman M. [3 ,4 ]
机构
[1] San Diego Supercomputer Center, University of California, San Diego, San Diego, CA
[2] California Institute for Telecommunications and Information Technology, University of California, San Diego, San Diego, CA
[3] Center for Research in Biological Systems, University of California, San Diego, San Diego, CA
[4] Neurosciences and Bioengineering, University of California, San Diego, San Diego, CA
来源
Proceedings - 6th IEEE International Conference on e-Science Workshops, e-ScienceW 2010 | 2010年
关键词
D O I
10.1109/eScienceW.2010.18
中图分类号
学科分类号
摘要
Community Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA) is an eScience project to enable the microbial ecology community in managing the challenges of metagenomics analysis. CAMERA supports extensive metadata based data acquisition and access, as well as execution of metagenomics experiments through standard and customized scientific workflows. Users can use a wide range of community analysis tools to select and invoke integrated annotation of genomic datasets. Users can also search and sort information based on selected metadata over the underlying semantic database. We present the semantic data model of CAMERA and its integration with scientific workflow execution information. We also describe how this model is used to interlink related workflows, where outputs of previous workflow executions can be used as inputs by subsequent workflow executions. We demonstrate the effectiveness of our model and approach through scenarios built on currently supported CAMERA workflows and analysis. © 2010 IEEE.
引用
收藏
页码:49 / 56
页数:7
相关论文
共 50 条
  • [1] An Algebraic Approach for Data-Centric Scientific Workflows
    Ogasawara, Eduardo
    Dias, Jonas
    de Oliveira, Daniel
    Porto, Fabio
    Valduriez, Patrick
    Mattoso, Marta
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (12): : 1328 - 1339
  • [2] A framework for collecting provenance in data-centric scientific workflows
    Simmhan, Yogesh L.
    Plale, Beth
    Gannon, Dennis
    ICWS 2006: IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, PROCEEDINGS, 2006, : 427 - +
  • [3] Orchestrating Data-Centric Workflows
    Barker, Adam
    Weissman, Jon B.
    van Hemert, Jano
    CCGRID 2008: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, VOLS 1 AND 2, PROCEEDINGS, 2008, : 210 - 217
  • [4] Methodological Approach to Data-Centric Cloudification of Scientific Iterative Workflows
    Caino-Lores, Silvina
    Lapin, Andrei
    Kropf, Peter
    Carretero, Jesus
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2016, 2016, 10048 : 469 - 482
  • [5] Realising Data-Centric Scientific Workflows with Provenance-Capturing on Data Lakes
    Hendrik Nolte
    Philipp Wieder
    Data Intelligence, 2022, 4 (02) : 426 - 438
  • [6] Realising Data-Centric Scientific Workflows with Provenance-Capturing on Data Lakes
    Nolte, Hendrik
    Wieder, Philipp
    DATA INTELLIGENCE, 2022, 4 (02) : 426 - 438
  • [7] Data-centric iteration in dynamic workflows
    Dias, Jonas
    Guerra, Gabriel
    Rochinha, Fernando
    Coutinho, Alvaro L. G. A.
    Valduriez, Patrick
    Mattoso, Marta
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2015, 46 : 114 - 126
  • [8] A Data-Centric Framework for Composable NLP Workflows
    Liu, Zhengzhong
    Ding, Guanxiong
    Bukkittu, Avinash
    Gupta, Mansi
    Gao, Pengzhi
    Ahmed, Atif
    Zhang, Shikun
    Gao, Xin
    Singhavi, Swapnil
    Li, Linwei
    Wei, Wei
    Hu, Zecong
    Shi, Haoran
    Liang, Xiaodan
    Mitamura, Teruko
    Xing, Eric P.
    Hu, Zhiting
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING: SYSTEM DEMONSTRATIONS, 2020, : 197 - 204
  • [9] In-memory staging and data-centric task placement for coupled scientific simulation workflows
    Zhang, Fan
    Jin, Tong
    Sun, Qian
    Romanus, Melissa
    Bui, Hoang
    Klasky, Scott
    Parashar, Manish
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (12):
  • [10] Decentralized orchestration of data-centric workflows in Cloud environments
    Javadi, Bahman
    Tomko, Martin
    Sinnott, Richard O.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2013, 29 (07): : 1826 - 1837