A Case for Data Commons: Toward Data Science as a Service

被引:47
|
作者
Grossman, Robert L. [1 ,2 ,3 ]
Heath, Allison [4 ]
Murphy, Mark [1 ]
Patterson, Maria [1 ]
Wells, Walt [5 ]
机构
[1] Univ Chicago, Ctr Data Intens Sci, Chicago, IL 60637 USA
[2] Univ Chicago, Div Biol Sci, Chicago, IL 60637 USA
[3] Univ Chicago, Computat Inst, Chicago, IL 60637 USA
[4] Univ Chicago, Ctr Data Intens Sci, Res, Chicago, IL 60637 USA
[5] Ctr Computat Sci Res, New York, NY USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
cloud computing; data as a service; data commons; science as a service; scientific computing; software as services;
D O I
10.1109/MCSE.2016.92
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Data commons collocate data, storage, and computing infrastructure with core services and commonly used tools and applications for managing, analyzing, and sharing data to create an interoperable resource for the research community. An architecture for data commons is described, as well as some lessons learned from operating several large-scale data commons.
引用
收藏
页码:10 / 20
页数:11
相关论文
共 50 条
  • [31] Sustaining the Data and Bioresource Commons
    Schofield, Paul N.
    Eppig, Janan
    Huala, Eva
    de Angelis, Martin Hrabe
    Harvey, Mark
    Davidson, Duncan
    Weaver, Tom
    Brown, Steve
    Smedley, Damian
    Rosenthal, Nadia
    Schughart, Klaus
    Aidinis, Vassilis
    Tocchini-Valentini, Glauco
    Hancock, John M.
    SCIENCE, 2010, 330 (6004) : 592 - 593
  • [32] The NCI Genomic Data Commons
    Allison P. Heath
    Vincent Ferretti
    Stuti Agrawal
    Maksim An
    James C. Angelakos
    Renuka Arya
    Rosita Bajari
    Bilal Baqar
    Justin H. B. Barnowski
    Jeffrey Burt
    Ann Catton
    Brandon F. Chan
    Fay Chu
    Kim Cullion
    Tanja Davidsen
    Phuong-My Do
    Christian Dompierre
    Martin L. Ferguson
    Michael S. Fitzsimons
    Michael Ford
    Miyuki Fukuma
    Sharon Gaheen
    Gajanan L. Ganji
    Tzintzuni I. Garcia
    Sameera S. George
    Daniela S. Gerhard
    Francois Gerthoffert
    Fauzi Gomez
    Kang Han
    Kyle M. Hernandez
    Biju Issac
    Richard Jackson
    Mark A. Jensen
    Sid Joshi
    Ajinkya Kadam
    Aishmit Khurana
    Kyle M. J. Kim
    Victoria E. Kraft
    Shenglai Li
    Tara M. Lichtenberg
    Janice Lodato
    Laxmi Lolla
    Plamen Martinov
    Jeffrey A. Mazzone
    Daniel P. Miller
    Ian Miller
    Joshua S. Miller
    Koji Miyauchi
    Mark W. Murphy
    Thomas Nullet
    Nature Genetics, 2021, 53 : 257 - 262
  • [33] The NCI Genomic Data Commons
    Heath, Allison P.
    Ferretti, Vincent
    Agrawal, Stuti
    An, Maksim
    Angelakos, James C.
    Arya, Renuka
    Bajari, Rosita
    Baqar, Bilal
    Barnowski, Justin H. B.
    Burt, Jeffrey
    Catton, Ann
    Chan, Brandon F.
    Chu, Fay
    Cullion, Kim
    Davidsen, Tanja
    Do, Phuong-My
    Dompierre, Christian
    Ferguson, Martin L.
    Fitzsimons, Michael S.
    Ford, Michael
    Fukuma, Miyuki
    Gaheen, Sharon
    Ganji, Gajanan L.
    Garcia, Tzintzuni I.
    George, Sameera S.
    Gerhard, Daniela S.
    Gerthoffert, Francois
    Gomez, Fauzi
    Han, Kang
    Hernandez, Kyle M.
    Issac, Biju
    Jackson, Richard
    Jensen, Mark A.
    Joshi, Sid
    Kadam, Ajinkya
    Khurana, Aishmit
    Kim, Kyle M. J.
    Kraft, Victoria E.
    Li, Shenglai
    Lichtenberg, Tara M.
    Lodato, Janice
    Lolla, Laxmi
    Martinov, Plamen
    Mazzone, Jeffrey A.
    Miller, Daniel P.
    Miller, Ian
    Miller, Joshua S.
    Miyauchi, Koji
    Murphy, Mark W.
    Nullet, Thomas
    NATURE GENETICS, 2021, 53 (03) : 257 - 262
  • [34] Genomic Data Commons Expands
    不详
    JOURNAL OF NUCLEAR MEDICINE, 2016, 57 (09) : 21N - 21N
  • [35] NCI Imaging Data Commons
    Fedorov, A.
    Longabaugh, W.
    Pot, D.
    Clunie, D.
    Pieper, S.
    Lewis, R.
    Aerts, H.
    Homeyer, A.
    Herrmann, M.
    Wagner, U.
    Pihl, T.
    Farahani, K.
    Kikinis, R.
    MEDICAL PHYSICS, 2021, 48 (06)
  • [36] NCI Imaging Data Commons
    Fedorov, A.
    Longabaugh, W.
    Pot, D.
    Clunie, D.
    Pieper, S.
    Lewis, R.
    Aerts, H.
    Homeyer, A.
    Herrmann, M.
    Wagner, U.
    Pihl, T.
    Farahani, K.
    Kikinis, R.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2021, 111 (03): : E101 - E101
  • [37] Managing the data commons: Controlled sharing of scholarly data
    Eschenfelder, Kristin R.
    Johnson, Andrew
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2014, 65 (09) : 1757 - 1774
  • [38] ICSSR Data Service: A National Initiative for Sharing of Social Science Research Data
    Arora, Jagdish
    Pradhan, Pallab
    Patel, Yatrik
    Pandya, Miteshkumar
    Solanki, Hiteshkumar
    Vaghela, Divyakant
    DATA SCIENCE LANDSCAPE: TOWARDS RESEARCH STANDARDS AND PROTOCOLS, 2018, 38 : 107 - 125
  • [39] Self-service Data Science for Healthcare Professionals: A Data Preparation Approach
    Spruit, Marco
    Dedding, Thomas
    Vijlbrief, Daniel
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 5: HEALTHINF, 2020, : 724 - 734
  • [40] DATA SHARING IN THE SERVICE OF SCIENCE AND SOCIETY: IMPLEMENTATION OF THE GEOSS DATA SHARING PRINCIPLES
    Kamei, Masatoshi
    NETWORKING THE WORLD WITH REMOTE SENSING, 2010, 38 : 182 - 185