The NCI Genomic Data Commons

Cited by: 81
Authors
Heath, Allison P. [1]
Ferretti, Vincent [2, 9]
Agrawal, Stuti [3, 10]
An, Maksim [3, 11]
Angelakos, James C. [3]
Arya, Renuka [3]
Bajari, Rosita [2]
Baqar, Bilal [3]
Barnowski, Justin H. B. [3, 12]
Burt, Jeffrey [2]
Catton, Ann [2]
Chan, Brandon F. [2]
Chu, Fay [4]
Cullion, Kim [2]
Davidsen, Tanja [5]
Do, Phuong-My [2]
Dompierre, Christian [3, 13]
Ferguson, Martin L. [5]
Fitzsimons, Michael S. [3, 14]
Ford, Michael [3, 15]
Fukuma, Miyuki [2]
Gaheen, Sharon [6]
Ganji, Gajanan L. [3]
Garcia, Tzintzuni I. [7]
George, Sameera S. [3, 16]
Gerhard, Daniela S. [5]
Gerthoffert, Francois [2, 17]
Gomez, Fauzi [3]
Han, Kang [3]
Hernandez, Kyle M. [7]
Issac, Biju [6]
Jackson, Richard [3, 12]
Jensen, Mark A. [6]
Joshi, Sid [2]
Kadam, Ajinkya [3, 18]
Khurana, Aishmit [2]
Kim, Kyle M. J. [2]
Kraft, Victoria E. [3]
Li, Shenglai [3]
Lichtenberg, Tara M.
Lodato, Janice [3, 19]
Lolla, Laxmi [4]
Martinov, Plamen [8, 20]
Mazzone, Jeffrey A. [3]
Miller, Daniel P. [1, 3]
Miller, Ian [3]
Miller, Joshua S. [3]
Miyauchi, Koji [2]
Murphy, Mark W. [3, 21]
Nullet, Thomas [3]
Affiliations
[1] Childrens Hosp Philadelphia, Philadelphia, PA 19104 USA
[2] Ontario Inst Canc Res, Toronto, ON, Canada
[3] Univ Chicago, Ctr Translat Data Sci, Chicago, IL 60637 USA
[4] Essential Software, Rockville, MD USA
[5] NCI, Bethesda, MD 20892 USA
[6] Frederick Natl Lab Canc Res, Leidos Biomed Res, Frederick, MD USA
[7] Univ Chicago, Ctr Res Informat, Chicago, IL 60637 USA
[8] Univ Chicago, Div Biol Sci, Informat Secur Off, Chicago, IL 60637 USA
[9] Univ Montreal, CHU St Justine, Montreal, PQ, Canada
[10] Merck Healthcare, Darmstadt, Germany
[11] Microsoft Corp, Redmond, WA 98052 USA
[12] Neighborhoods.com, Chicago, IL USA
[13] Sympatic, Chicago, IL USA
[14] Univ Illinois, Chicago, IL USA
[15] ModelOp, Chicago, IL USA
[16] BenchPrep, Chicago, IL USA
[17] Jahia Solut, Toronto, ON, Canada
[18] Quantcast, Chicago, IL USA
[19] Amount, Chicago, IL USA
[20] Open Commons Consortium, Chicago, IL USA
[21] Cisco, San Jose, CA USA
[22] Fdn Progreso & Salud FPS, Clin Bioinformat Area, Seville, Spain
[23] Tempus, Chicago, IL USA
[24] Cole Parmer, Vernon Hills, IL USA
[25] Blink Hlth, New York, NY USA
[26] Argonne Natl Labs, Argonne, IL USA
[27] Civis Analyt, Chicago, IL USA
[28] Google, Chicago, IL USA
[29] Univ Notre Dame, Notre Dame, IN 46556 USA
[30] Appify, San Francisco, CA USA
[31] Peyk, Los Angeles, CA USA
Funding
National Institutes of Health (NIH), USA
DOI
10.1038/s41588-021-00791-5
Chinese Library Classification (CLC) number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
The National Cancer Institute (NCI) Genomic Data Commons (GDC) contains more than 2.9 petabytes of genomic and associated clinical data from more than 60 NCI-funded and other contributed cancer genomics research projects. The GDC consists of five applications over a common data model and a common application programming interface.
Pages: 257-262
Page count: 6