A Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating Cancer Research and Precision Medicine

Cited by: 59
Authors
Hinkson, Izumi V. [1, 2]
Davidsen, Tanja M. [1]
Klemm, Juli D. [1]
Chandramouliswaran, Ishwar [3]
Kerlavage, Anthony R. [1]
Kibbe, Warren A. [1, 4]
Affiliations
[1] NCI, Ctr Biomed Informat & Informat Technol, Rockville, MD 20850 USA
[2] Amer Assoc Advancement Sci, Sci & Technol Policy Fellowship Program, Washington, DC USA
[3] NIAID, Off Genom & Adv Technol, 9000 Rockville Pike, Bethesda, MD 20892 USA
[4] Duke Univ, Dept Biostat & Bioinformat, Sch Med, Durham, NC USA
Source
FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY | 2017 / Volume 5
Funding
National Institutes of Health (NIH), USA
Keywords
genomics; proteomics; imaging; big data; cancer; precision medicine; cloud infrastructure; PROTEOGENOMIC CHARACTERIZATION; GENOMIC CHARACTERIZATION
DOI
10.3389/fcell.2017.00083
Chinese Library Classification (CLC) number
Q2 [Cell Biology]
Discipline classification codes
071009; 090102
Abstract
Advancements in next-generation sequencing and other -omics technologies are accelerating the detailed molecular characterization of individual patient tumors, and driving the evolution of precision medicine. Cancer is no longer considered a single disease, but rather, a diverse array of diseases wherein each patient has a unique collection of germline variants and somatic mutations. Molecular profiling of patient-derived samples has led to a data explosion that could help us understand the contributions of environment and germline to risk, therapeutic response, and outcome. To maximize the value of these data, an interdisciplinary approach is paramount. The National Cancer Institute (NCI) has initiated multiple projects to characterize tumor samples using multi-omic approaches. These projects harness the expertise of clinicians, biologists, computer scientists, and software engineers to investigate cancer biology and therapeutic response in multidisciplinary teams. Petabytes of cancer genomic, transcriptomic, epigenomic, proteomic, and imaging data have been generated by these projects. To address the data analysis challenges associated with these large datasets, the NCI has sponsored the development of the Genomic Data Commons (GDC) and three Cloud Resources. The GDC ensures data and metadata quality, ingests and harmonizes genomic data, and securely redistributes the data. During their pilot phase, the Cloud Resources tested multiple cloud-based approaches for enhancing data access, collaboration, computational scalability, resource democratization, and reproducibility. These NCI-led efforts are continuously being refined to better support open data practices and precision oncology, and to serve as building blocks of the NCI Cancer Research Data Commons.
Pages: 7