Analysis commons, a team approach to discovery in a big-data environment for genetic epidemiology

被引:70
作者
Brody, Jennifer A. [1 ]
Morrison, Alanna C. [2 ]
Bis, Joshua C. [1 ]
O'Connell, Jeffrey R. [3 ]
Brown, Michael R. [2 ]
Huffman, Jennifer E. [4 ,5 ]
Ames, Darren C. [6 ]
Carroll, Andrew [6 ]
Conomos, Matthew P. [7 ]
Gabriel, Stacey [8 ]
Gibbs, Richard A. [9 ]
Gogarten, Stephanie M. [7 ]
Gupta, Namrata [8 ]
Jaquish, Cashell E. [10 ]
Johnson, Andrew D. [4 ,5 ]
Lewis, Joshua P. [3 ]
Liu, Xiaoming [2 ]
Manning, Alisa K. [11 ,12 ,13 ,14 ]
Papanicolaou, George J. [10 ]
Pitsillides, Achilleas N. [4 ,5 ]
Rice, Kenneth M. [7 ]
Salerno, William [9 ]
Sitlani, Colleen M. [1 ]
Smith, Nicholas L. [1 ,15 ,16 ,17 ]
Heckbert, Susan R. [1 ,17 ]
Laurie, Cathy C. [7 ]
Mitchell, Braxton D. [3 ,18 ]
Vasan, Ramachandran S. [4 ,5 ,19 ,20 ,21 ]
Rich, Stephen S. [22 ]
Rotter, Jerome I. [23 ,24 ]
Wilson, James G. [25 ]
Boerwinkle, Eric [2 ,9 ]
Psaty, Bruce M. [1 ,15 ,26 ,27 ]
Cupples, L. Adrienne [4 ,5 ,28 ]
机构
[1] Univ Washington, Dept Med, Cardiovasc Hlth Res Unit, Seattle, WA 98195 USA
[2] Univ Texas Hlth Sci Ctr Houston, Sch Publ Hlth, Dept Epidemiol Human Genet & Environm Sci, Ctr Human Genet, Houston, TX 77030 USA
[3] Univ Maryland, Dept Med, Div Endocrinol Diabet & Nutr, Baltimore, MD 21201 USA
[4] NHLBI, Framingham Heart Study, Framingham, MA USA
[5] Boston Univ, Framingham, MA USA
[6] DNAnexus Inc, Mountain View, CA USA
[7] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[8] Broad Inst, Program Med & Populat Genet, Cambridge, MA USA
[9] Baylor Coll Med, Human Genome Sequencing Ctr, Houston, TX 77030 USA
[10] NHLBI, Div Cardiovasc Sci, Bldg 10, Bethesda, MD 20892 USA
[11] Massachusetts Gen Hosp, Ctr Human Genet Res, Boston, MA 02114 USA
[12] Broad Inst Harvard, Program Med & Populat Genet, Cambridge, MA USA
[13] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[14] Harvard Med Sch, Dept Med, Boston, MA USA
[15] Kaiser Permanente Washington Hlth Res Inst, Seattle, WA USA
[16] Seattle Epidemiol Res & Informat Ctr, Dept Vet Affairs, Off Res & Dev, Seattle, WA USA
[17] Univ Washington, Dept Epidemiol, Seattle, WA 98195 USA
[18] Baltimore Vet Adm Med Ctr, Geriatr Res & Educ Clin Ctr, Baltimore, MD USA
[19] Boston Univ, Sch Med, Dept Med, Prevent Med & Epidemiol Sect, Boston, MA 02118 USA
[20] Boston Univ, Sch Med, Dept Med, Cardiol Sect, Boston, MA 02118 USA
[21] Boston Univ, Sch Publ Hlth, Dept Epidemiol, Boston, MA USA
[22] Univ Virginia, Ctr Publ Hlth Genom, Charlottesville, VA USA
[23] Univ Calif Los Angeles, Med Ctr, Inst Translat Genom & Populat Sci, Dept Pediat,LABioMed Harbor, Torrance, CA 90509 USA
[24] Univ Calif Los Angeles, Med Ctr, Inst Translat Genom & Populat Sci, Dept Med,LABioMed Harbor, Torrance, CA 90509 USA
[25] Univ Mississippi, Med Ctr, Dept Physiol & Biophys, Jackson, MS 39216 USA
[26] Univ Washington, Dept Med, Seattle, WA USA
[27] Univ Washington, Dept Epidemiol & Hlth Serv, Seattle, WA 98195 USA
[28] Boston Univ, Sch Publ Hlth, Dept Biostat, Boston, MA USA
基金
美国国家卫生研究院;
关键词
ASSOCIATION; GENOME; VARIANTS; RARE;
D O I
10.1038/ng.3968
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The increasing volume of whole-genome sequence (WGS) and multi-omics data requires new approaches for analysis. As one solution, we have created the cloud-based Analysis Commons, which brings together genotype and phenotype data from multiple studies in a setting that is accessible by multiple investigators. This framework addresses many of the challenges of multicenter WGS analyses, including data-sharing mechanisms, phenotype harmonization, integrated multi-omics analyses, annotation and computational flexibility. In this setting, the computational pipeline facilitates a sequence-to-discovery analysis workflow illustrated here by an analysis of plasma fibrinogen levels in 3,996 individuals from the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) WGS program. The Analysis Commons represents a novel model for translating WGS resources from a massive quantity of phenotypic and genomic data into knowledge of the determinants of health and disease risk in diverse human populations.
引用
收藏
页码:1560 / 1563
页数:4
相关论文
共 12 条
[1]   An integrated encyclopedia of DNA elements in the human genome [J].
Dunham, Ian ;
Kundaje, Anshul ;
Aldred, Shelley F. ;
Collins, Patrick J. ;
Davis, CarrieA. ;
Doyle, Francis ;
Epstein, Charles B. ;
Frietze, Seth ;
Harrow, Jennifer ;
Kaul, Rajinder ;
Khatun, Jainab ;
Lajoie, Bryan R. ;
Landt, Stephen G. ;
Lee, Bum-Kyu ;
Pauli, Florencia ;
Rosenbloom, Kate R. ;
Sabo, Peter ;
Safi, Alexias ;
Sanyal, Amartya ;
Shoresh, Noam ;
Simon, Jeremy M. ;
Song, Lingyun ;
Trinklein, Nathan D. ;
Altshuler, Robert C. ;
Birney, Ewan ;
Brown, James B. ;
Cheng, Chao ;
Djebali, Sarah ;
Dong, Xianjun ;
Dunham, Ian ;
Ernst, Jason ;
Furey, Terrence S. ;
Gerstein, Mark ;
Giardine, Belinda ;
Greven, Melissa ;
Hardison, Ross C. ;
Harris, Robert S. ;
Herrero, Javier ;
Hoffman, Michael M. ;
Iyer, Sowmya ;
Kellis, Manolis ;
Khatun, Jainab ;
Kheradpour, Pouya ;
Kundaje, Anshul ;
Lassmann, Timo ;
Li, Qunhua ;
Lin, Xinying ;
Marinov, Georgi K. ;
Merkel, Angelika ;
Mortazavi, Ali .
NATURE, 2012, 489 (7414) :57-74
[2]   The genetic architecture of type 2 diabetes [J].
Fuchsberger, Christian ;
Flannick, Jason ;
Teslovich, Tanya M. ;
Mahajan, Anubha ;
Agarwala, Vineeta ;
Gaulton, Kyle J. ;
Ma, Clement ;
Fontanillas, Pierre ;
Moutsianas, Loukas ;
McCarthy, Davis J. ;
Rivas, Manuel A. ;
Perry, John R. B. ;
Sim, Xueling ;
Blackwell, Thomas W. ;
Robertson, Neil R. ;
Rayner, N. William ;
Cingolani, Pablo ;
Locke, Adam E. ;
Tajes, Juan Fernandez ;
Highland, Heather M. ;
Dupuis, Josee ;
Chines, Peter S. ;
Lindgren, Cecilia M. ;
Hartl, Christopher ;
Jackson, Anne U. ;
Chen, Han ;
Huyghe, Jeroen R. ;
van de Bunt, Martijn ;
Pearson, Richard D. ;
Kumar, Ashish ;
Mueller-Nurasyid, Martina ;
Grarup, Niels ;
Stringham, Heather M. ;
Gamazon, Eric R. ;
Lee, Jaehoon ;
Chen, Yuhui ;
Scott, Robert A. ;
Below, Jennifer E. ;
Chen, Peng ;
Huang, Jinyan ;
Go, Min Jin ;
Stitzel, Michael L. ;
Pasko, Dorota ;
Parker, Stephen C. J. ;
Varga, Tibor V. ;
Green, Todd ;
Beer, Nicola L. ;
Day-Williams, Aaron G. ;
Ferreira, Teresa ;
Fingerlin, Tasha .
NATURE, 2016, 536 (7614) :41-+
[3]   Rare and low-frequency variants and their association with plasma levels of fibrinogen, FVII, FVIII, and vWF [J].
Huffman, Jennifer E. ;
de Vries, Paul S. ;
Morrison, Alanna C. ;
Sabater-Lleal, Maria ;
Kacprowski, Tim ;
Auer, Paul L. ;
Brody, Jennifer A. ;
Chasman, Daniel I. ;
Chen, Ming-Huei ;
Guo, Xiuqing ;
Lin, Li-An ;
Marioni, Riccardo E. ;
Mueller-Nurasyid, Martina ;
Yanek, Lisa R. ;
Pankratz, Nathan ;
Grove, Megan L. ;
de Maat, Moniek P. M. ;
Cushman, Mary ;
Wiggins, Kerri L. ;
Qi, Lihong ;
Sennblad, Bengt ;
Harris, Sarah E. ;
Polasek, Ozren ;
Riess, Helene ;
Rivadeneira, Fernando ;
Rose, Lynda M. ;
Goel, Anuj ;
Taylor, Kent D. ;
Teumer, Alexander ;
Uitterlinden, Andre G. ;
Vaidya, Dhananjay ;
Yao, Jie ;
Tang, Weihong ;
Levy, Daniel ;
Waldenberger, Melanie ;
Becker, Diane M. ;
Folsom, Aaron R. ;
Giulianini, Franco ;
Greinacher, Andreas ;
Hofman, Albert ;
Huang, Chiang-Ching ;
Kooperberg, Charles ;
Silveira, Angela ;
Starr, John M. ;
Strauch, Konstantin ;
Strawbridge, Rona J. ;
Wright, Alan F. ;
McKnight, Barbara ;
Franco, Oscar H. ;
Zakai, Neil .
BLOOD, 2015, 126 (11) :E19-E29
[4]   A general framework for estimating the relative pathogenicity of human genetic variants [J].
Kircher, Martin ;
Witten, Daniela M. ;
Jain, Preti ;
O'Roak, Brian J. ;
Cooper, Gregory M. ;
Shendure, Jay .
NATURE GENETICS, 2014, 46 (03) :310-+
[5]   WGSA: an annotation pipeline for human genome sequencing studies [J].
Liu, Xiaoming ;
White, Simon ;
Peng, Bo ;
Johnson, Andrew D. ;
Brody, Jennifer A. ;
Li, Alexander H. ;
Huang, Zhuoyi ;
Carroll, Andrew ;
Wei, Peng ;
Gibbs, Richard ;
Klein, Robert J. ;
Boerwinkle, Eric .
JOURNAL OF MEDICAL GENETICS, 2016, 53 (02) :111-112
[6]  
Lumley T., 2016, PREPRINT
[7]   Whole-genome sequence-based analysis of high-density lipoprotein cholesterol [J].
Morrison, Alanna C. ;
Voorman, Arend ;
Johnson, Andrew D. ;
Liu, Xiaoming ;
Yu, Jin ;
Li, Alexander ;
Muzny, Donna ;
Yu, Fuli ;
Rice, Kenneth ;
Zhu, Chengsong ;
Bis, Joshua ;
Heiss, Gerardo ;
O'Donnell, Christopher J. ;
Psaty, Bruce M. ;
Cupples, L. Adrienne ;
Gibbs, Richard ;
Boerwinkle, Eric .
NATURE GENETICS, 2013, 45 (08) :899-U29
[8]   Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium Design of Prospective Meta-Analyses of Genome-Wide Association Studies From 5 Cohorts [J].
Psaty, Bruce M. ;
O'Donnell, Christopher J. ;
Gudnason, Vilmundur ;
Lunetta, Kathryn L. ;
Folsom, Aaron R. ;
Rotter, Jerome I. ;
Uitterlinden, Andre G. ;
Harris, Tamara B. ;
Witteman, Jacqueline C. M. ;
Boerwinkle, Eric .
CIRCULATION-CARDIOVASCULAR GENETICS, 2009, 2 (01) :73-U128
[9]   Launching genomics into the cloud: deployment of Mercury, a next generation sequence analysis pipeline [J].
Reid, Jeffrey G. ;
Carroll, Andrew ;
Veeraraghavan, Narayanan ;
Dahdouli, Mahmoud ;
Sundquist, Andreas ;
English, Adam ;
Bainbridge, Matthew ;
White, Simon ;
Salerno, William ;
Buhay, Christian ;
Yu, Fuli ;
Muzny, Donna ;
Daly, Richard ;
Duyk, Geoff ;
Gibbs, Richard A. ;
Boerwinkle, Eric .
BMC BIOINFORMATICS, 2014, 15
[10]   The Precision Medicine Initiative's All of Us Research Program: an agenda for research on its ethical, legal, and social issues [J].
Sankar, Pamela L. ;
Parker, Lisa S. .
GENETICS IN MEDICINE, 2017, 19 (07) :743-750