Principal Component Analysis Characterizes Shared Pathogenetics from Genome-Wide Association Studies

被引:12
作者
Chang, Diana [1 ,2 ]
Keinan, Alon [1 ,2 ]
机构
[1] Cornell Univ, Dept Biol Stat & Computat Biol, Ithaca, NY 14850 USA
[2] Cornell Univ, Program Computat Biol & Med, Ithaca, NY USA
关键词
SYSTEMIC-LUPUS-ERYTHEMATOSUS; INFLAMMATORY-BOWEL-DISEASE; RHEUMATOID-ARTHRITIS; SUSCEPTIBILITY LOCI; PARKINSONS-DISEASE; MULTIPLE-SCLEROSIS; AUTOIMMUNE-DISEASE; GENETIC RISK; IDENTIFIES; METAANALYSIS;
D O I
10.1371/journal.pcbi.1003820
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Genome-wide association studies (GWASs) have recently revealed many genetic associations that are shared between different diseases. We propose a method, disPCA, for genome-wide characterization of shared and distinct risk factors between and within disease classes. It flips the conventional GWAS paradigm by analyzing the diseases themselves, across GWAS datasets, to explore their "shared pathogenetics''. The method applies principal component analysis (PCA) to gene-level significance scores across all genes and across GWASs, thereby revealing shared pathogenetics between diseases in an unsupervised fashion. Importantly, it adjusts for potential sources of heterogeneity present between GWAS which can confound investigation of shared disease etiology. We applied disPCA to 31 GWASs, including autoimmune diseases, cancers, psychiatric disorders, and neurological disorders. The leading principal components separate these disease classes, as well as inflammatory bowel diseases from other autoimmune diseases. Generally, distinct diseases from the same class tend to be less separated, which is in line with their increased shared etiology. Enrichment analysis of genes contributing to leading principal components revealed pathways that are implicated in the immune system, while also pointing to pathways that have yet to be explored before in this context. Our results point to the potential of disPCA in going beyond epidemiological findings of the co-occurrence of distinct diseases, to highlighting novel genes and pathways that unsupervised learning suggest to be key players in the variability across diseases.
引用
收藏
页数:14
相关论文
共 97 条
[1]   Association Analysis of the Extended MHC Region in Celiac Disease Implicates Multiple Independent Susceptibility Loci [J].
Ahn, Richard ;
Ding, Yuan Chun ;
Murray, Joseph ;
Fasano, Alessio ;
Green, Peter H. R. ;
Neuhausen, Susan L. ;
Garner, Chad .
PLOS ONE, 2012, 7 (05)
[2]   Improved Detection of Common Variants Associated with Schizophrenia and Bipolar Disorder Using Pleiotropy-Informed Conditional False Discovery Rate [J].
Andreassen, Ole A. ;
Thompson, Wesley K. ;
Schork, Andrew J. ;
Ripke, Stephan ;
Mattingsdal, Morten ;
Kelsoe, John R. ;
Kendler, Kenneth S. ;
O'Donovan, Michael C. ;
Rujescu, Dan ;
Werge, Thomas ;
Sklar, Pamela ;
Roddey, J. Cooper ;
Chen, Chi-Hua ;
McEvoy, Linda ;
Desikan, Rahul S. ;
Djurovic, Srdjan ;
Dale, Anders M. .
PLOS GENETICS, 2013, 9 (04)
[3]  
[Anonymous], MOL PSYCHIAT
[4]  
[Anonymous], 1925, STAT METHODS RES WOR
[5]   Genome-wide association analysis of susceptibility and clinical phenotype in multiple sclerosis [J].
Baranzini, Sergio E. ;
Wang, Joanne ;
Gibson, Rachel A. ;
Galwey, Nicholas ;
Naegelin, Yvonne ;
Barkhof, Frederik ;
Radue, Ernst-Wilhelm ;
Lindberg, Raija L. P. ;
Uitdehaag, Bernard M. G. ;
Johnson, Michael R. ;
Angelakopoulou, Aspasia ;
Hall, Leslie ;
Richardson, Jill C. ;
Prinjha, Rab K. ;
Gass, Achim ;
Geurts, Jeroen J. G. ;
Kragt, Jolijn ;
Sombekke, Madeleine ;
Vrenken, Hugo ;
Qualley, Pamela ;
Lincoln, Robin R. ;
Gomez, Refujia ;
Caillier, Stacy J. ;
George, Michaela F. ;
Mousavi, Hourieh ;
Guerrero, Rosa ;
Okuda, Darin T. ;
Cree, Bruce A. C. ;
Green, Ari J. ;
Waubant, Emmanuelle ;
Goodin, Douglas S. ;
Pelletier, Daniel ;
Matthews, Paul M. ;
Hauser, Stephen L. ;
Kappos, Ludwig ;
Polman, Chris H. ;
Oksenberg, Jorge R. .
HUMAN MOLECULAR GENETICS, 2009, 18 (04) :767-778
[6]   Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region [J].
Barrett, Jeffrey C. ;
Lee, James C. ;
Lees, Charles W. ;
Prescott, Natalie J. ;
Anderson, Carl A. ;
Phillips, Anne ;
Wesley, Emma ;
Parnell, Kirstie ;
Zhang, Hu ;
Drummond, Hazel ;
Nimmo, Elaine R. ;
Massey, Dunecan ;
Blaszczyk, Kasia ;
Elliott, Timothy ;
Cotterill, Lynn ;
Dallal, Helen ;
Lobo, Alan J. ;
Mowat, Craig ;
Sanderson, Jeremy D. ;
Jewell, Derek P. ;
Newman, William G. ;
Edwards, Cathryn ;
Ahmad, Tariq ;
Mansfield, John C. ;
Satsangi, Jack ;
Parkes, Miles ;
Mathew, Christopher G. ;
Donnelly, Peter ;
Peltonen, Leena ;
Blackwell, Jenefer M. ;
Bramon, Elvira ;
Brown, Matthew A. ;
Casas, Juan P. ;
Corvin, Aiden ;
Craddock, Nicholas ;
Deloukas, Panos ;
Duncanson, Audrey ;
Jankowski, Janusz ;
Markus, Hugh S. ;
McCarthy, Mark I. ;
Palmer, Colin N. A. ;
Plomin, Robert ;
Rautanen, Anna ;
Sawcer, Stephen J. ;
Samani, Nilesh ;
Trembath, Richard C. ;
Viswanathan, Ananth C. ;
Wood, Nicholas ;
Spencer, Chris C. A. ;
Bellenguez, Celine .
NATURE GENETICS, 2009, 41 (12) :1330-U99
[7]   Genome-wide association study and meta-analysis find that over 40 loci affect risk of type 1 diabetes [J].
Barrett, Jeffrey C. ;
Clayton, David G. ;
Concannon, Patrick ;
Akolkar, Beena ;
Cooper, Jason D. ;
Erlich, Henry A. ;
Julier, Cecile ;
Morahan, Grant ;
Nerup, Jorn ;
Nierras, Concepcion ;
Plagnol, Vincent ;
Pociot, Flemming ;
Schuilenburg, Helen ;
Smyth, Deborah J. ;
Stevens, Helen ;
Todd, John A. ;
Walker, Neil M. ;
Rich, Stephen S. .
NATURE GENETICS, 2009, 41 (06) :703-707
[8]   Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture [J].
Berndt, Sonja I. ;
Gustafsson, Stefan ;
Maegi, Reedik ;
Ganna, Andrea ;
Wheeler, Eleanor ;
Feitosa, Mary F. ;
Justice, Anne E. ;
Monda, Keri L. ;
Croteau-Chonka, Damien C. ;
Day, Felix R. ;
Esko, Tonu ;
Fall, Tove ;
Ferreira, Teresa ;
Gentilini, Davide ;
Jackson, Anne U. ;
Luan, Jian'an ;
Randall, Joshua C. ;
Vedantam, Sailaja ;
Willer, Cristen J. ;
Winkler, Thomas W. ;
Wood, Andrew R. ;
Workalemahu, Tsegaselassie ;
Hu, Yi-Juan ;
Lee, Sang Hong ;
Liang, Liming ;
Lin, Dan-Yu ;
Min, Josine L. ;
Neale, Benjamin M. ;
Thorleifsson, Gudmar ;
Yang, Jian ;
Albrecht, Eva ;
Amin, Najaf ;
Bragg-Gresham, Jennifer L. ;
Cadby, Gemma ;
den Heijer, Martin ;
Eklund, Niina ;
Fischer, Krista ;
Goel, Anuj ;
Hottenga, Jouke-Jan ;
Huffman, Jennifer E. ;
Jarick, Ivonne ;
Johansson, Asa ;
Johnson, Toby ;
Kanoni, Stavroula ;
Kleber, Marcus E. ;
Koenig, Inke R. ;
Kristiansson, Kati ;
Kutalik, Zoltn ;
Lamina, Claudia ;
Lecoeur, Cecile .
NATURE GENETICS, 2013, 45 (05) :501-U69
[9]   Genome-wide association of major depression: description of samples for the GAIN Major Depressive Disorder Study: NTR and NESDA biobank projects [J].
Boomsma, Dorret I. ;
Willemsen, Gonneke ;
Sullivan, Patrick F. ;
Heutink, Peter ;
Meijer, Piet ;
Sondervan, David ;
Kluft, Cornelis ;
Smit, Guus ;
Nolen, Willem A. ;
Zitman, Frans G. ;
Smit, Johannes H. ;
Hoogendijk, Witte J. ;
van Dyck, Richard ;
De Geus, Eco J. C. ;
Penninx, Brenda W. J. H. .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2008, 16 (03) :335-342
[10]   Diversity of antibody-mediated immunity at the mucosal barrier [J].
Bouvet, JP ;
Fischetti, VA .
INFECTION AND IMMUNITY, 1999, 67 (06) :2687-2691