FUN-LDA: A Latent Dirichlet Allocation Model for Predicting Tissue-Specific Functional Effects of Noncoding Variation: Methods and Applications

被引:55
作者
Backenroth, Daniel [1 ]
He, Zihuai [1 ]
Kiryluk, Krzysztof [2 ]
Boeva, Valentina [3 ,4 ]
Pethukova, Lynn [5 ,6 ]
Khurana, Ekta [7 ]
Christiano, Angela [6 ,8 ]
Buxbaum, Joseph D. [9 ,10 ,11 ,12 ,13 ,14 ]
Ionita-Laza, Iuliana [1 ]
机构
[1] Columbia Univ, Dept Biostat, New York, NY 10032 USA
[2] Columbia Univ, Dept Med, New York, NY 10032 USA
[3] INSERM, U900, F-75005 Paris, France
[4] PSL Res Univ, Inst Curie, Mines ParisTech, F-75005 Paris, France
[5] Columbia Univ, Dept Epidemiol, New York, NY 10032 USA
[6] Columbia Univ, Dept Dermatol, New York, NY 10032 USA
[7] Cornell Univ, Weill Med Coll, Dept Physiol & Biophys, New York, NY 10021 USA
[8] Columbia Univ, Dept Genet & Dev, New York, NY 10032 USA
[9] Icahn Sch Med Mt Sinai, Dept Psychiat, New York, NY 10029 USA
[10] Icahn Sch Med Mt Sinai, Dept Neurosci, New York, NY 10029 USA
[11] Icahn Sch Med Mt Sinai, Dept Genet, New York, NY 10029 USA
[12] Icahn Sch Med Mt Sinai, Dept Genom Sci, New York, NY 10029 USA
[13] Icahn Sch Med Mt Sinai, Friedman Brain Inst, New York, NY 10029 USA
[14] Icahn Sch Med Mt Sinai, Mindich Child Hlth & Dev Inst, New York, NY 10029 USA
基金
美国国家卫生研究院;
关键词
TRANSCRIPTION FACTOR-BINDING; GENOME-WIDE ASSOCIATION; SYSTEMATIC DISSECTION; REGULATORY VARIANTS; GENETIC ARCHITECTURE; SUSCEPTIBILITY LOCI; SEQUENCE VARIANTS; DNA ELEMENTS; RISK LOCI; IDENTIFICATION;
D O I
10.1016/j.ajhg.2018.03.026
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We describe a method based on a latent Dirichlet allocation model for predicting functional effects of noncoding genetic variants in a cell-type- and/or tissue-specific way (FUN-LDA). Using this unsupervised approach, we predict tissue-specific functional effects for every position in the human genome in 127 different tissues and cell types. We demonstrate the usefulness of our predictions by using several validation experiments. Using eQTL data from several sources, including the GTEx project, Geuvadis project, and TwinsUK cohort, we show that eQTLs in specific tissues tend to be most enriched among the predicted functional variants in relevant tissues in Roadmap. We further show how these integrated functional scores can be used for (1) deriving the most likely cell or tissue type causally implicated for a complex trait by using summary statistics from genome-wide association studies and (2) estimating a tissue-based correlation matrix of various complex traits. We found large enrichment of heritability in functional components of relevant tissues for various complex traits, and FUN-LDA yielded higher enrichment estimates than existing methods. Finally, using experimentally validated functional variants from the literature and variants possibly implicated in disease by previous studies, we rigorously compare FUN-LDA with state-ofthe-art functional annotation methods and show that FUN-LDA has better prediction accuracy and higher resolution than these methods. In particular, our results suggest that tissue- and cell-type-specific functional prediction methods tend to have substantially better prediction accuracy than organism-level prediction methods. Scores for each position in the human genome and for each ENCODE and Roadmap tissue are available online (see feb Resources).
引用
收藏
页码:920 / 942
页数:23
相关论文
共 86 条
[1]   Genetic effects on gene expression across human tissues [J].
Aguet, Francois ;
Brown, Andrew A. ;
Castel, Stephane E. ;
Davis, Joe R. ;
He, Yuan ;
Jo, Brian ;
Mohammadi, Pejman ;
Park, Yoson ;
Parsana, Princy ;
Segre, Ayellet V. ;
Strober, Benjamin J. ;
Zappala, Zachary ;
Cummings, Beryl B. ;
Gelfand, Ellen T. ;
Hadley, Kane ;
Huang, Katherine H. ;
Lek, Monkol ;
Li, Xiao ;
Nedzel, Jared L. ;
Nguyen, Duyen Y. ;
Noble, Michael S. ;
Sullivan, Timothy J. ;
Tukiainen, Taru ;
MacArthur, Daniel G. ;
Getz, Gad ;
Management, Nih Program ;
Addington, Anjene ;
Guan, Ping ;
Koester, Susan ;
Little, A. Roger ;
Lockhart, Nicole C. ;
Moore, Helen M. ;
Rao, Abhi ;
Struewing, Jeffery P. ;
Volpi, Simona ;
Collection, Biospecimen ;
Brigham, Lori E. ;
Hasz, Richard ;
Hunter, Marcus ;
Johns, Christopher ;
Johnson, Mark ;
Kopen, Gene ;
Leinweber, William F. ;
Lonsdale, John T. ;
McDonald, Alisa ;
Mestichelli, Bernadette ;
Myer, Kevin ;
Roe, Bryan ;
Salvatore, Michael ;
Shad, Saboor .
NATURE, 2017, 550 (7675) :204-+
[2]   Hundreds of variants clustered in genomic loci and biological pathways affect human height [J].
Allen, Hana Lango ;
Estrada, Karol ;
Lettre, Guillaume ;
Berndt, Sonja I. ;
Weedon, Michael N. ;
Rivadeneira, Fernando ;
Willer, Cristen J. ;
Jackson, Anne U. ;
Vedantam, Sailaja ;
Raychaudhuri, Soumya ;
Ferreira, Teresa ;
Wood, Andrew R. ;
Weyant, Robert J. ;
Segre, Ayellet V. ;
Speliotes, Elizabeth K. ;
Wheeler, Eleanor ;
Soranzo, Nicole ;
Park, Ju-Hyun ;
Yang, Jian ;
Gudbjartsson, Daniel ;
Heard-Costa, Nancy L. ;
Randall, Joshua C. ;
Qi, Lu ;
Smith, Albert Vernon ;
Maegi, Reedik ;
Pastinen, Tomi ;
Liang, Liming ;
Heid, Iris M. ;
Luan, Jian'an ;
Thorleifsson, Gudmar ;
Winkler, Thomas W. ;
Goddard, Michael E. ;
Lo, Ken Sin ;
Palmer, Cameron ;
Workalemahu, Tsegaselassie ;
Aulchenko, Yurii S. ;
Johansson, Asa ;
Zillikens, M. Carola ;
Feitosa, Mary F. ;
Esko, Tonu ;
Johnson, Toby ;
Ketkar, Shamika ;
Kraft, Peter ;
Mangino, Massimo ;
Prokopenko, Inga ;
Absher, Devin ;
Albrecht, Eva ;
Ernst, Florian ;
Glazer, Nicole L. ;
Hayward, Caroline .
NATURE, 2010, 467 (7317) :832-838
[3]   Genetic Mapping in Human Disease [J].
Altshuler, David ;
Daly, Mark J. ;
Lander, Eric S. .
SCIENCE, 2008, 322 (5903) :881-888
[4]   Genetic determinants of common epilepsies: a meta-analysis of genome-wide association studies [J].
Anney, Richard J. L. ;
Avbersek, Andreja ;
Balding, David ;
Baum, Larry ;
Becker, Felicitas ;
Berkovic, Samuel F. ;
Bradfield, Jonathan P. ;
Brody, Lawrence C. ;
Buono, Russell J. ;
Catarino, Claudia B. ;
Cavalleri, Gianpiero L. ;
Cherny, Stacey S. ;
Chinthapalli, Krishna ;
Coffey, Alison J. ;
Compston, Alastair ;
Cossette, Patrick ;
de Haan, Gerrit-Jan ;
De Jonghe, Peter ;
de Kovel, Carolien G. F. ;
Delanty, Norman ;
Depondt, Chantal ;
Dlugos, Dennis J. ;
Doherty, Colin P. ;
Elger, Christian E. ;
Ferraro, Thomas N. ;
Feucht, Martha ;
Franke, Andre ;
French, Jacqueline ;
Gaus, Verena ;
Goldstein, David B. ;
Gui, Hongsheng ;
Guo, Youling ;
Hakonarson, Hakon ;
Hallmann, Kerstin ;
Heinzen, Erin L. ;
Helbig, Ingo ;
Hjalgrim, Helle ;
Jackson, Margaret ;
Jamnadas-Khoda, Jennifer ;
Janz, Dieter ;
Johnson, Michael R. ;
Kaelviaeinen, Reetta ;
Kantanen, Anne-Mari ;
Kasperaviciute, Dalia ;
Trenite, Dorothee Kasteleijn-Nolst ;
Koeleman, Bobby P. C. ;
Kunz, Wolfram S. ;
Kwan, Patrick ;
Lau, Yu Lung ;
Lehesjoki, Anna-Elina .
LANCET NEUROLOGY, 2014, 13 (09) :893-903
[5]  
[Anonymous], BIORXIV
[6]  
[Anonymous], BIORXIV
[7]   The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans [J].
Ardlie, Kristin G. ;
DeLuca, David S. ;
Segre, Ayellet V. ;
Sullivan, Timothy J. ;
Young, Taylor R. ;
Gelfand, Ellen T. ;
Trowbridge, Casandra A. ;
Maller, Julian B. ;
Tukiainen, Taru ;
Lek, Monkol ;
Ward, Lucas D. ;
Kheradpour, Pouya ;
Iriarte, Benjamin ;
Meng, Yan ;
Palmer, Cameron D. ;
Esko, Tonu ;
Winckler, Wendy ;
Hirschhorn, Joel N. ;
Kellis, Manolis ;
MacArthur, Daniel G. ;
Getz, Gad ;
Shabalin, Andrey A. ;
Li, Gen ;
Zhou, Yi-Hui ;
Nobel, Andrew B. ;
Rusyn, Ivan ;
Wright, Fred A. ;
Lappalainen, Tuuli ;
Ferreira, Pedro G. ;
Ongen, Halit ;
Rivas, Manuel A. ;
Battle, Alexis ;
Mostafavi, Sara ;
Monlong, Jean ;
Sammeth, Michael ;
Mele, Marta ;
Reverter, Ferran ;
Goldmann, Jakob M. ;
Koller, Daphne ;
Guigo, Roderic ;
McCarthy, Mark I. ;
Dermitzakis, Emmanouil T. ;
Gamazon, Eric R. ;
Im, Hae Kyung ;
Konkashbaev, Anuar ;
Nicolae, Dan L. ;
Cox, Nancy J. ;
Flutre, Timothee ;
Wen, Xiaoquan ;
Stephens, Matthew .
SCIENCE, 2015, 348 (6235) :648-660
[8]  
Attias H, 1999, UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, P21
[9]   Regulation of chromatin by histone modifications [J].
Bannister, Andrew J. ;
Kouzarides, Tony .
CELL RESEARCH, 2011, 21 (03) :381-395
[10]   Genome-wide meta-analysis in alopecia areata resolves HLA associations and reveals two new susceptibility loci [J].
Betz, Regina C. ;
Petukhova, Lynn ;
Ripke, Stephan ;
Huang, Hailiang ;
Menelaou, Androniki ;
Redler, Silke ;
Becker, Tim ;
Heilmann, Stefanie ;
Yamany, Tarek ;
Duvic, Madeliene ;
Hordinsky, Maria ;
Norris, David ;
Price, Vera H. ;
Mackay-Wiggan, Julian ;
de Jong, Annemieke ;
DeStefano, Gina M. ;
Moebus, Susanne ;
Boehm, Markus ;
Blume-Peytavi, Ulrike ;
Wolff, Hans ;
Lutz, Gerhard ;
Kruse, Roland ;
Bian, Li ;
Amos, Christopher I. ;
Lee, Annette ;
Gregersen, Peter K. ;
Blaumeiser, Bettina ;
Altshuler, David ;
Clynes, Raphael ;
de Bakker, Paul I. W. ;
Noethen, Markus M. ;
Daly, Mark J. ;
Christiano, Angela M. .
NATURE COMMUNICATIONS, 2015, 6