Accurate eQTL prioritization with an ensemble-based framework

被引:13
作者
Zeng, Haoyang [1 ]
Edwards, Matthew D. [1 ]
Guo, Yuchun [1 ]
Gifford, David K. [1 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
bioinformatics; eQTL analysis; genetics; machine learning; variation; GENOME-WIDE ASSOCIATION; VARIANTS; ENHANCERS; TRAITS;
D O I
10.1002/humu.23198
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We present a novel ensemble-based computational framework, EnsembleExpr, that achieved the best performance in the Fourth Critical Assessment of Genome Interpretation expression quantitative trait locus (eQTL)-causal SNPs challenge for identifying eQTLs and prioritizing their gene expression effects. eQTLs are genome sequence variants that result in gene expression changes and are thus prime suspects in the search for contributions to the causality of complex traits. When EnsembleExpr is trained on data from massively parallel reporter assays, it accurately predicts reporter expression levels from unseen regulatory sequences and identifies sequence variants that exhibit significant changes in reporter expression. Compared with other state-of-the-art methods, EnsembleExpr achieved competitive performance when applied on eQTL datasets determined by other protocols. We envision EnsembleExpr to be a resource to help interpret noncoding regulatory variants and prioritize disease-associated mutations for downstream validation.
引用
收藏
页码:1259 / 1265
页数:7
相关论文
共 21 条
[1]   Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning [J].
Alipanahi, Babak ;
Delong, Andrew ;
Weirauch, Matthew T. ;
Frey, Brendan J. .
NATURE BIOTECHNOLOGY, 2015, 33 (08) :831-+
[2]  
[Anonymous], 2012, Nature
[3]   Mapping complex disease traits with global gene expression [J].
Cookson, William ;
Liang, Liming ;
Abecasis, Goncalo ;
Moffatt, Miriam ;
Lathrop, Mark .
NATURE REVIEWS GENETICS, 2009, 10 (03) :184-194
[4]   Histone H3K27ac separates active from poised enhancers and predicts developmental state [J].
Creyghton, Menno P. ;
Cheng, Albert W. ;
Welstead, G. Grant ;
Kooistra, Tristan ;
Carey, Bryce W. ;
Steine, Eveline J. ;
Hanna, Jacob ;
Lodato, Michael A. ;
Frampton, Garrett M. ;
Sharp, Phillip A. ;
Boyer, Laurie A. ;
Young, Richard A. ;
Jaenisch, Rudolf .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (50) :21931-21936
[5]   ChromHMM: automating chromatin-state discovery and characterization [J].
Ernst, Jason ;
Kellis, Manolis .
NATURE METHODS, 2012, 9 (03) :215-216
[6]   Human genetic variation and its contribution to complex traits [J].
Frazer, Kelly A. ;
Murray, Sarah S. ;
Schork, Nicholas J. ;
Topol, Eric J. .
NATURE REVIEWS GENETICS, 2009, 10 (04) :241-251
[7]   FunSeq2: a framework for prioritizing noncoding regulatory variants in cancer [J].
Fu, Yao ;
Liu, Zhu ;
Lou, Shaoke ;
Bedford, Jason ;
Mu, Xinmeng Jasmine ;
Yip, Kevin Y. ;
Khurana, Ekta ;
Gerstein, Mark .
GENOME BIOLOGY, 2014, 15 (10) :480
[8]   Potential etiologic and functional implications of genome-wide association loci for human diseases and traits [J].
Hindorff, Lucia A. ;
Sethupathy, Praveen ;
Junkins, Heather A. ;
Ramos, Erin M. ;
Mehta, Jayashri P. ;
Collins, Francis S. ;
Manolio, Teri A. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (23) :9362-9367
[9]   A general framework for estimating the relative pathogenicity of human genetic variants [J].
Kircher, Martin ;
Witten, Daniela M. ;
Jain, Preti ;
O'Roak, Brian J. ;
Cooper, Gregory M. ;
Shendure, Jay .
NATURE GENETICS, 2014, 46 (03) :310-+
[10]   Integrative analysis of 111 reference human epigenomes [J].
Kundaje, Anshul ;
Meuleman, Wouter ;
Ernst, Jason ;
Bilenky, Misha ;
Yen, Angela ;
Heravi-Moussavi, Alireza ;
Kheradpour, Pouya ;
Zhang, Zhizhuo ;
Wang, Jianrong ;
Ziller, Michael J. ;
Amin, Viren ;
Whitaker, John W. ;
Schultz, Matthew D. ;
Ward, Lucas D. ;
Sarkar, Abhishek ;
Quon, Gerald ;
Sandstrom, Richard S. ;
Eaton, Matthew L. ;
Wu, Yi-Chieh ;
Pfenning, Andreas R. ;
Wang, Xinchen ;
Claussnitzer, Melina ;
Liu, Yaping ;
Coarfa, Cristian ;
Harris, R. Alan ;
Shoresh, Noam ;
Epstein, Charles B. ;
Gjoneska, Elizabeta ;
Leung, Danny ;
Xie, Wei ;
Hawkins, R. David ;
Lister, Ryan ;
Hong, Chibo ;
Gascard, Philippe ;
Mungall, Andrew J. ;
Moore, Richard ;
Chuah, Eric ;
Tam, Angela ;
Canfield, Theresa K. ;
Hansen, R. Scott ;
Kaul, Rajinder ;
Sabo, Peter J. ;
Bansal, Mukul S. ;
Carles, Annaick ;
Dixon, Jesse R. ;
Farh, Kai-How ;
Feizi, Soheil ;
Karlic, Rosa ;
Kim, Ah-Ram ;
Kulkarni, Ashwinikumar .
NATURE, 2015, 518 (7539) :317-330