Predicting functional variants in enhancer and promoter elements using RegulomeDB

Cited by: 72
Authors
Dong, Shengcheng [1]
Boyle, Alan P. [1, 2]
Affiliations
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Human Genet, Ann Arbor, MI 48109 USA
Keywords
functional genomics; gene regulation; machine learning; MPRA; variation; REGULATORY VARIANTS; DNA; SPECIFICITIES; RESOURCE; DATABASE;
DOI
10.1002/humu.23791
Chinese Library Classification (CLC) number
Q3 [Genetics];
Subject classification code
071007; 090102;
Abstract
Here we present a computational model, Score of Unified Regulatory Features (SURF), that predicts functional variants in enhancer and promoter elements. SURF is trained on data from massively parallel reporter assays and predicts the effect of variants on reporter expression levels. It achieved the top performance in the Fifth Critical Assessment of Genome Interpretation "Regulation Saturation" challenge. We also show that features queried through RegulomeDB, which are direct annotations from functional genomics data, help improve prediction accuracy beyond transfer learning features from DNA sequence-based deep learning models. Some of the most important features include DNase footprints, especially when coupled with complementary ChIP-seq data. Furthermore, we found that our model achieved good performance in predicting allele-specific transcription factor binding events. As an extension to the current scoring system in RegulomeDB, we expect our computational model to prioritize variants in regulatory regions, thus helping to understand functional variants in noncoding regions that lead to disease.
Pages: 1292-1298
Page count: 7
Related papers
29 in total
  • [1] Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning
    Alipanahi, Babak
    Delong, Andrew
    Weirauch, Matthew T.
    Frey, Brendan J.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (08) : 831 - +
  • [2] Annotation of functional variation in personal genomes using RegulomeDB
    Boyle, Alan P.
    Hong, Eurie L.
    Hariharan, Manoj
    Cheng, Yong
    Schaub, Marc A.
    Kasowski, Maya
    Karczewski, Konrad J.
    Park, Julie
    Hitz, Benjamin C.
    Weng, Shuai
    Cherry, J. Michael
    Snyder, Michael
    [J]. GENOME RESEARCH, 2012, 22 (09) : 1790 - 1797
  • [3] High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells
    Boyle, Alan P.
    Song, Lingyun
    Lee, Bum-Kyu
    London, Darin
    Keefe, Damian
    Birney, Ewan
    Iyer, Vishwanath R.
    Crawford, Gregory E.
    Furey, Terrence S.
    [J]. GENOME RESEARCH, 2011, 21 (03) : 456 - 464
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update
    Bryne, Jan Christian
    Valen, Eivind
    Tang, Man-Hung Eric
    Marstrand, Troels
    Winther, Ole
    da Piedade, Isabelle
    Krogh, Anders
    Lenhard, Boris
    Sandelin, Albin
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D102 - D106
  • [6] A uniform survey of allele-specific binding and expression over 1000-Genomes-Project individuals
    Chen, Jieming
    Rozowsky, Joel
    Galeev, Timur R.
    Harmanci, Arif
    Kitchen, Robert
    Bedford, Jason
    Abyzov, Alexej
    Kong, Yong
    Regan, Lynne
    Gerstein, Mark
    [J]. NATURE COMMUNICATIONS, 2016, 7
  • [7] Potential etiologic and functional implications of genome-wide association loci for human diseases and traits
    Hindorff, Lucia A.
    Sethupathy, Praveen
    Junkins, Heather A.
    Ramos, Erin M.
    Mehta, Jayashri P.
    Collins, Francis S.
    Manolio, Teri A.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (23) : 9362 - 9367
  • [8] Super-Enhancers in the Control of Cell Identity and Disease
    Hnisz, Denes
    Abraham, Brian J.
    Lee, Tong Ihn
    Lau, Ashley
    Saint-Andre, Violaine
    Sigova, Alla A.
    Hoke, Heather A.
    Young, Richard A.
    [J]. CELL, 2013, 155 (04) : 934 - 947
  • [9] Decoding enhancers using massively parallel reporter assays
    Inoue, Fumitaka
    Ahituv, Nadav
    [J]. GENOMICS, 2015, 106 (03) : 159 - 164
  • [10] DNA-Binding Specificities of Human Transcription Factors
    Jolma, Arttu
    Yan, Jian
    Whitington, Thomas
    Toivonen, Jarkko
    Nitta, Kazuhiro R.
    Rastas, Pasi
    Morgunova, Ekaterina
    Enge, Martin
    Taipale, Mikko
    Wei, Gonghong
    Palin, Kimmo
    Vaquerizas, Juan M.
    Vincentelli, Renaud
    Luscombe, Nicholas M.
    Hughes, Timothy R.
    Lemaire, Patrick
    Ukkonen, Esko
    Kivioja, Teemu
    Taipale, Jussi
    [J]. CELL, 2013, 152 (1-2) : 327 - 339