Gene expression models based on transcription factor binding events confer insight into functional cis-regulatory variants

被引:11
|
作者
Shi, Wenqiang [1 ,2 ,3 ]
Fornes, Oriol [1 ]
Wasserman, Wyeth W. [1 ]
机构
[1] Univ British Columbia, Dept Med Genet, Ctr Mol Med & Therapeut, BC Childrens Hosp Res Inst, Vancouver, BC V5Z 4H4, Canada
[2] Univ British Columbia, Bioinformat Grad Program, Vancouver, BC V6T 1Z4, Canada
[3] Beijing Inst Microbiol & Epidemiol, Beijing 100071, Peoples R China
基金
加拿大自然科学与工程研究理事会; 美国国家卫生研究院; 加拿大健康研究院;
关键词
D O I
10.1093/bioinformatics/bty992
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Deciphering the functional roles of cis-regulatory variants is a critical challenge in genome analysis and interpretation. It has been hypothesized that altered transcription factor (TF) binding events are a central mechanism by which cis-regulatory variants impact gene expression levels. However, we lack a computational framework to understand and quantify such mechanistic contributions. Results We present TF2Exp, a gene-based framework to predict the impact of altered TF-binding events on gene expression levels. Using data from lymphoblastoid cell lines, TF2Exp models were applied successfully to predict the expression levels of 3196 genes. Alterations within DNase I hypersensitive, CTCF-bound and tissue-specific TF-bound regions were the greatest contributing features to the models. TF2Exp models performed as well as models based on common variants, both in cross-validation and external validation. Combining TF alteration and common variant features can further improve model performance. Unlike variant-based models, TF2Exp models have the unique advantage to evaluate the functional impact of variants in linkage disequilibrium and uncommon variants. We find that adding TF-binding events altered only by uncommon variants could increase the number of predictable genes (R-2 > 0.05). Taken together, TF2Exp represents a key step towards interpreting the functional roles of cis-regulatory variants in the human genome. Availability and implementation The code and model training results are publicly available at https://github.com/wqshi/TF2Exp. Supplementary information Supplementary data are available at Bioinformatics online.
引用
收藏
页码:2610 / 2617
页数:8
相关论文
共 50 条
  • [1] Cis-regulatory variants affect gene expression dynamics in yeast
    Shih, Ching-Hua
    Fay, Justin
    ELIFE, 2021, 10
  • [2] Mammalian evolution of human cis-regulatory elements and transcription factor binding sites
    Andrews, Gregory
    Fan, Kaili
    Pratt, Henry E.
    Phalke, Nishigandha
    Karlsson, Elinor K.
    Lindblad-Toh, Kerstin
    Gazal, Steven
    Moore, Jill E.
    Weng, Zhiping
    SCIENCE, 2023, 380 (6643)
  • [3] A cis-Regulatory Signature in Ascidians and Flies, Independent of Transcription Factor Binding Sites
    Khoueiry, Pierre
    Rothbacher, Ute
    Ohtsuka, Yukio
    Daian, Fabrice
    Frangulian, Eric
    Roure, Agnes
    Dubchak, Irina
    Lemaire, Patrick
    CURRENT BIOLOGY, 2010, 20 (09) : 792 - 802
  • [4] Evolutionary Potential of Cis-Regulatory Mutations to Cause Rapid Changes in Transcription Factor Binding
    Kurafeiski, Jasmin D.
    Pinto, Paulo
    Bornberg-Bauer, Erich
    GENOME BIOLOGY AND EVOLUTION, 2019, 11 (02): : 406 - 414
  • [5] Uncovering cis-regulatory sequence requirements for context-specific transcription factor binding
    Yanez-Cuna, J. Omar
    Dinh, Huy Q.
    Kvon, Evgeny Z.
    Shlyueva, Daria
    Stark, Alexander
    GENOME RESEARCH, 2012, 22 (10) : 2018 - 2030
  • [6] Is Transcription Factor Binding Site Turnover a Sufficient Explanation for Cis-Regulatory Sequence Divergence?
    Venkataram, Sandeep
    Fay, Justin C.
    GENOME BIOLOGY AND EVOLUTION, 2010, 2 : 851 - 858
  • [7] Systematic identification of cis-regulatory variants that cause gene expression differences in a yeast cross
    Renganaath, Kaushik
    Cheung, Rocky
    Day, Laura
    Kosuri, Sriram
    Kruglyak, Leonid
    Albert, Frank W.
    ELIFE, 2020, 9 : 1 - 35
  • [8] A map of cis-regulatory modules and constituent transcription factor binding sites in 80% of the mouse genome
    Ni, Pengyu
    Wilson, David
    Su, Zhengchang
    BMC GENOMICS, 2022, 23 (01)
  • [9] REDfly 2.0:: an integrated database of cis-regulatory modules and transcription factor binding sites in Drosophila
    Halfon, Marc S.
    Gallo, Steven M.
    Bergman, Casey M.
    NUCLEIC ACIDS RESEARCH, 2008, 36 : D594 - D598
  • [10] Using RSAT to scan genome sequences for transcription factor binding sites and cis-regulatory modules
    Turatsinze, Jean-Valery
    Thomas-Chollier, Morgane
    Defrance, Matthieu
    van Helden, Jacques
    NATURE PROTOCOLS, 2008, 3 (10) : 1578 - 1588