Computational prediction and characterization of cell-type-specific and shared binding sites

被引:7
|
作者
Zhang, Qinhu [1 ,2 ]
Teng, Pengrui [3 ]
Wang, Siguo [4 ]
He, Ying [4 ]
Cui, Zhen [4 ]
Guo, Zhenghao [4 ]
Liu, Yixin [5 ]
Yuan, Changan [6 ]
Liu, Qi [1 ,2 ]
Huang, De-Shuang [7 ]
机构
[1] Tongji Univ, Translat Med Ctr Stem Cell Therapy, Shanghai 200092, Peoples R China
[2] Tongji Univ, Shanghai East Hosp, Inst Regenerat Med, Sch Life Sci & Technol,Bioinformat Dept, Shanghai 200092, Peoples R China
[3] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Peoples R China
[4] Tongji Univ, Inst Machine Learning & Syst Biol, Sch Elect & Informat Engn, Shanghai 201804, Peoples R China
[5] Univ Shanghai Sci & Technol, Sch Hlth Sci & Engn, Shanghai 200093, Peoples R China
[6] Guangxi Acad Sci, Big Data & Intelligent Comp Res Ctr, Nanning 530007, Peoples R China
[7] EIT Inst Adv Study, Ningbo 315201, Zhejiang, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
CHIP-SEQ; DNA; SEQUENCE; REVEALS;
D O I
10.1093/bioinformatics/btac798
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Cell-type-specific gene expression is maintained in large part by transcription factors (TFs) selectively binding to distinct sets of sites in different cell types. Recent research works have provided evidence that such cell-type-specific binding is determined by TF's intrinsic sequence preferences, cooperative interactions with co-factors, cell-type-specific chromatin landscapes and 3D chromatin interactions. However, computational prediction and characterization of cell-type-specific and shared binding sites is rarely studied. Results: In this article, we propose two computational approaches for predicting and characterizing cell-type-specific and shared binding sites by integrating multiple types of features, in which one is based on XGBoost and another is based on convolutional neural network (CNN). To validate the performance of our proposed approaches, ChIP-seq datasets of 10 binding factors were collected from the GM12878 (lymphoblastoid) and K562 (erythroleukemic) human hematopoietic cell lines, each of which was further categorized into cell-type-specific (GM12878- and K562-specific) and shared binding sites. Then, multiple types of features for these binding sites were integrated to train the XGBoost- and CNN-based models. Experimental results show that our proposed approaches significantly outperform other competing methods on three classification tasks. Moreover, we identified independent feature contributions for cell-type-specific and shared sites through SHAP values and explored the ability of the CNN-based model to predict cell-type-specific and shared binding sites by excluding or including DNase signals. Furthermore, we investigated the generalization ability of our proposed approaches to different binding factors in the same cellular environment.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Modelling epigenetic regulation of gene expression in 12 human cell types reveals combinatorial patterns of cell-type-specific genes
    Lu, Yiming
    Qu, Wubin
    Min, Bo
    Liu, Zheyan
    Chen, Changsheng
    Zhang, Chenggang
    IET SYSTEMS BIOLOGY, 2014, 8 (03) : 104 - 115
  • [32] Cell-type-specific nuclei purification from whole animals for genome-wide expression and chromatin profiling
    Steiner, Florian A.
    Talbert, Paul B.
    Kasinathan, Sivakanthan
    Deal, Roger B.
    Henikoff, Steven
    GENOME RESEARCH, 2012, 22 (04) : 766 - 777
  • [33] Monitoring Cell-Type-Specific Gene Expression Using Ribosome Profiling In Vivo During Cardiac Hemodynamic Stress
    Doroudgar, Shirin
    Hofmann, Christoph
    Boileau, Etienne
    Malone, Brandon
    Riechert, Eva
    Gorska, Agnieszka A.
    Jakobi, Tobias
    Sandmann, Clara
    Juergensen, Lonny
    Kmietczyk, Vivien
    Malovrh, Ellen
    Burghaus, Jana
    Rettel, Mandy
    Stein, Frank
    Younesi, Fereshteh
    Friedrich, Ulrike A.
    Mauz, Victoria
    Backs, Johannes
    Kramer, Guenter
    Katus, Hugo A.
    Dieterich, Christoph
    Voelkers, Mirko
    CIRCULATION RESEARCH, 2019, 125 (04) : 431 - 448
  • [34] Receptor-Mediated Delivery of CRISPR-Cas9 Endonuclease for Cell-Type-Specific Gene Editing
    Rouet, Romain
    Thuma, Benjamin A.
    Roy, Marc D.
    Lintner, Nathanael G.
    Rubitski, David M.
    Finley, James E.
    Wisniewska, Hanna M.
    Mendonsa, Rima
    Hirsh, Ariana
    de Onate, Lorena
    Barron, Joan Compte
    McLellan, Thomas J.
    Bellenger, Justin
    Feng, Xidong
    Varghese, Alison
    Chrunyk, Boris A.
    Borzilleri, Kris
    Hesp, Kevin D.
    Zhou, Kaihong
    Ma, Nannan
    Tu, Meihua
    Dullea, Robert
    McClure, Kim F.
    Wilson, Ross C.
    Liras, Spiros
    Mascitti, Vincent
    Doudna, Jennifer A.
    JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2018, 140 (21) : 6596 - 6603
  • [35] Single-nucleus analysis of accessible chromatin in developing mouse forebrain reveals cell-type-specific transcriptional regulation
    Preissl, Sebastian
    Fang, Rongxin
    Huang, Hui
    Zhao, Yuan
    Raviram, Ramya
    Gorkin, David U.
    Zhang, Yanxiao
    Sos, Brandon C.
    Afzal, Veena
    Dickel, Diane E.
    Kuan, Samantha
    Visel, Axel
    Pennacchio, Len A.
    Zhang, Kun
    Ren, Bing
    NATURE NEUROSCIENCE, 2018, 21 (03) : 432 - +
  • [36] Computational deconvolution: extracting cell type-specific information from heterogeneous samples
    Shen-Orr, Shai S.
    Gaujoux, Renaud
    CURRENT OPINION IN IMMUNOLOGY, 2013, 25 (05) : 571 - 578
  • [37] Cell-type-specific Jumonji histone demethylase gene expression in the healthy rat CNS: detection by a novel flow cytometry method
    Smith, Stephanie M. C.
    Kimyon, Rebecca S.
    Watters, Jyoti J.
    ASN NEURO, 2014, 6 (03): : 193 - 207
  • [38] Identifying transcription factors with cell-type specific DNA binding signatures
    Awdeh, Aseel
    Turcotte, Marcel
    Perkins, Theodore J.
    BMC GENOMICS, 2024, 25 (01):
  • [39] Prediction of mono- and di-nucleotide-specific DNA-binding sites in proteins using neural networks
    Andrabi, Munazah
    Mizuguchi, Kenji
    Sarai, Akinori
    Ahmad, Shandar
    BMC STRUCTURAL BIOLOGY, 2009, 9
  • [40] Identification of universal and cell-type specific p53 DNA binding
    Hafner, Antonina
    Kublo, Lyubov
    Tsabar, Michael
    Lahav, Galit
    Stewart-Ornstein, Jacob
    BMC MOLECULAR AND CELL BIOLOGY, 2020, 21 (01)