Combining Sequence and Epigenomic Data to Predict Transcription Factor Binding Sites Using Deep Learning

被引:1
|
作者
Jing, Fang [1 ]
Zhang, Shao-Wu [1 ]
Cao, Zhen [2 ]
Zhang, Shihua l [2 ,3 ]
机构
[1] Northwestern Polytech Univ, Coll Automat, Minist Educ, Key Lab Informat Fusion Technol, Xian 710072, Shaanxi, Peoples R China
[2] Chinese Acad Sci, Acad Math & Syst Sci, NCMIS, CEMS,RCSDS, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Math Sci, Beijing 100049, Peoples R China
来源
BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2018 | 2018年 / 10847卷
基金
中国国家自然科学基金;
关键词
Bioinformatics; Machine learning; Transcription factors binding sites; Convolutional neural networks; DNA accessibility; Histone modification; CHROMATIN ACCESSIBILITY PREDICTION; NETWORKS;
D O I
10.1007/978-3-319-94968-0_23
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Knowing the transcription factor binding sites (TFBSs) is essential for modeling the underlying binding mechanisms and follow-up cellular functions. Convolutional neural networks (CNNs) have outperformed methods in predicting TFBSs from the primary DNA sequence. In addition to DNA sequences, histone modifications and chromatin accessibility are also important factors influencing their activity. They have been explored to predict TFBSs recently. However, current methods rarely take into account histone modifications and chromatin accessibility using CNN in an integrative framework. To this end, we developed a general CNN model to integrate these data for predicting TFBSs. We systematically benchmarked a series of architecture variants by changing network structure in terms of width and depth, and explored the effects of sample length at flanking regions. We evaluated the performance of the three types of data and their combinations using 256 ChIP-seq experiments and also compared it with competing machine learning methods. We find that contributions from these three types of data are complementary to each other. Moreover, the integrative CNN framework is superior to traditional machine learning methods with significant improvements.
引用
收藏
页码:241 / 252
页数:12
相关论文
共 50 条
  • [21] Evolutionary couplings and sequence variation effect predict protein binding sites
    Schelling, Maria
    Hopf, Thomas A.
    Rost, Burkhard
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2018, 86 (10) : 1064 - 1074
  • [22] Using deep learning to predict soil properties from regional spectral data
    Padarian, J.
    Minasny, B.
    McBratney, A. B.
    GEODERMA REGIONAL, 2019, 16
  • [23] A Novel Computational Approach To Predict Transcription Factor DNA Binding Preference
    Cai, Yudong
    He, JianFeng
    Li, XinLei
    Lu, Lin
    Yang, XinYi
    Feng, KayYan
    Lu, WenCong
    Kong, XiangYin
    JOURNAL OF PROTEOME RESEARCH, 2009, 8 (02) : 999 - 1003
  • [24] TF-COMB - Discovering grammar of transcription factor binding sites
    Bentsen, Mette
    Heger, Vanessa
    Schultheis, Hendrik
    Kuenne, Carsten
    Looso, Mario
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 4040 - 4051
  • [25] COMBINING DEEP LEARNING AND NUMERICAL SIMULATION TO PREDICT FLOOD INUNDATION DEPTH
    Adriano, Bruno
    Yokoya, Naoto
    Yamanoi, Kazuki
    Oishi, Satoru
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 1154 - 1157
  • [26] Learning epistatic interactions from sequence-activity data to predict enantioselectivity
    Julian Zaugg
    Yosephine Gumulya
    Alpeshkumar K. Malde
    Mikael Bodén
    Journal of Computer-Aided Molecular Design, 2017, 31 : 1085 - 1096
  • [27] Learning epistatic interactions from sequence-activity data to predict enantioselectivity
    Zaugg, Julian
    Gumulya, Yosephine
    Malde, Alpeshkumar K.
    Boden, Mikael
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2017, 31 (12) : 1085 - 1096
  • [28] Distracted driver detection by combining in-vehicle and image data using deep learning
    Omerustaoglu, Furkan
    Sakar, C. Okan
    Kar, Gorkem
    APPLIED SOFT COMPUTING, 2020, 96
  • [29] BERT-TFBS: a novel BERT-based model for predicting transcription factor binding sites by transfer learning
    Wang, Kai
    Zeng, Xuan
    Zhou, Jingwen
    Liu, Fei
    Luan, Xiaoli
    Wang, Xinglong
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)
  • [30] MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data
    Albaradei, Somayah
    Albaradei, Abdurhman
    Alsaedi, Asim
    Uludag, Mahmut
    Thafar, Maha A.
    Gojobori, Takashi
    Essack, Magbubah
    Gao, Xin
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2022, 9