Cross-Species Prediction of Transcription Factor Binding by Adversarial Training of a Novel Nucleotide-Level Deep Neural Network

被引:1
|
作者
Zhang, Qinhu [1 ,2 ,3 ]
Wang, Siguo [1 ]
Li, Zhipeng [1 ]
Pan, Yijie [1 ]
Huang, De-Shuang [1 ,4 ]
机构
[1] Ningbo Inst Digital Twin, Eastern Inst Technol, Ningbo 315201, Peoples R China
[2] Univ Sci & Technol China, Div Life Sci & Med, Hefei 230021, Peoples R China
[3] Guangxi Acad Sci, Big Data & Intelligent Comp Res Ctr, Nanning 530007, Peoples R China
[4] Tongji Univ, Shanghai East Hosp, Inst Regenerat Med, Shanghai 200092, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
adversarial training; cross-species transcription factor binding prediction; nucleotide-level models; sequence-level models; DNA; DATABASE; GENOME; MOUSE;
D O I
10.1002/advs.202405685
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Cross-species prediction of TF binding remains a major challenge due to the rapid evolutionary turnover of individual TF binding sites, resulting in cross-species predictive performance being consistently worse than within-species performance. In this study, a novel Nucleotide-Level Deep Neural Network (NLDNN) is first proposed to predict TF binding within or across species. NLDNN regards the task of TF binding prediction as a nucleotide-level regression task, which takes DNA sequences as input and directly predicts experimental coverage values. Beyond predictive performance, it also assesses model performance by locating potential TF binding regions, discriminating TF-specific single-nucleotide polymorphisms (SNPs), and identifying causal disease-associated SNPs. The experimental results show that NLDNN outperforms the competing methods in these tasks. Then, a dual-path framework is designed for adversarial training of NLDNN to further improve the cross-species prediction performance by pulling the domain space of human and mouse species closer. Through comparison and analysis, it finds that adversarial training not only can improve the cross-species prediction performance between humans and mice but also enhance the ability to locate TF binding regions and discriminate TF-specific SNPs. By visualizing the predictions, it is figured out that the framework corrects some mispredictions by amplifying the coverage values of incorrectly predicted peaks. This work proposes a Nucleotide-Level Deep Neural Network (NLDNN) to study transcription factor binding within or across species from the perspective of nucleotide resolution. Moreover, this work designs a dual-path framework for adversarial training of NLDNN to further improve the cross-species prediction performance by pulling the domain space of human and mouse species closer. image
引用
收藏
页数:17
相关论文
共 10 条
  • [1] Domain-adaptive neural networks improve cross-species prediction of transcription factor binding
    Cochran, Kelly
    Srivastava, Divyanshi
    Shrikumar, Avanti
    Balsubramani, Akshay
    Hardison, Ross C.
    Kundaje, Anshul
    Mahony, Shaun
    GENOME RESEARCH, 2022, 32 (03) : 512 - 523
  • [2] PhyloScan: identification of transcription factor binding sites using cross-species evidence
    Carmack, C. Steven
    McCue, Lee Ann
    Newberg, Lee A.
    Lawrence, Charles E.
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2007, 2
  • [3] Robust Transcription Factor Binding Site Prediction Using Deep Neural Networks
    Geete, Kanu
    Pandey, Manish
    CURRENT BIOINFORMATICS, 2020, 15 (10) : 1137 - 1152
  • [4] Prediction of Transcription Factor Binding Sites With an Attention Augmented Convolutional Neural Network
    Jing, Fang
    Zhang, Shao-Wu
    Zhang, Shihua
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3614 - 3623
  • [5] A Novel Adversarial Training Scheme for Deep Neural Network based Speech Enhancement
    Cornell, Samuele
    Principi, Emanuele
    Squartini, Stefano
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [6] Cross-Cell-Type Prediction of TF-Binding Site by Integrating Convolutional Neural Network and Adversarial Network
    Lan, Gongqiang
    Zhou, Jiyun
    Xu, Ruifeng
    Lu, Qin
    Wang, Hongpeng
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (14)
  • [7] GNet: An integrated context-aware neural framework for transcription factor binding signal at single nucleotide resolution prediction
    Zhuang, Jujuan
    Feng, Kexin
    Teng, Xinyang
    Jia, Cangzhi
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (09) : 15809 - 15829
  • [8] DeepGRN: prediction of transcription factor binding site across cell-types using attention-based deep neural networks
    Chen, Chen
    Hou, Jie
    Shi, Xiaowen
    Yang, Hua
    Birchler, James A.
    Cheng, Jianlin
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [9] DeepTF: Accurate Prediction of Transcription Factor Binding Sites by Combining Multi-scale Convolution and Long Short-Term Memory Neural Network
    Bao, Xiao-Rong
    Zhu, Yi-Heng
    Yu, Dong-Jun
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING, PT II, 2019, 11936 : 126 - 138
  • [10] Using k-mer embeddings learned from a Skip-gram based neural network for building a cross-species DNA N6-methyladenine site prediction model
    Trinh Trung Duong Nguyen
    Van Ngu Trinh
    Nguyen Quoc Khanh Le
    Ou, Yu-Yen
    PLANT MOLECULAR BIOLOGY, 2021, 107 (06) : 533 - 542