A Deep Learning Model for RNA-Protein Binding Preference Prediction Based on Hierarchical LSTM and Attention Network

被引:35
作者
Shen, Zhen [1 ]
Zhang, Qinhu [1 ]
Han, Kyungsook [2 ]
Huang, De-Shuang [1 ]
机构
[1] Tongji Univ, Sch Elect & Informat Engn, Inst Machine Learning & Syst Biol, 4800 Caoan Rd, Shanghai 201804, Peoples R China
[2] Inha Univ, Sch Comp Sci & Engn, Incheon, South Korea
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Supply chains; Cloud computing; Dynamic scheduling; Task analysis; Manufacturing systems; K-mer embedding; attention mechanism; bidirectional LSTM; RNA-protein binding preference; GENE-EXPRESSION; NEURAL-NETWORKS; MATRIX FACTORIZATION; CLASSIFICATION; OPTIMIZATION; METHODOLOGY; MECHANISMS; PACKAGE; SHAPES;
D O I
10.1109/TCBB.2020.3007544
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Attention mechanism has the ability to find important information in the sequence. The regions of the RNA sequence that can bind to proteins are more important than those that cannot bind to proteins. Neither conventional methods nor deep learning-based methods, they are not good at learning this information. In this study, LSTM is used to extract the correlation features between different sites in RNA sequence. We also use attention mechanism to evaluate the importance of different sites in RNA sequence. We get the optimal combination of k-mer length, k-mer stride window, k-mer sentence length, k-mer sentence stride window, and optimization function through hyper-parm experiments. The results show that the performance of our method is better than other methods. We tested the effects of changes in k-mer vector length on model performance. We show model performance changes under various k-mer related parameter settings. Furthermore, we investigate the effect of attention mechanism and RNA structure data on model performance.
引用
收藏
页码:753 / 762
页数:10
相关论文
共 60 条
  • [1] Adeel A., 2014, BMC PLANT BIOL, P1, DOI DOI 10.1186/s12870-014-0327-y
  • [2] Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning
    Alipanahi, Babak
    Delong, Andrew
    Weirauch, Matthew T.
    Frey, Brendan J.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (08) : 831 - +
  • [3] Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
    Anderson, Peter
    He, Xiaodong
    Buehler, Chris
    Teney, Damien
    Johnson, Mark
    Gould, Stephen
    Zhang, Lei
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6077 - 6086
  • [4] Ba J, 2018, ARXIV14127755
  • [5] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
  • [6] Origins and Mechanisms of miRNAs and siRNAs
    Carthew, Richard W.
    Sontheimer, Erik J.
    [J]. CELL, 2009, 136 (04) : 642 - 655
  • [7] Chan W., 2017, U. S. Patent, Patent No. [9,799,327, 9799327]
  • [8] Cho K, 2014, ARXIV14061078, P1724
  • [9] Predicting Hub Genes Associated with Cervical Cancer through Gene Co-Expression Networks
    Deng, Su-Ping
    Zhu, Lin
    Huang, De-Shuang
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (01) : 27 - 35
  • [10] Mining the bladder cancer-associated genes by an integrated strategy for the construction and analysis of differential co-expression networks
    Deng, Su-Ping
    Zhu, Lin
    Huang, De-Shuang
    [J]. BMC GENOMICS, 2015, 16