Efficient Framework for Predicting ncRNA-Protein Interactions Based on Sequence Information by Deep Learning

被引:1
作者
Zhan, Zhao-Hui [1 ]
You, Zhu-Hong [2 ]
Zhou, Yong [1 ]
Li, Li-Ping [2 ]
Li, Zheng-Wei [1 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 21116, Jiangsu, Peoples R China
[2] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
来源
INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT II | 2018年 / 10955卷
关键词
Protein-ncRNA interaction; Bi-gram; Deep learning; Stacked autoencoder; PSSM; LONG NONCODING RNAS; ACCURATE PREDICTION; AMINO-ACIDS; DATABASE; ROBUST;
D O I
10.1007/978-3-319-95933-7_41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The interactions between proteins and RNA (RPIs) play a crucial role in most cellular processes such as RNA stability and translation. Although there have been many high-throughput experiments recently to detect RPIs, these experiments are largely time-consuming and labor-intensive. Therefore, it is imminent to propose an efficient computational method to predict RPIs. In this study, we put forward a novel approach for predicting protein and ncRNA interactions based on sequences information only. By employing the bi-gram probability feature extraction method and k-mer algorithm, the represent features from protein and ncRNA were extracted. To evaluate the performance of the proposed model, two widely used datasets named RPI1807 and RPI2241 were trained with the adoption of random forest classifier by using five-fold cross-validation. The experimental results with the AUC of 0.992 and 0.947 on dataset RPI1807 and RPI2241 respectively indicated the effectiveness of our experimental approach for predicting RPIs, which provided the guidance for reference for future research in the biological field.
引用
收藏
页码:337 / 344
页数:8
相关论文
共 50 条
  • [1] Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning
    Alipanahi, Babak
    Delong, Andrew
    Weirauch, Matthew T.
    Frey, Brendan J.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (08) : 831 - +
  • [2] Identification of self-interacting proteins by exploring evolutionary information embedded in PSI-BLAST-constructed position specific scoring matrix
    An, Ji-Yong
    You, Zhu-Hong
    Chen, Xing
    Huang, De-Shuang
    Li, Zheng-Wei
    Liu, Gang
    Wang, Yin
    [J]. ONCOTARGET, 2016, 7 (50) : 82440 - 82449
  • [3] Robust and accurate prediction of protein self-interactions from amino acids sequence using evolutionary information
    An, Ji-Yong
    You, Zhu-Hong
    Chen, Xing
    Huang, De-Shuang
    Yan, Guiying
    Wang, Da-Fu
    [J]. MOLECULAR BIOSYSTEMS, 2016, 12 (12) : 3702 - 3710
  • [4] RVMAB: Using the Relevance Vector Machine Model Combined with Average Blocks to Predict the Interactions of Proteins from Protein Sequences
    An, Ji-Yong
    You, Zhu-Hong
    Meng, Fan-Rong
    Xu, Shu-Juan
    Wang, Yin
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2016, 17 (05)
  • [5] Using the Relevance Vector Machine Model Combined with Local Phase Quantization to Predict Protein-Protein Interactions from Protein Sequences
    An, Ji-Yong
    Meng, Fan-Rong
    You, Zhu-Hong
    Fang, Yu-Hong
    Zhao, Yu-Jun
    Zhang, Ming
    [J]. BIOMED RESEARCH INTERNATIONAL, 2016, 2016
  • [6] The BioGRID interaction database: 2015 update
    Chatr-aryamontri, Andrew
    Breitkreutz, Bobby-Joe
    Oughtred, Rose
    Boucher, Lorrie
    Heinicke, Sven
    Chen, Daici
    Stark, Chris
    Breitkreutz, Ashton
    Kolas, Nadine
    O'Donnell, Lara
    Reguly, Teresa
    Nixon, Julie
    Ramage, Lindsay
    Winter, Andrew
    Sellam, Adnane
    Chang, Christie
    Hirschman, Jodi
    Theesfeld, Chandra
    Rust, Jennifer
    Livstone, Michael S.
    Dolinski, Kara
    Tyers, Mike
    [J]. NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) : D470 - D478
  • [7] Chen H, 2015, 9 INT C P2P PAR GRID, P333
  • [8] Long non-coding RNAs and complex diseases: from experimental results to computational models
    Chen, Xing
    Yan, Chenggang Clarence
    Zhang, Xu
    You, Zhu-Hong
    [J]. BRIEFINGS IN BIOINFORMATICS, 2017, 18 (04) : 558 - 576
  • [9] FMLNCSIM: fuzzy measure-based lncRNA functional similarity calculation model
    Chen, Xing
    Huang, Yu-An
    Wang, Xue-Song
    You, Zhu-Hong
    Chan, Keith C. C.
    [J]. ONCOTARGET, 2016, 7 (29) : 45948 - 45958
  • [10] IRWRLDA: improved random walk with restart for lncRNA-disease association prediction
    Chen, Xing
    You, Zhu-Hong
    Yan, Gui-Ying
    Gong, Dun-Wei
    [J]. ONCOTARGET, 2016, 7 (36) : 57919 - 57931