Prediction of RNA-protein interactions by combining deep convolutional neural network with feature selection ensemble method

被引:35
|
作者
Wang, Lei [1 ]
Yan, Xin [2 ]
Liu, Meng-Lin [1 ]
Song, Ke-Jian [3 ]
Sun, Xiao-Fei [1 ]
Pan, Wen-Wen [1 ]
机构
[1] Zaozhuang Univ, Coll Informat Sci & Engn, Zaozhuang 277100, Shandong, Peoples R China
[2] Zaozhuang Univ, Sch Foreign Languages, Zaozhuang 277100, Shandong, Peoples R China
[3] JiangXi Univ Sci & Technol, Sch Informat Engn, Ganzhou 341000, Jiangxi, Peoples R China
基金
美国国家科学基金会;
关键词
RNA-protein interaction; Convolution neural network; Extreme learning machine; Position-specific scoring matrix; BINDING PROTEINS; SUPPORT; MACHINE; ACCURACY; SEQUENCE;
D O I
10.1016/j.jtbi.2018.10.029
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
RNA-protein interaction (RPI) plays an important role in the basic cellular processes of organisms. Unfortunately, due to time and cost constraints, it is difficult for biological experiments to determine the relationship between RNA and protein to a large extent. So there is an urgent need for reliable computational methods to quickly and accurately predict RNA-protein interaction. In this study, we propose a novel computational method RPIFSE (predicting RPI with Feature Selection Ensemble method) based on RNA and protein sequence information to predict RPI. Firstly, RPIFSE disturbs the features extracted by the convolution neural network (CNN) and generates multiple data sets according to the weight of the feature, and then use extreme learning machine (ELM) classifier to classify these data sets. Finally, the results of each classifier are combined, and the highest score is chosen as the final prediction result by weighting voting method. In 5-fold cross-validation experiments, RPIFSE achieved 91.87%, 89.74%, 97.76% and 98.98% accuracy on RPI369, RPI2241, RPI488 and RPI1807 data sets, respectively. To further evaluate the performance of RPIFSE, we compare it with the state-of-the-art support vector machine (SVM) classifier and other exiting methods on those data sets. Furthermore, we also predicted the RPI on the independent data set NPInter2.0 and drew the network graph based on the prediction results. These promising comparison results demonstrated the effectiveness of RPIFSE and indicated that RPIFSE could be a useful tool for predicting RPI. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:230 / 238
页数:9
相关论文
共 50 条
  • [1] Combining High Speed ELM Learning with a Deep Convolutional Neural Network Feature Encoding for Predicting Protein-RNA Interactions
    Wang, Lei
    You, Zhu-Hong
    Huang, De-Shuang
    Zhou, Fengfeng
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (03) : 972 - 980
  • [2] Recent Advances in Machine Learning Based Prediction of RNA-Protein Interactions
    Sagar, Amit
    Xue, Bin
    PROTEIN AND PEPTIDE LETTERS, 2019, 26 (08) : 601 - 619
  • [3] Prediction of RNA-protein interactions using conjoint triad feature and chaos game representation
    Wang, Hongchu
    Wu, Pengfei
    BIOENGINEERED, 2018, 9 (01) : 242 - 251
  • [4] Recent methodology progress of deep learning for RNA-protein interaction prediction
    Pan, Xiaoyong
    Yang, Yang
    Xia, Chun-Qiu
    Mirza, Aashiq H.
    Shen, Hong-Bin
    WILEY INTERDISCIPLINARY REVIEWS-RNA, 2019, 10 (06)
  • [5] A Deep Convolutional Neural Network to Improve the Prediction of Protein Secondary Structure
    Guo, Lin
    Jiang, Qian
    Jin, Xin
    Liu, Lin
    Zhou, Wei
    Yao, Shaowen
    Wu, Min
    Wang, Yun
    CURRENT BIOINFORMATICS, 2020, 15 (07) : 767 - 777
  • [6] ConvsPPIS: Identifying Protein-protein Interaction Sites by an Ensemble Convolutional Neural Network with Feature Graph
    Zhu, Huaixu
    Du, Xiuquan
    Yao, Yu
    CURRENT BIOINFORMATICS, 2020, 15 (04) : 368 - 378
  • [7] Prediction of Protein-Protein Interactions in Arabidopsis, Maize, and Rice by Combining Deep Neural Network With Discrete Hilbert Transform
    Pan, Jie
    Li, Li-Ping
    You, Zhu-Hong
    Yu, Chang-Qing
    Ren, Zhong-Hao
    Guan, Yong-Jian
    FRONTIERS IN GENETICS, 2021, 12
  • [8] Prediction of RNA-protein interactions using a nucleotide language model
    Yamada, Keisuke
    Hamada, Michiaki
    Arighi, Cecilia
    BIOINFORMATICS ADVANCES, 2022, 2 (01):
  • [9] Graph neural representational learning of RNA secondary structures for predicting RNA-protein interactions
    Yan, Zichao
    Hamilton, William L.
    Blanchette, Mathieu
    BIOINFORMATICS, 2020, 36 : 276 - 284
  • [10] RBPsuite: RNA-protein binding sites prediction suite based on deep learning
    Pan, Xiaoyong
    Fang, Yi
    Li, Xianfeng
    Yang, Yang
    Shen, Hong-Bin
    BMC GENOMICS, 2020, 21 (01)