Prediction of RNA-protein interactions by combining deep convolutional neural network with feature selection ensemble method

Cited by: 35
Authors
Wang, Lei [1]
Yan, Xin [2]
Liu, Meng-Lin [1]
Song, Ke-Jian [3]
Sun, Xiao-Fei [1]
Pan, Wen-Wen [1]
Affiliations
[1] Zaozhuang Univ, Coll Informat Sci & Engn, Zaozhuang 277100, Shandong, Peoples R China
[2] Zaozhuang Univ, Sch Foreign Languages, Zaozhuang 277100, Shandong, Peoples R China
[3] JiangXi Univ Sci & Technol, Sch Informat Engn, Ganzhou 341000, Jiangxi, Peoples R China
Funding
National Science Foundation (USA);
Keywords
RNA-protein interaction; Convolution neural network; Extreme learning machine; Position-specific scoring matrix; BINDING PROTEINS; SUPPORT; MACHINE; ACCURACY; SEQUENCE;
DOI
10.1016/j.jtbi.2018.10.029
Chinese Library Classification number
Q [Biological sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
RNA-protein interaction (RPI) plays an important role in the basic cellular processes of organisms. Unfortunately, due to time and cost constraints, it is difficult to determine RNA-protein relationships on a large scale through biological experiments. There is therefore an urgent need for reliable computational methods that predict RNA-protein interactions quickly and accurately. In this study, we propose a novel computational method, RPIFSE (predicting RPI with Feature Selection Ensemble method), which predicts RPI from RNA and protein sequence information. First, RPIFSE perturbs the features extracted by the convolutional neural network (CNN) and generates multiple data sets according to the feature weights; it then uses an extreme learning machine (ELM) classifier to classify these data sets. Finally, the results of the classifiers are combined by weighted voting, and the highest-scoring class is chosen as the final prediction. In 5-fold cross-validation experiments, RPIFSE achieved 91.87%, 89.74%, 97.76% and 98.98% accuracy on the RPI369, RPI2241, RPI488 and RPI1807 data sets, respectively. To further evaluate the performance of RPIFSE, we compare it with the state-of-the-art support vector machine (SVM) classifier and other existing methods on those data sets. Furthermore, we also predicted RPI on the independent data set NPInter2.0 and drew a network graph based on the prediction results. These promising comparison results demonstrate the effectiveness of RPIFSE and indicate that it could be a useful tool for predicting RPI. © 2018 Elsevier Ltd. All rights reserved.
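The ensemble idea in the abstract (several feature subsets drawn according to feature weights, one ELM per subset, weighted voting over the results) can be illustrated with a small sketch. This is not the authors' implementation. The CNN feature extraction is omitted, the synthetic data and the `feature_weights` values are stand-ins, weighting each member by its training accuracy is an assumption, and the names `ELM` and `weighted_vote_ensemble` are hypothetical.

```python
import numpy as np


class ELM:
    """Minimal extreme learning machine: random hidden layer, least-squares output layer."""

    def __init__(self, n_hidden=50, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        return np.tanh(X @ self.W + self.b)

    def fit(self, X, y):
        # Hidden-layer weights are drawn at random and never trained.
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        targets = np.eye(2)[y]  # one-hot targets for the two classes
        # Output weights come from a pseudo-inverse (least-squares) solve.
        self.beta = np.linalg.pinv(self._hidden(X)) @ targets
        return self

    def scores(self, X):
        return self._hidden(X) @ self.beta


def weighted_vote_ensemble(X_train, y_train, X_test, feature_weights,
                           n_members=5, keep=0.7, seed=0):
    """Train one ELM per weighted feature subset and combine them by weighted voting."""
    rng = np.random.default_rng(seed)
    probs = feature_weights / feature_weights.sum()
    k = max(1, int(keep * X_train.shape[1]))
    total = np.zeros((X_test.shape[0], 2))
    for m in range(n_members):
        # Sample a feature subset; higher-weight features are more likely to be kept.
        cols = rng.choice(X_train.shape[1], size=k, replace=False, p=probs)
        elm = ELM(seed=seed + m).fit(X_train[:, cols], y_train)
        # Assumption: each member's vote is weighted by its training accuracy.
        acc = (elm.scores(X_train[:, cols]).argmax(axis=1) == y_train).mean()
        total += acc * elm.scores(X_test[:, cols])
    return total.argmax(axis=1)


# Synthetic stand-in for CNN-extracted features: two shifted Gaussian classes.
rng = np.random.default_rng(42)
n, d = 200, 10
y = rng.integers(0, 2, size=n)
X = rng.normal(size=(n, d)) + np.where(y[:, None] == 1, 2.0, -2.0)
X_tr, y_tr, X_te, y_te = X[:150], y[:150], X[150:], y[150:]

# Stand-in feature weights: absolute difference of class means on the training split.
feature_weights = np.abs(X_tr[y_tr == 1].mean(axis=0) - X_tr[y_tr == 0].mean(axis=0))

pred = weighted_vote_ensemble(X_tr, y_tr, X_te, feature_weights)
accuracy = (pred == y_te).mean()
print(f"hold-out accuracy: {accuracy:.2f}")
```

Summing accuracy-weighted scores and taking the arg-max is one simple way to realize weighted voting; the paper may weight members differently.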
Pages: 230-238
Number of pages: 9
Related papers
50 in total
  • [31] A novel convolutional neural network framework based solar irradiance prediction method
    Dong, Na
    Chang, Jian-Fang
    Wu, Ai-Guo
    Gao, Zhong-Ke
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2020, 114
  • [32] RDense: A Protein-RNA Binding Prediction Model Based on Bidirectional Recurrent Neural Network and Densely Connected Convolutional Networks
    Li, Zhong
    Zhu, Jiapeng
    Xu, Xiaojiang
    Yao, Yuhua
    IEEE ACCESS, 2020, 8 (08): 14588 - 14605
  • [33] A multiple-input deep residual convolutional neural network for reservoir permeability prediction
    Masroor, Milad
    Niri, Mohammad Emami
    Sharifinasab, Mohammad Hassan
    GEOENERGY SCIENCE AND ENGINEERING, 2023, 222
  • [34] DeepFusion: A deep bimodal information fusion network for unraveling protein-RNA interactions using in vivo RNA structures
    Qiao, Yixuan
    Yang, Rui
    Liu, Yang
    Chen, Jiaxin
    Zhao, Lianhe
    Huo, Peipei
    Wang, Zhihao
    Bu, Dechao
    Wu, Yang
    Zhao, Yi
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 617 - 625
  • [35] An ensemble approach to predict binding hotspots in protein-RNA interactions based on SMOTE data balancing and Random Grouping feature selection strategies
    Zhou, Tong
    Rong, Jie
    Liu, Yang
    Gong, Weikang
    Li, Chunhua
    BIOINFORMATICS, 2022, 38 (09) : 2452 - 2458
  • [36] RPI-GGCN: Prediction of RNA-Protein Interaction Based on Interpretability Gated Graph Convolution Neural Network and Co-Regularized Variational Autoencoders
    Wang, Yifei
    Ding, Pengju
    Wang, Congjing
    He, Shiyue
    Gao, Xin
    Yu, Bin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [37] Error Prediction Algorithm of Medical Image Based on Convolution Neural Network and Feature Selection
    Li X.
    Liu G.
    Wei J.
    Wang Y.
    Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2021, 48 (04): 90 - 99
  • [38] The Bipartite Network Projection-Recommended Algorithm for Predicting Long Non-coding RNA-Protein Interactions
    Zhao, Qi
    Yu, Haifan
    Ming, Zhong
    Hu, Huan
    Ren, Guofei
    Liu, Hongsheng
    MOLECULAR THERAPY-NUCLEIC ACIDS, 2018, 13 : 464 - 471
  • [39] The Development of RNA-KISS, a Mammalian Three-Hybrid Method to Detect RNA-Protein Interactions in Living Mammalian Cells
    Lemmens, Irma
    Jansen, Sander
    de Rouck, Steffi
    De Smet, Anne-Sophie
    Defever, Dieter
    Neyts, Johan
    Dallmeier, Kai
    Tavernier, Jan
    JOURNAL OF PROTEOME RESEARCH, 2020, 19 (07) : 2529 - 2538
  • [40] Artificial Neural Network Approach to Prediction of Protein-RNA Residue-base Contacts
    Hayashida, Morihiro
    Nacher, Jose
    Koyano, Hitoshi
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 3 (BIOINFORMATICS), 2019, : 163 - 167