RBProkCNN: Deep learning on appropriate contextual evolutionary information for RNA binding protein discovery in prokaryotes

被引:3
作者
Pradhan, Upendra Kumar [1 ]
Naha, Sanchita [2 ]
Das, Ritwika [3 ]
Gupta, Ajit [1 ]
Parsad, Rajender [4 ]
Meher, Prabina Kumar [1 ]
机构
[1] ICAR Res Complex, Indian Agr Stat Res Inst, Div Stat Genet, New Delhi 110012, India
[2] ICAR Res Complex, Indian Agr Stat Res Inst, Div Comp Applicat, New Delhi 110012, India
[3] ICAR Res Complex, Indian Agr Stat Res Inst, Div Agr Bioinformat, New Delhi 110012, India
[4] ICAR Res Complex, Indian Agr Stat Res Inst, New Delhi 110012, India
来源
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2024年 / 23卷
关键词
RNA-binding proteins; Prediction model; Machine learning; Computational biology; Evolutionary feature; FOLD RECOGNITION; WEB SERVER; PREDICTION; DNA; CLASSIFICATION;
D O I
10.1016/j.csbj.2024.04.034
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
RNA-binding proteins (RBPs) are central to key functions such as post-transcriptional regulation, mRNA stability, and adaptation to varied environmental conditions in prokaryotes. While the majority of research has concentrated on eukaryotic RBPs, recent developments underscore the crucial involvement of prokaryotic RBPs. Although computational methods have emerged in recent years to identify RBPs, they have fallen short in accurately identifying prokaryotic RBPs due to their generic nature. To bridge this gap, we introduce RBProkCNN, a novel machine learning-driven computational model meticulously designed for the accurate prediction of prokaryotic RBPs. The prediction process involves the utilization of eight shallow learning algorithms and four deep learning models, incorporating PSSM-based evolutionary features. By leveraging a convolutional neural network (CNN) and evolutionarily significant features selected through extreme gradient boosting variable importance measure, RBProkCNN achieved the highest accuracy in five-fold cross-validation, yielding 98.04% auROC and 98.19% auPRC. Furthermore, RBProkCNN demonstrated robust performance with an independent dataset, showcasing a commendable 95.77% auROC and 95.78% auPRC. Noteworthy is its superior predictive accuracy when compared to several state-of-the-art existing models. RBProkCNN is available as an online prediction tool (https://iasri-sg.icar.gov.in/rbprokcnn/), offering free access to interested users. This tool represents a substantial contribution, enriching the array of resources available for the accurate and efficient prediction of prokaryotic RBPs.
引用
收藏
页码:1631 / 1640
页数:10
相关论文
共 50 条
  • [21] Coupling dynamics and evolutionary information with structure to identify protein regulatory and functional binding sites
    Mishra, Sambit K.
    Kandoi, Gaurav
    Jernigan, Robert L.
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2019, 87 (10) : 850 - 868
  • [22] SVM based prediction of RNA-binding proteins using binding residues and evolutionary information
    Kumar, Manish
    Gromiha, M. Michael
    Raghava, Gajendra P. S.
    [J]. JOURNAL OF MOLECULAR RECOGNITION, 2011, 24 (02) : 303 - 313
  • [24] Recent methodology progress of deep learning for RNA-protein interaction prediction
    Pan, Xiaoyong
    Yang, Yang
    Xia, Chun-Qiu
    Mirza, Aashiq H.
    Shen, Hong-Bin
    [J]. WILEY INTERDISCIPLINARY REVIEWS-RNA, 2019, 10 (06)
  • [25] Learning protein binding affinity using privileged information
    Wajid Arshad Abbasi
    Amina Asif
    Asa Ben-Hur
    Fayyaz ul Amir Afsar Minhas
    [J]. BMC Bioinformatics, 19
  • [26] A Deep Learning Model for RNA-Protein Binding Preference Prediction Based on Hierarchical LSTM and Attention Network
    Shen, Zhen
    Zhang, Qinhu
    Han, Kyungsook
    Huang, De-Shuang
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (02) : 753 - 762
  • [27] Deep Multi-Label Joint Learning for RNA and DNA-Binding Proteins Prediction
    Du, Xiuquan
    Hu, Jiajia
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) : 307 - 320
  • [28] DeepFusion: A deep bimodal information fusion network for unraveling protein-RNA interactions using in vivo RNA structures
    Qiao, Yixuan
    Yang, Rui
    Liu, Yang
    Chen, Jiaxin
    Zhao, Lianhe
    Huo, Peipei
    Wang, Zhihao
    Bu, Dechao
    Wu, Yang
    Zhao, Yi
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 617 - 625
  • [29] Improved Predicting of The Sequence Specificities of RNA Binding Proteins by Deep Learning
    Tayara, Hilal
    Chong, Kil To
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (06) : 2526 - 2534
  • [30] EDCNN: identification of genome-wide RNA-binding proteins using evolutionary deep convolutional neural network
    Wang, Yawei
    Yang, Yuning
    Ma, Zhiqiang
    Wong, Ka-Chun
    Li, Xiangtao
    [J]. BIOINFORMATICS, 2022, 38 (03) : 678 - 686