共 50 条
RBProkCNN: Deep learning on appropriate contextual evolutionary information for RNA binding protein discovery in prokaryotes
被引:3
作者:
Pradhan, Upendra Kumar
[1
]
Naha, Sanchita
[2
]
Das, Ritwika
[3
]
Gupta, Ajit
[1
]
Parsad, Rajender
[4
]
Meher, Prabina Kumar
[1
]
机构:
[1] ICAR Res Complex, Indian Agr Stat Res Inst, Div Stat Genet, New Delhi 110012, India
[2] ICAR Res Complex, Indian Agr Stat Res Inst, Div Comp Applicat, New Delhi 110012, India
[3] ICAR Res Complex, Indian Agr Stat Res Inst, Div Agr Bioinformat, New Delhi 110012, India
[4] ICAR Res Complex, Indian Agr Stat Res Inst, New Delhi 110012, India
来源:
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL
|
2024年
/
23卷
关键词:
RNA-binding proteins;
Prediction model;
Machine learning;
Computational biology;
Evolutionary feature;
FOLD RECOGNITION;
WEB SERVER;
PREDICTION;
DNA;
CLASSIFICATION;
D O I:
10.1016/j.csbj.2024.04.034
中图分类号:
Q5 [生物化学];
Q7 [分子生物学];
学科分类号:
071010 ;
081704 ;
摘要:
RNA-binding proteins (RBPs) are central to key functions such as post-transcriptional regulation, mRNA stability, and adaptation to varied environmental conditions in prokaryotes. While the majority of research has concentrated on eukaryotic RBPs, recent developments underscore the crucial involvement of prokaryotic RBPs. Although computational methods have emerged in recent years to identify RBPs, they have fallen short in accurately identifying prokaryotic RBPs due to their generic nature. To bridge this gap, we introduce RBProkCNN, a novel machine learning-driven computational model meticulously designed for the accurate prediction of prokaryotic RBPs. The prediction process involves the utilization of eight shallow learning algorithms and four deep learning models, incorporating PSSM-based evolutionary features. By leveraging a convolutional neural network (CNN) and evolutionarily significant features selected through extreme gradient boosting variable importance measure, RBProkCNN achieved the highest accuracy in five-fold cross-validation, yielding 98.04% auROC and 98.19% auPRC. Furthermore, RBProkCNN demonstrated robust performance with an independent dataset, showcasing a commendable 95.77% auROC and 95.78% auPRC. Noteworthy is its superior predictive accuracy when compared to several state-of-the-art existing models. RBProkCNN is available as an online prediction tool (https://iasri-sg.icar.gov.in/rbprokcnn/), offering free access to interested users. This tool represents a substantial contribution, enriching the array of resources available for the accurate and efficient prediction of prokaryotic RBPs.
引用
收藏
页码:1631 / 1640
页数:10
相关论文
共 50 条