RBProkCNN: Deep learning on appropriate contextual evolutionary information for RNA binding protein discovery in prokaryotes

被引:3
|
作者
Pradhan, Upendra Kumar [1 ]
Naha, Sanchita [2 ]
Das, Ritwika [3 ]
Gupta, Ajit [1 ]
Parsad, Rajender [4 ]
Meher, Prabina Kumar [1 ]
机构
[1] ICAR Res Complex, Indian Agr Stat Res Inst, Div Stat Genet, New Delhi 110012, India
[2] ICAR Res Complex, Indian Agr Stat Res Inst, Div Comp Applicat, New Delhi 110012, India
[3] ICAR Res Complex, Indian Agr Stat Res Inst, Div Agr Bioinformat, New Delhi 110012, India
[4] ICAR Res Complex, Indian Agr Stat Res Inst, New Delhi 110012, India
来源
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2024年 / 23卷
关键词
RNA-binding proteins; Prediction model; Machine learning; Computational biology; Evolutionary feature; FOLD RECOGNITION; WEB SERVER; PREDICTION; DNA; CLASSIFICATION;
D O I
10.1016/j.csbj.2024.04.034
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
RNA-binding proteins (RBPs) are central to key functions such as post-transcriptional regulation, mRNA stability, and adaptation to varied environmental conditions in prokaryotes. While the majority of research has concentrated on eukaryotic RBPs, recent developments underscore the crucial involvement of prokaryotic RBPs. Although computational methods have emerged in recent years to identify RBPs, they have fallen short in accurately identifying prokaryotic RBPs due to their generic nature. To bridge this gap, we introduce RBProkCNN, a novel machine learning-driven computational model meticulously designed for the accurate prediction of prokaryotic RBPs. The prediction process involves the utilization of eight shallow learning algorithms and four deep learning models, incorporating PSSM-based evolutionary features. By leveraging a convolutional neural network (CNN) and evolutionarily significant features selected through extreme gradient boosting variable importance measure, RBProkCNN achieved the highest accuracy in five-fold cross-validation, yielding 98.04% auROC and 98.19% auPRC. Furthermore, RBProkCNN demonstrated robust performance with an independent dataset, showcasing a commendable 95.77% auROC and 95.78% auPRC. Noteworthy is its superior predictive accuracy when compared to several state-of-the-art existing models. RBProkCNN is available as an online prediction tool (https://iasri-sg.icar.gov.in/rbprokcnn/), offering free access to interested users. This tool represents a substantial contribution, enriching the array of resources available for the accurate and efficient prediction of prokaryotic RBPs.
引用
收藏
页码:1631 / 1640
页数:10
相关论文
共 50 条
  • [1] RBPSpot: Learning on appropriate contextual information for RBP binding sites discovery
    Sharma, Nitesh Kumar
    Gupta, Sagar
    Kumar, Ashwani
    Kumar, Prakash
    Pradhan, Upendra Kumar
    Shankar, Ravi
    ISCIENCE, 2021, 24 (12)
  • [2] RBPLight: a computational tool for discovery of plant-specific RNA-binding proteins using light gradient boosting machine and ensemble of evolutionary features
    Pradhan, Upendra K.
    Meher, Prabina K.
    Naha, Sanchita
    Pal, Soumen
    Gupta, Sagar
    Gupta, Ajit
    Parsad, Rajender
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2023, 22 (05) : 401 - 410
  • [3] Ranking Protein-Protein Binding Using Evolutionary Information and Machine Learning
    Farhoodi, Roshanak
    Akbal-Delibas, Bahar
    Haspel, Nurit
    ACM-BCB' 2017: PROCEEDINGS OF THE 8TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY,AND HEALTH INFORMATICS, 2017, : 667 - 672
  • [4] A deep neural network approach for learning intrinsic protein-RNA binding preferences
    Ben-Bassat, Ilan
    Chor, Benny
    Orenstein, Yaron
    BIOINFORMATICS, 2018, 34 (17) : 638 - 646
  • [5] DeepFusion-RBP: Using Deep Learning to Fuse Multiple Features to Identify RNA-binding Protein Sequences
    Wang, Xu
    Wang, Shunfang
    Fu, Haoyi
    Ruan, Xiaoli
    Tang, Xianjun
    CURRENT BIOINFORMATICS, 2021, 16 (08) : 1089 - 1100
  • [6] A Deep Learning Framework for Robust and Accurate Prediction of ncRNA-Protein Interactions Using Evolutionary Information
    Yi, Hai-Cheng
    You, Zhu-Hong
    Huang, De-Shuang
    Li, Xiao
    Jiang, Tong-Hai
    Li, Li-Ping
    MOLECULAR THERAPY-NUCLEIC ACIDS, 2018, 11 : 337 - 344
  • [7] iHBP-DeepPSSM: Identifying hormone binding proteins using PsePSSM based evolutionary features and deep learning approach
    Akbar, Shahid
    Khan, Salman
    Ali, Farman
    Hayat, Maqsood
    Qasim, Muhammad
    Gul, Sarah
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 204 (204)
  • [8] A Novel Deep Learning Method for Predicting RNA-Protein Binding Sites
    Zhao, Xueru
    Chang, Furong
    Lv, Hehe
    Zou, Guobing
    Zhang, Bofeng
    APPLIED SCIENCES-BASEL, 2023, 13 (05):
  • [9] A deep learning framework to predict binding preference of RNA constituents on protein surface
    Lam, Jordy Homing
    Li, Yu
    Zhu, Lizhe
    Umarov, Ramzan
    Jiang, Hanlun
    Heliou, Amelie
    Sheong, Fu Kit
    Liu, Tianyun
    Long, Yongkang
    Li, Yunfei
    Fang, Liang
    Altman, Russ B.
    Chen, Wei
    Huang, Xuhui
    Gao, Xin
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [10] Learning protein binding affinity using privileged information
    Abbasi, Wajid Arshad
    Asif, Amina
    Ben-Hur, Asa
    Minhas, Fayyaz ul Amir Afsar
    BMC BIOINFORMATICS, 2018, 19