Sequence-Based Predicting Bacterial Essential ncRNAs Algorithm by Machine Learning

Authors
Ye, Yuan-Nong [1, 2, 3]
Liang, Ding-Fa [2]
Labena, Abraham Alemayehu [4]
Zeng, Zhu [2]
Affiliations
[1] Guizhou Med Univ, Sch Big Hlth, Dept Med Informat, Bioinformat & Biomed Big data Min Lab, Guiyang 550025, Peoples R China
[2] Guizhou Med Univ, Cells & Antibody Engn Res Ctr Guizhou Prov, Sch Biol & Engn, Key Lab Biol & Med Engn, Guiyang 550025, Peoples R China
[3] Guizhou Med Univ, Key Lab Environm Pollut Monitoring & Dis Control, Minist Educ, Guiyang 550025, Peoples R China
[4] Dilla Univ, Coll Computat & Nat Sci, Dilla 419, Ethiopia
Funding
National Natural Science Foundation of China
Keywords
Bioinformatics; biological information theory; biomedical informatics; PROTEIN
DOI
10.32604/iasc.2023.026761
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Essential ncRNAs are ncRNAs that are indispensable for the survival of an organism. Although essential ncRNAs do not encode proteins, they are as biologically important as essential coding genes. They have a wide variety of applications, such as antimicrobial target discovery, minimal genome construction, and evolutionary analysis. At present, essential ncRNAs have been determined at the whole-genome scale for only a few species, because traditional experimental methods are time-consuming, laborious, and costly. In addition, traditional experimental methods are limited by the organism, as less than 1% of bacteria can be cultured in the laboratory. It is therefore important to develop theories and methods for recognizing essential non-coding RNAs. In this paper, we present a novel method for predicting essential ncRNAs that uses both compositional features and derivative features calculated from ncRNA sequences by information theory. The method was developed with a Support Vector Machine (SVM). Its accuracy, evaluated through cross-species cross-validation, was between 0.69 and 0.81. This shows that the selected features perform well for predicting essential ncRNAs with an SVM. Thus, the method can be applied to discover essential ncRNAs in bacteria.
Pages: 2731-2741
Page count: 11
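
Illustrative sketch
The abstract names compositional features, information-theoretic derivative features, and cross-species cross-validation, but does not list the exact features or the splitting scheme. The Python sketch below is therefore only one plausible reading: it assumes mono- and dinucleotide frequencies as the compositional features, Shannon entropy of those frequencies as the derivative features, and a leave-one-species-out split as the cross-species scheme. All function names are hypothetical and are not taken from the paper; the SVM training step itself is left out.

```python
import math
from collections import Counter
from itertools import product

ALPHABET = "ACGU"  # RNA alphabet; DNA-style T is mapped to U below


def kmer_frequencies(seq, k=2):
    """Relative frequencies of all 4**k k-mers (assumed compositional features)."""
    seq = seq.upper().replace("T", "U")
    kmers = ["".join(p) for p in product(ALPHABET, repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Only k-mers made of A/C/G/U are counted; ambiguous bases are ignored.
    total = sum(counts[m] for m in kmers)
    return [counts[m] / total if total else 0.0 for m in kmers]


def shannon_entropy(freqs):
    """Shannon entropy in bits (assumed information-theoretic feature)."""
    return -sum(p * math.log2(p) for p in freqs if p > 0)


def feature_vector(seq):
    """Concatenate 4 mono- and 16 dinucleotide frequencies with their entropies."""
    mono = kmer_frequencies(seq, 1)
    di = kmer_frequencies(seq, 2)
    return mono + di + [shannon_entropy(mono), shannon_entropy(di)]


def leave_one_species_out(species_labels):
    """Yield (held_out_species, train_indices, test_indices) for each species."""
    for held_out in sorted(set(species_labels)):
        train = [i for i, s in enumerate(species_labels) if s != held_out]
        test = [i for i, s in enumerate(species_labels) if s == held_out]
        yield held_out, train, test
```

Each ncRNA sequence becomes a 22-dimensional vector that any SVM implementation could take as input, with the classifier trained on the `train` indices and scored on the held-out species for each split.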