Sequence-Based Predicting Bacterial Essential ncRNAs Algorithm by Machine Learning

Cited by: 0
Authors
Ye, Yuan-Nong [1 ,2 ,3 ]
Liang, Ding-Fa [2 ]
Labena, Abraham Alemayehu [4 ]
Zeng, Zhu [2 ]
Affiliations
[1] Guizhou Med Univ, Sch Big Hlth, Dept Med Informat, Bioinformat & Biomed Big data Min Lab, Guiyang 550025, Peoples R China
[2] Guizhou Med Univ, Cells & Antibody Engn Res Ctr Guizhou Prov, Sch Biol & Engn, Key Lab Biol & Med Engn, Guiyang 550025, Peoples R China
[3] Guizhou Med Univ, Key Lab Environm Pollut Monitoring & Dis Control, Minist Educ, Guiyang 550025, Peoples R China
[4] Dilla Univ, Coll Computat & Nat Sci, Dilla 419, Ethiopia
Funding
National Natural Science Foundation of China;
Keywords
Bioinformatics; biological information theory; biomedical informatics; PROTEIN;
DOI
10.32604/iasc.2023.026761
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Essential ncRNAs are ncRNAs that are indispensable for the survival of organisms. Although essential ncRNAs do not encode proteins, they are as important as essential coding genes in biology. They have a wide variety of applications, such as antimicrobial target discovery, minimal genome construction, and evolutionary analysis. At present, essential ncRNAs have been determined at the whole-genome scale in very few species, because traditional methods are time-consuming, laborious, and costly. In addition, traditional experimental methods are limited by the organisms themselves, as less than 1% of bacteria can be cultured in the laboratory. Therefore, it is important and necessary to develop theories and methods for recognizing essential non-coding RNAs. In this paper, we present a novel method for predicting essential ncRNAs using both compositional features and derivative features calculated from ncRNA sequences by information theory. The method was developed with a Support Vector Machine (SVM). Its accuracy was evaluated through cross-species cross-validation and found to be between 0.69 and 0.81. This shows that the selected features perform well for predicting essential ncRNAs with SVM. Thus, the method can be applied to discover essential ncRNAs in bacteria.
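The abstract does not list the exact features or the validation protocol, so the following is only a minimal sketch of what sequence-derived compositional and information-theoretic features, plus a cross-species split, might look like. The choice of mono- and dinucleotide frequencies, Shannon entropy as the derived feature, the leave-one-species-out reading of "cross-species cross-validation", and all function names are assumptions made for illustration, not the authors' implementation.

```python
from collections import Counter
from itertools import product
from math import log2

ALPHABET = "ACGU"  # RNA alphabet; DNA-style input is mapped T -> U below


def kmer_frequencies(seq, k):
    """Relative frequency of every k-mer over ACGU (compositional features)."""
    seq = seq.upper().replace("T", "U")
    kmers = ["".join(p) for p in product(ALPHABET, repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # k-mers containing other symbols (e.g. N) are ignored in the total.
    total = sum(counts[m] for m in kmers)
    return [counts[m] / total if total else 0.0 for m in kmers]


def shannon_entropy(freqs):
    """Shannon entropy in bits of a frequency vector (derived feature)."""
    return -sum(p * log2(p) for p in freqs if p > 0)


def feature_vector(seq):
    """Hypothetical feature layout: 4 mono + 16 di frequencies + 2 entropies."""
    mono = kmer_frequencies(seq, 1)
    di = kmer_frequencies(seq, 2)
    return mono + di + [shannon_entropy(mono), shannon_entropy(di)]


def leave_one_species_out(species_labels):
    """Yield (held_out_species, train_indices, test_indices) per species.

    Assumed interpretation of cross-species validation: train on all
    species but one, then test on the held-out species.
    """
    for held_out in sorted(set(species_labels)):
        train = [i for i, s in enumerate(species_labels) if s != held_out]
        test = [i for i, s in enumerate(species_labels) if s == held_out]
        yield held_out, train, test
```

Vectors produced this way could then be passed to any SVM implementation, with the classifier fitted on the training indices of each fold and scored on the held-out species.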
Pages: 2731-2741
Page count: 11
Related papers
50 in total
  • [41] Co-evolution based machine-learning for predicting functional interactions between human genes
    Stupp, Doron
    Sharon, Elad
    Bloch, Idit
    Zitnik, Marinka
    Zuk, Or
    Tabach, Yuval
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [42] Charge and hydrophobicity are key features in sequence-trained machine learning models for predicting the biophysical properties of clinical-stage antibodies
    Hebditch, Max
    Warwicker, Jim
    PEERJ, 2019, 7
  • [43] A Stacking Machine Learning Method for IL-10-Induced Peptide Sequence Recognition Based on Unified Deep Representation Learning
    Li, Jiayu
    Jiang, Jici
    Pei, Hongdi
    Lv, Zhibin
    APPLIED SCIENCES-BASEL, 2023, 13 (16)
  • [44] Predicting Motor and Cognitive Improvement Through Machine Learning Algorithm in Human Subject that Underwent a Rehabilitation Treatment in the Early Stage of Stroke
    Sale, Patrizio
    Ferriero, Giorgio
    Ciabattoni, Lucio
    Cortese, Anna Maria
    Ferracuti, Francesco
    Romeo, Luca
    Piccione, Francesco
    Masiero, Stefano
    JOURNAL OF STROKE & CEREBROVASCULAR DISEASES, 2018, 27 (11) : 2962 - 2972
  • [45] Predicting COVID-19 disease severity from SARS-CoV-2 spike protein sequence by mixed effects machine learning
    Sokhansanj, Bahrad A.
    Rosen, Gail L.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 149
  • [46] MLAS: Machine Learning-Based Approach for Predicting Abiotic Stress-Responsive Genes in Chinese Cabbage
    You, Xiong
    Shu, Yiting
    Ni, Xingcheng
    Lv, Hengmin
    Luo, Jian
    Tao, Jianping
    Bai, Guanghui
    Feng, Shusu
    HORTICULTURAE, 2025, 11 (01)
  • [47] PredLnc-GFStack: A Global Sequence Feature Based on a Stacked Ensemble Learning Method for Predicting lncRNAs from Transcripts
    Liu, Shuai
    Zhao, Xiaohan
    Zhang, Guangyan
    Li, Weiyang
    Liu, Feng
    Liu, Shichao
    Zhang, Wen
    GENES, 2019, 10 (09)
  • [48] A New Model for Caries Risk Prediction in Teenagers Using a Machine Learning Algorithm Based on Environmental and Genetic Factors
    Pang, Liangyue
    Wang, Ketian
    Tao, Ye
    Zhi, Qinghui
    Zhang, Jianming
    Lin, Huancai
    FRONTIERS IN GENETICS, 2021, 12
  • [49] Effect of a Machine Learning-Based Severe Sepsis Prediction Algorithm on Patient Survival and Hospital Length of Stay
    Barton, Chris
    Shimabukuro, David
    Feldman, Mitchell D.
    Mataroso, Samson
    Das, Ritankar
    CIRCULATION, 2017, 136
  • [50] Chromatin interaction neural network (ChINN): a machine learning-based method for predicting chromatin interactions from DNA sequences
    Fan Cao
    Yu Zhang
    Yichao Cai
    Sambhavi Animesh
    Ying Zhang
    Semih Can Akincilar
    Yan Ping Loh
    Xinya Li
    Wee Joo Chng
    Vinay Tergaonkar
    Chee Keong Kwoh
    Melissa J. Fullwood
    Genome Biology, 22