RNAVirHost: a machine learning-based method for predicting hosts of RNA viruses through viral genomes

Cited by: 0
Authors
Chen, Guowei [1]
Jiang, Jingzhe [2]
Sun, Yanni [1]
Affiliations
[1] City Univ Hong Kong, Dept Elect Engn, Kowloon, 83 Tat Chee Ave, Hong Kong, Peoples R China
[2] Chinese Acad Fishery Sci, South China Sea Fisheries Res Inst, Key Lab South China Sea Fishery Resources Exploita, Minist Agr & Rural Affairs, Guangzhou 510300, Peoples R China
Source
GIGASCIENCE | 2024 / Vol. 13
Keywords
RNA virus; host prediction; machine learning; metagenomics; MOLECULAR CHARACTERIZATION; VECTORS;
DOI
10.1093/gigascience/giae059
Chinese Library Classification number
Q [Biological Sciences];
Subject classification codes
07 ; 0710 ; 09 ;
Abstract
Background: High-throughput sequencing technologies have revolutionized the identification of novel RNA viruses. Because viruses are infectious agents, identifying the hosts of these new viruses has significant implications for public health and provides valuable insight into the dynamics of the microbiome. However, determining the hosts of newly discovered viruses is not always straightforward, especially for viruses detected in environmental samples. Even for host-associated samples, the sample origin is not always the correct host of the viruses identified in it. Assigning hosts to RNA viruses remains challenging because of their high mutation rates and vast diversity.
Results: In this study, we introduce RNAVirHost, a machine learning-based tool that predicts the hosts of RNA viruses based solely on viral genomes. RNAVirHost is a hierarchical classification framework that predicts hosts at different taxonomic levels. We demonstrate the superior accuracy of RNAVirHost in predicting hosts of RNA viruses through comprehensive comparisons with various state-of-the-art techniques. When applied to viruses from novel genera, RNAVirHost achieved the highest accuracy of 84.3%, outperforming the alignment-based strategy by 12.1%.
Conclusions: The application of machine learning models has proven beneficial in predicting hosts of RNA viruses. By integrating genomic traits and sequence homologies, RNAVirHost provides a cost-effective and efficient strategy for host prediction. We believe that RNAVirHost can greatly assist in RNA virus analyses and contribute to pandemic surveillance.
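
The abstract describes a hierarchical classification framework that predicts hosts at successive taxonomic levels. The following is a minimal, generic sketch of that idea (top-down classification with one classifier per node of the host taxonomy), not RNAVirHost's actual implementation. Everything in it is an assumption made for illustration: the dinucleotide-frequency features, the nearest-centroid classifier, the names `kmer_profile`, `CentroidClassifier` and `HierarchicalHostPredictor`, and the toy sequences and host labels.

```python
from collections import Counter
from itertools import product
import math

K = 2  # k-mer length; an arbitrary choice to keep the example tiny
KMERS = ["".join(p) for p in product("ACGU", repeat=K)]


def kmer_profile(genome: str) -> list:
    """Return the normalized k-mer frequency vector of an RNA sequence."""
    seq = genome.upper().replace("T", "U")
    counts = Counter(seq[i:i + K] for i in range(len(seq) - K + 1))
    total = sum(counts[k] for k in KMERS) or 1  # ignore k-mers with ambiguous bases
    return [counts[k] / total for k in KMERS]


class CentroidClassifier:
    """Nearest-centroid classifier, a stand-in for any trained model."""

    def __init__(self):
        self.centroids = {}

    def fit(self, vectors, labels):
        grouped = {}
        for vec, label in zip(vectors, labels):
            grouped.setdefault(label, []).append(vec)
        # Centroid = column-wise mean of all vectors sharing a label.
        self.centroids = {
            label: [sum(col) / len(vecs) for col in zip(*vecs)]
            for label, vecs in grouped.items()
        }
        return self

    def predict(self, vector):
        return min(self.centroids, key=lambda y: math.dist(vector, self.centroids[y]))


class HierarchicalHostPredictor:
    """Top-down prediction: one classifier per internal node of the host tree."""

    def __init__(self):
        self.models = {}  # path prefix (tuple) -> classifier over the next level

    def fit(self, genomes, host_paths):
        vectors = [kmer_profile(g) for g in genomes]
        depth = max(len(p) for p in host_paths)
        for level in range(depth):
            groups = {}
            for vec, path in zip(vectors, host_paths):
                if len(path) > level:
                    groups.setdefault(path[:level], []).append((vec, path[level]))
            for prefix, items in groups.items():
                vecs, labels = zip(*items)
                self.models[prefix] = CentroidClassifier().fit(vecs, labels)
        return self

    def predict(self, genome):
        vector = kmer_profile(genome)
        path = ()
        # Descend until the current node has no classifier for a deeper level.
        while path in self.models:
            path += (self.models[path].predict(vector),)
        return path


# Toy training data (hypothetical sequences and host labels).
train_genomes = ["GCGCGCGCGCGC", "AUAUAUAUAUAU", "AAAAAAAAAAAA"]
train_hosts = [("Bacteria",), ("Eukaryota", "Plantae"), ("Eukaryota", "Animalia")]
predictor = HierarchicalHostPredictor().fit(train_genomes, train_hosts)

print(predictor.predict("AUAUAUAU"))
print(predictor.predict("GCGCGCGC"))
```

Because each level is predicted only within the branch chosen at the level above, a prediction can stop early at a coarse taxon when no finer-grained classifier exists for that branch, as happens for the `Bacteria` branch in this toy data.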
Pages: 14