RNAVirHost: a machine learning-based method for predicting hosts of RNA viruses through viral genomes

被引:0
作者
Chen, Guowei [1 ]
Jiang, Jingzhe [2 ]
Sun, Yanni [1 ]
机构
[1] City Univ Hong Kong, Dept Elect Engn, Kowloon, 83 Tat Chee Ave, Hong Kong, Peoples R China
[2] Chinese Acad Fishery Sci, South China Sea Fisheries Res Inst, Key Lab South China Sea Fishery Resources Exploita, Minist Agr & Rural Affairs, Guangzhou 510300, Peoples R China
来源
GIGASCIENCE | 2024年 / 13卷
关键词
RNA virus; host prediction; machine learning; metagenomics; MOLECULAR CHARACTERIZATION; VECTORS;
D O I
10.1093/gigascience/giae059
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background The high-throughput sequencing technologies have revolutionized the identification of novel RNA viruses. Given that viruses are infectious agents, identifying hosts of these new viruses carries significant implications for public health and provides valuable insights into the dynamics of the microbiome. However, determining the hosts of these newly discovered viruses is not always straightforward, especially in the case of viruses detected in environmental samples. Even for host-associated samples, it is not always correct to assign the sample origin as the host of the identified viruses. The process of assigning hosts to RNA viruses remains challenging due to their high mutation rates and vast diversity.Results In this study, we introduce RNAVirHost, a machine learning-based tool that predicts the hosts of RNA viruses solely based on viral genomes. RNAVirHost is a hierarchical classification framework that predicts hosts at different taxonomic levels. We demonstrate the superior accuracy of RNAVirHost in predicting hosts of RNA viruses through comprehensive comparisons with various state-of-the-art techniques. When applying to viruses from novel genera, RNAVirHost achieved the highest accuracy of 84.3%, outperforming the alignment-based strategy by 12.1%.Conclusions The application of machine learning models has proven beneficial in predicting hosts of RNA viruses. By integrating genomic traits and sequence homologies, RNAVirHost provides a cost-effective and efficient strategy for host prediction. We believe that RNAVirHost can greatly assist in RNA virus analyses and contribute to pandemic surveillance.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] A novel, effective machine learning-based RNA editing profile for predicting the prognosis of lower-grade gliomas
    Wang, Boshen
    Tian, Peijie
    Sun, Qianyu
    Zhang, Hengdong
    Han, Lei
    Zhu, Baoli
    HELIYON, 2023, 9 (07)
  • [22] A machine learning-based underwater noise classification method
    Song, Guoli
    Guo, Xinyi
    Wang, Wenbo
    Ren, Qunyan
    Li, Jun
    Ma, Li
    APPLIED ACOUSTICS, 2021, 184
  • [23] A Machine Learning-Based Method for Detecting Liver Fibrosis
    Suarez, Miguel
    Martinez, Raquel
    Torres, Ana Maria
    Ramon, Antonio
    Blasco, Pilar
    Mateo, Jorge
    DIAGNOSTICS, 2023, 13 (18)
  • [24] A Machine Learning-Based Wrapper Method for Feature Selection
    Patel, Damodar
    Saxena, Amit
    Wang, John
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2024, 20 (01)
  • [25] A Machine Learning-based Method for Cyber Risk Assessment
    Rafaiani, Giulia
    Battaglioni, Massimo
    Compagnoni, Simone
    Senigagliesi, Linda
    Chiaraluce, Franco
    Baldi, Marco
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 263 - 268
  • [26] Machine Learning-Based Attack Detection Method in Hadoop
    Li, Ningwei
    Gao, Hang
    Liu, Liang
    Peng, Jianfei
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT III, 2020, 12454 : 184 - 196
  • [27] Comprehensive assessment of machine learning-based methods for predicting antimicrobial peptides
    Xu, Jing
    Li, Fuyi
    Leier, Andre
    Xiang, Dongxu
    Shen, Hsin-Hui
    Lago, Tatiana T. Marquez
    Li, Jian
    Yu, Dong-Jun
    Song, Jiangning
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [28] Machine learning-based approach for predicting the consolidation characteristics of soft soil
    Singh, Moirangthem Johnson
    Kaushik, Anshul
    Patnaik, Gyanesh
    Xu, Dong-Sheng
    Feng, Wei-Qiang
    Rajput, Abhishek
    Prakash, Guru
    Borana, Lalit
    MARINE GEORESOURCES & GEOTECHNOLOGY, 2024, 42 (04) : 405 - 419
  • [29] A machine learning-based analysis for predicting fragility curve parameters of buildings
    Dabiri, Hamed
    Faramarzi, Asaad
    Dall 'Asta, Andrea
    Tondi, Emanuele
    Micozzi, Fabio
    JOURNAL OF BUILDING ENGINEERING, 2022, 62
  • [30] A Machine Learning-Based Method for Predicting End-Bearing Capacity of Rock-Socketed Shafts
    Haohua Chen
    Lianyang Zhang
    Rock Mechanics and Rock Engineering, 2022, 55 : 1743 - 1757