Protein Sequence-Based COVID-19 Detection: A Comparative Study of Machine Learning Classification Methods

Cited by: 0
Authors
Aminah, Siti [1]
Ardaneswari, Gianinna [1]
Awang, Mohd Khalid [2]
Yusaputra, Muhammad Ariq [1]
Sari, Dian Puspita [1]
Affiliations
[1] Univ Indonesia, Fac Math & Nat Sci, Dept Math, Depok 16424, Indonesia
[2] Univ Sultan Zainal Abidin, Fac Informat & Comp, Besut 22200, Terengganu, Malaysia
Keywords
Compendex;
DOI
10.1155/2024/8683822
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Coronaviruses, including severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), continue to pose a significant public health challenge globally, even in 2024. Despite advancements in vaccines and treatments, the accurate classification of coronavirus protein sequences remains crucial for monitoring variants, understanding viral behavior, and developing targeted interventions. In this study, we investigate the efficacy of various classification methods in accurately classifying coronavirus protein sequences. We explore the use of K-nearest neighbor (KNN), fuzzy KNN (FKNN), support vector machine (SVM), and SVM with particle swarm optimization (PSO-SVM) algorithms for classification, complemented by feature selection techniques including principal component analysis (PCA) and random forest-recursive feature elimination (RF-RFE). Our dataset comprises 2000 protein sequences, evenly split between SARS-CoV-2 and non-SARS-CoV-2 sequences. Through rigorous analysis, we evaluate the performance of each classification model in terms of accuracy, sensitivity, specificity, and receiver operating characteristic area under the curve (ROC-AUC). Our findings demonstrate consistently high performance across all models, reflecting their efficacy in classifying coronavirus protein sequences. Notably, the PCA + PSO-SVM model emerges as the top-performing model, exhibiting the highest classification accuracy, specificity, and ROC-AUC score, demonstrating its effectiveness in distinguishing between SARS-CoV-2 and non-SARS-CoV-2 sequences. Overall, our study highlights the importance of employing advanced classification methods and feature selection techniques in accurately classifying coronavirus protein sequences. The findings provide valuable insights for researchers and practitioners in the field of bioinformatics and contribute to ongoing efforts in understanding and combating the COVID-19 pandemic and its evolving challenges.
Pages: 14
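The abstract evaluates each classifier by accuracy, sensitivity, specificity, and ROC-AUC. As a minimal sketch of how these four measures are conventionally computed for a binary task (label 1 for SARS-CoV-2, label 0 for non-SARS-CoV-2), the following uses only the Python standard library. The function names, the 0.5 decision threshold, and the toy labels and scores are illustrative assumptions; they are not taken from the paper and do not reproduce its results.

```python
from typing import Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, tn, fp, fn) for binary labels where 1 is the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    # Fraction of all sequences classified correctly.
    return (tp + tn) / (tp + tn + fp + fn)


def sensitivity(tp: int, fn: int) -> float:
    # True positive rate: positives correctly identified.
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    # True negative rate: negatives correctly identified.
    return tn / (tn + fp)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """ROC-AUC as the probability that a random positive outscores a random
    negative (ties count as one half). Quadratic in sample size, so this is
    only suitable for small illustrative inputs."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 4 positive and 4 negative sequences with classifier scores.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed threshold of 0.5

tp, tn, fp, fn = confusion_counts(y_true, y_pred)
print("accuracy:", accuracy(tp, tn, fp, fn))
print("sensitivity:", sensitivity(tp, fn))
print("specificity:", specificity(tn, fp))
print("ROC-AUC:", roc_auc(y_true, scores))
```

Accuracy, sensitivity, and specificity depend on the chosen threshold, whereas ROC-AUC depends only on how the scores rank positives against negatives, which is why the two kinds of measure are usually reported together.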