Comparison of machine learning techniques to handle imbalanced COVID-19 CBC datasets

被引:11
作者
Dorn, Marcio [1 ,2 ,3 ]
Grisci, Bruno Iochins [1 ]
Narloch, Pedro Henrique [1 ]
Feltes, Bruno Cesar [1 ,4 ]
Avila, Eduardo [3 ,5 ]
Kahmann, Alessandro [6 ]
Alho, Clarice Sampaio [3 ,5 ]
机构
[1] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil
[2] Univ Fed Rio Grande do Sul, Ctr Biotechnol, Porto Alegre, RS, Brazil
[3] Natl Inst Sci & Technol, Forens Sci, Porto Alegre, RS, Brazil
[4] Univ Fed Rio Grande do Sul, Dept Genet, Porto Alegre, RS, Brazil
[5] Pontificia Univ Catolica Rio Grande do Sul, Sch Hlth & Life Sci, Porto Alegre, RS, Brazil
[6] Fed Univ Rio Grande, Inst Math Stat & Phys, Rio Grande, RS, Brazil
关键词
Machine learning; Data mining; Imbalanced datasets; Covid; Hemogram; CORONAVIRUS DISEASE 2019; CLASSIFICATION; WAVE; TREES; RISK; CT;
D O I
10.7717/peerj-cs.670
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Coronavirus pandemic caused by the novel SARS-CoV-2 has significantly impacted human health and the economy, especially in countries struggling with financial resources for medical testing and treatment, such as Brazil's case, the third most affected country by the pandemic. In this scenario, machine learning techniques have been heavily employed to analyze different types of medical data, and aid decision making, offering a low-cost alternative. Due to the urgency to fight the pandemic, a massive amount of works are applying machine learning approaches to clinical data, including complete blood count (CBC) tests, which are among the most widely available medical tests. In this work, we review the most employed machine learning classifiers for CBC data, together with popular sampling methods to deal with the class imbalance. Additionally, we describe and critically analyze three publicly available Brazilian COVID-19 CBC datasets and evaluate the performance of eight classifiers and five sampling techniques on the selected datasets. Our work provides a panorama of which classifier and sampling methods provide the best results for different relevant metrics and discuss their impact on future analyses. The metrics and algorithms are introduced in a way to aid newcomers to the field. Finally, the panorama discussed here can significantly benefit the comparison of the results of new ML algorithms.
引用
收藏
页码:1 / 34
页数:34
相关论文
共 50 条
  • [21] Practical Machine Learning Techniques for COVID-19 Detection Using Chest
    Mangalmurti, Yurananatul
    Wattanapongsakorn, Naruemon
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 34 (02) : 733 - 752
  • [22] Machine Learning Techniques and Forecasting Methods for Analyzing and Predicting Covid-19
    Alshabeeb, Israa Ali
    Azeez, Ruaa Majeed
    Shakir, Wafaa Mohammed Ridha
    INTERNATIONAL JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE, 2022, 17 (01) : 413 - 424
  • [23] Applying Different Machine Learning Techniques for Prediction of COVID-19 Severity
    Sayed, Safynaz Abdel-Fattah
    Elkorany, Abeer Mohamed
    Mohammad, Sabah Sayed
    IEEE ACCESS, 2021, 9 : 135697 - 135707
  • [24] Robust and efficient COVID-19 detection techniques: A machine learning approach
    Hasan, Md Mahadi
    Murtaz, Saba Binte
    Islam, Muhammad Usama
    Sadeq, Muhammad Jafar
    Uddin, Jasim
    PLOS ONE, 2022, 17 (09):
  • [25] Automatic COVID-19 prediction using explainable machine learning techniques
    Solayman S.
    Aumi S.A.
    Mery C.S.
    Mubassir M.
    Khan R.
    International Journal of Cognitive Computing in Engineering, 2023, 4 : 36 - 46
  • [26] A REVIEW ON EXTENSIVELY USED MACHINE LEARNING TECHNIQUES FOR THE PREDICTION OF COVID-19
    Mojahid, Hafiza Zoya
    Zain, Jasni Mohamad
    Basit, Abdul
    Yusoff, Marina
    Ali, Mushtaq
    SURANAREE JOURNAL OF SCIENCE AND TECHNOLOGY, 2024, 31 (01): : 030167 - 1
  • [27] Comparing different machine learning techniques for predicting COVID-19 severity
    Xiong, Yibai
    Ma, Yan
    Ruan, Lianguo
    Li, Dan
    Lu, Cheng
    Huang, Luqi
    INFECTIOUS DISEASES OF POVERTY, 2022, 11 (01)
  • [28] Intelligent internet of things and advanced machine learning techniques for covid-19
    Chakraborty C.
    Abougreen A.N.
    EAI Endorsed Transactions on Pervasive Health and Technology, 2021, 7 (26)
  • [29] Machine Learning and Image Processing Techniques for Covid-19 Detection: A Review
    Appari, Neeraj Venkatasai L.
    Kanojia, Mahendra G.
    Bangera, Kritik B.
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2021), 2022, 417 : 441 - 450
  • [30] Machine learning techniques as an efficient alternative tool for COVID-19 cases
    Bustos, Nicolas
    Tello, Manuel
    Droppelmann, Guillermo
    Garcia, Nicolas
    Feijoo, Felipe
    Leiva, Victor
    SIGNA VITAE, 2022, 18 (01) : 23 - 33