How is test laboratory data used and characterised by machine learning models? A systematic review of diagnostic and prognostic models developed for COVID-19 patients using only laboratory data

被引:24
作者
Carobene, Anna [1 ]
Milella, Frida [2 ]
Famiglini, Lorenzo [3 ]
Cabitza, Federico [2 ,3 ]
机构
[1] IRCCS San Raffaele Sci Inst, Lab Med, Via Olgettina 60, I-20132 Milan, Italy
[2] IRCCS Ist Ortoped Galeazzi, Milan, Italy
[3] Univ Milano Bicocca, DISCo, Milan, Italy
关键词
complete blood count (CBC); COVID-19; diagnostic study; laboratory tests; machine learning; prognostic study; SARS-CoV-2; CREATININE DETERMINATION; BIOLOGICAL VARIATION; EXTERNAL VALIDATION; CRITICAL-APPRAISAL; STANDARDIZATION; CHECKLIST; QUALITY; SERUM;
D O I
10.1515/cclm-2022-0182
中图分类号
R446 [实验室诊断]; R-33 [实验医学、医学实验];
学科分类号
1001 ;
摘要
The current gold standard for COVID-19 diagnosis, the rRT-PCR test, is hampered by long turnaround times, probable reagent shortages, high false-negative rates and high prices. As a result, machine learning (ML) methods have recently piqued interest, particularly when applied to digital imagery (X-rays and CT scans). In this review, the literature on ML-based diagnostic and prognostic studies grounded on hematochemical parameters has been considered. By doing so, a gap in the current literature was addressed concerning the application of machine learning to laboratory medicine. Sixty-eight articles have been included that were extracted from the Scopus and PubMed indexes. These studies were marked by a great deal of heterogeneity in terms of the examined laboratory test and clinical parameters, sample size, reference populations, ML algorithms, and validation approaches. The majority of research was found to be hampered by reporting and replicability issues: only four of the surveyed studies provided complete information on analytic procedures (units of measure, analyzing equipment), while 29 provided no information at all. Only 16 studies included independent external validation. In light of these findings, we discuss the importance of closer collaboration between data scientists and medical laboratory professionals in order to correctly characterise the relevant population, select the most appropriate statistical and analytical methods, ensure reproducibility, enable the proper interpretation of the results, and gain actual utility by using machine learning methods in clinical practice.
引用
收藏
页码:1887 / 1901
页数:15
相关论文
共 56 条
[1]   The Biological Variation Data Critical Appraisal Checklist: A Standard for Evaluating Studies on Biological Variation [J].
Aarsand, Aasne K. ;
Roraas, Thomas ;
Fernandez-Calle, Pilar ;
Ricos, Carmen ;
Diaz-Garzon, Jorge ;
Jonker, Niels ;
Perich, Carmen ;
Gonzalez-Lao, Elisabet ;
Carobene, Anna ;
Minchinela, Joana ;
Coskun, Abdurrahman ;
Simon, Margarita ;
Alvarez, Virtudes ;
Bartlett, William A. ;
Fernandez-Fernandez, Pilar ;
Boned, Beatriz ;
Braga, Federica ;
Corte, Zoraida ;
Aslan, Berna ;
Sandberg, Sverre .
CLINICAL CHEMISTRY, 2018, 64 (03) :501-514
[2]   Hemogram data as a tool for decision-making in COVID-19 management: applications to resource scarcity scenarios [J].
Avila, Eduardo ;
Kahmann, Alessandro ;
Alho, Clarice ;
Dorn, Marcio .
PEERJ, 2020, 8
[3]   Machine Learning for Clinical Chemists [J].
Badrick, Tony ;
Banfi, Giuseppe ;
Bietenbeck, Andreas ;
Cervinski, Mark A. ;
Loh, Tze Ping ;
Sikaris, Ken .
CLINICAL CHEMISTRY, 2019, 65 (11) :1350-1356
[4]   Use of Machine Learning and Artificial Intelligence to predict SARS-CoV-2 infection from Full Blood Counts in a population [J].
Banerjee, Abhirup ;
Ray, Surajit ;
Vorselaars, Bart ;
Kitson, Joanne ;
Mamalakis, Michail ;
Weeks, Simonne ;
Baker, Mark ;
Mackenzie, Louise S. .
INTERNATIONAL IMMUNOPHARMACOLOGY, 2020, 86
[5]   A checklist for critical appraisal of studies of biological variation [J].
Bartlett, William A. ;
Braga, Federica ;
Carobene, Anna ;
Coskun, Abdurrahman ;
Prusa, Richard ;
Fernandez-Calle, Pilar ;
Roraas, Thomas ;
Jonker, Neils ;
Sandberg, Sverre .
CLINICAL CHEMISTRY AND LABORATORY MEDICINE, 2015, 53 (06) :879-885
[6]  
Bossuyt PM, 2003, CROAT MED J, V44, P639
[7]   Short- and medium-term biological variation estimates of red blood cell and reticulocyte parameters in healthy subjects [J].
Buoro, Sabrina ;
Carobene, Anna ;
Seghezzi, Michela ;
Manenti, Barbara ;
Dominoni, Paola ;
Pacioni, Aurelio ;
Ceriotti, Ferruccio ;
Ottomano, Cosimo ;
Lippi, Giuseppe .
CLINICAL CHEMISTRY AND LABORATORY MEDICINE, 2018, 56 (06) :954-963
[8]   Short- and medium-term biological variation estimates of leukocytes extended to differential count and morphology-structural parameters (cell population data) in blood samples obtained from healthy people [J].
Buoro, Sabrina ;
Carobene, Anna ;
Seghezzi, Michela ;
Manenti, Barbara ;
Pacioni, Aurelio ;
Ceriotti, Ferruccio ;
Ottomano, Cosimo ;
Lippi, Giuseppe .
CLINICA CHIMICA ACTA, 2017, 473 :147-156
[9]   The importance of being external. methodological insights for the external validation of machine learning models in medicine [J].
Cabitza, Federico ;
Campagner, Andrea ;
Soares, Felipe ;
Garcia de Guadiana-Romualdo, Luis ;
Challa, Feyissa ;
Sulejmani, Adela ;
Seghezzi, Michela ;
Carobene, Anna .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 208
[10]   Development, evaluation, and validation of machine learning models for COVID-19 detection based on routine blood tests [J].
Cabitza, Federico ;
Campagner, Andrea ;
Ferrari, Davide ;
Di Resta, Chiara ;
Ceriotti, Daniele ;
Sabetta, Eleonora ;
Colombini, Alessandra ;
De Vecchi, Elena ;
Banfi, Giuseppe ;
Locatelli, Massimo ;
Carobene, Anna .
CLINICAL CHEMISTRY AND LABORATORY MEDICINE, 2021, 59 (02) :421-431