Sample size calculation for data reliability and diagnostic performance: a go-to review

被引:7
|
作者
Monti, Caterina Beatrice [1 ]
Ambrogi, Federico [2 ,3 ]
Sardanelli, Francesco [3 ,4 ]
机构
[1] Univ Milan, Postgrad Sch Radiodiagnost, Milan, Italy
[2] Univ Milan, Dept Clin Sci & Community Hlth, Milan, Italy
[3] IRCCS Policlin San Donato, Milan, Italy
[4] Lega Italiana lotta contro & tumori LILT Milano Mo, Milan, Italy
关键词
Data science; Reproducibility of results; ROC curve; Sample size; Sensitivity and specificity; OPERATING CHARACTERISTIC CURVES; REQUIREMENTS; DESIGN;
D O I
10.1186/s41747-024-00474-w
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Sample size, namely the number of subjects that should be included in a study to reach the desired endpoint and statistical power, is a fundamental concept of scientific research. Indeed, sample size must be planned a priori, and tailored to the main endpoint of the study, to avoid including too many subjects, thus possibly exposing them to additional risks while also wasting time and resources, or too few subjects, failing to reach the desired purpose. We offer a simple, go-to review of methods for sample size calculation for studies concerning data reliability (repeatability/reproducibility) and diagnostic performance. For studies concerning data reliability, we considered Cohen's kappa or intraclass correlation coefficient (ICC) for hypothesis testing, estimation of Cohen's kappa or ICC, and Bland-Altman analyses. With regards to diagnostic performance, we considered accuracy or sensitivity/specificity versus reference standards, the comparison of diagnostic performances, and the comparisons of areas under the receiver operating characteristics curve. Finally, we considered the special cases of dropouts or retrospective case exclusions, multiple endpoints, lack of prior data estimates, and the selection of unusual thresholds for alpha and beta errors. For the most frequent cases, we provide example of software freely available on the Internet.Relevance statement Sample size calculation is a fundamental factor influencing the quality of studies on repeatability/reproducibility and diagnostic performance in radiology.Key points center dot Sample size is a concept related to precision and statistical power.center dot It has ethical implications, especially when patients are exposed to risks.center dot Sample size should always be calculated before starting a study.center dot This review offers simple, go-to methods for sample size calculations.
引用
收藏
页数:13
相关论文
共 22 条
  • [1] Estimation and Sample Size Calculation for Service Utilization Data
    Bhaumik, Dulal K.
    Aryal, Subhash
    STATISTICS AND APPLICATIONS, 2020, 18 (02): : 263 - 274
  • [2] Sample size calculation should be performed for design accuracy in diagnostic test studies
    Flahault, A
    Cadilhac, M
    Thomas, G
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2005, 58 (08) : 859 - 862
  • [3] Influence of data sampling on confidence in the calculation of reliability index for simple performance functions
    Bathurst, Richard J.
    Chenari, Reza Jamshidi
    COMPUTERS AND GEOTECHNICS, 2024, 166
  • [4] Sample size calculation for recurrent event data with additive rates models
    Zhu, Liang
    Li, Yimei
    Tang, Yongqiang
    Shen, Liji
    Onar-Thomas, Arzu
    Sun, Jianguo
    PHARMACEUTICAL STATISTICS, 2022, 21 (01) : 89 - 102
  • [5] Exemplary data set sample size calculation for Wilcoxon-Mann-Whitney tests
    Divine, George
    Kapke, Alissa
    Havstad, Suzanne
    Joseph, Christine L. M.
    STATISTICS IN MEDICINE, 2010, 29 (01) : 108 - 115
  • [6] Sample size calculation for estimating key epidemiological parameters using serological data and mathematical modelling
    Blaizot, Stephanie
    Herzog, Sereina A.
    Abrams, Steven
    Theeten, Heidi
    Litzroth, Amber
    Hens, Niel
    BMC MEDICAL RESEARCH METHODOLOGY, 2019, 19 (1)
  • [7] Sample size calculation for estimating key epidemiological parameters using serological data and mathematical modelling
    Stéphanie Blaizot
    Sereina A. Herzog
    Steven Abrams
    Heidi Theeten
    Amber Litzroth
    Niel Hens
    BMC Medical Research Methodology, 19
  • [8] Sample Size Calculation for Count Data in Comparative Clinical Trials with Nonuniform Patient Accrual and Early Dropout
    Li, Huiling
    Wang, Lin
    Wei, Lynn
    Quan, Hui
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2015, 25 (01) : 1 - 15
  • [9] Sample size and predictive performance of machine learning methods with survival data: A simulation study
    Infante, Gabriele
    Miceli, Rosalba
    Ambrogi, Federico
    STATISTICS IN MEDICINE, 2023, 42 (30) : 5657 - 5675
  • [10] Sample size calculation based on exact test for assessing differential expression analysis in RNA-seq data
    Li, Chung-I
    Su, Pei-Fang
    Shyr, Yu
    BMC BIOINFORMATICS, 2013, 14