Sample-Size Determination Methodologies for Machine Learning in Medical Imaging Research: A Systematic Review

被引:176
作者
Balki, Indranil [1 ]
Amirabadi, Afsaneh [1 ,2 ]
Levman, Jacob [3 ,4 ]
Martel, Anne L. [5 ]
Emersic, Ziga [6 ]
Meden, Blaz [6 ]
Garcia-Pedrero, Angel [7 ,8 ]
Ramirez, Saul C. [9 ]
Kong, Dehan [10 ]
Moody, Alan R. [1 ]
Tyrrell, Pascal N. [1 ,10 ]
机构
[1] Univ Toronto, Dept Med Imaging, 263 McCaul St,4th Floor Room 409, Toronto, ON M5T 1W7, Canada
[2] Hosp Sick Children, Dept Diagnost Imaging, Toronto, ON, Canada
[3] St Francis Xavier Univ, Dept Math Stat & Comp Sci, Antigonish, NS, Canada
[4] Harvard Med Sch, Boston Childrens Hosp, Boston, MA 02115 USA
[5] Univ Toronto, Dept Med Biophys, Toronto, ON, Canada
[6] Univ Ljubljana, Fac Comp & Informat Sci, Ljubljana, Slovenia
[7] Univ Valladolid, Dept Bot, Castile, Spain
[8] Univ Valladolid, Dept Bot, Leon, Spain
[9] Inst Tecnol Costa Rica, Comp Sch, Cartago, Costa Rica
[10] Univ Toronto, Dept Stat Sci, Toronto, ON, Canada
来源
CANADIAN ASSOCIATION OF RADIOLOGISTS JOURNAL-JOURNAL DE L ASSOCIATION CANADIENNE DES RADIOLOGISTES | 2019年 / 70卷 / 04期
关键词
Sample size; Machine learning; Medical imaging; Radiology; COMPUTER-AIDED DIAGNOSIS; CLASSIFIER DESIGN; ALZHEIMERS-DISEASE; FEATURE-SELECTION; TRAINING SET; PERFORMANCE; MODEL;
D O I
10.1016/j.carj.2019.06.002
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Purpose: The required training sample size for a particular machine learning (ML) model applied to medical imaging data is often unknown. The purpose of this study was to provide a descriptive review of current sample-size determination methodologies in ML applied to medical imaging and to propose recommendations for future work in the field. Methods: We conducted a systematic literature search of articles using Medline and Embase with keywords including "machine learning," "image," and "sample size." The search included articles published between 1946 and 2018. Data regarding the ML task, sample size, and train-test pipeline were collected. Results: A total of 167 articles were identified, of which 22 were included for qualitative analysis. There were only 4 studies that discussed sample-size determination methodologies, and 18 that tested the effect of sample size on model performance as part of an exploratory analysis. The observed methods could be categorized as pre hoc model-based approaches, which relied on features of the algorithm, or post hoc curve-fitting approaches requiring empirical testing to model and extrapolate algorithm performance as a function of sample size. Between studies, we observed great variability in performance testing procedures used for curve-fitting, model assessment methods, and reporting of confidence in sample sizes. Conclusions: Our study highlights the scarcity of research in training set size determination methodologies applied to ML in medical imaging, emphasizes the need to standardize current reporting practices, and guides future work in development and streamlining of pre hoc and post hoc sample size approaches.
引用
收藏
页码:344 / 353
页数:10
相关论文
共 50 条
  • [1] Effects of hardware heterogeneity on the performance of SVM Alzheimer's disease classifier
    Abdulkadir, Ahmed
    Mortamet, Benedicte
    Vemuri, Prashanthi
    Jack, Clifford R., Jr.
    Krueger, Gunnar
    Kloeppel, Stefan
    [J]. NEUROIMAGE, 2011, 58 (03) : 785 - 792
  • [2] [Anonymous], ASSESSMENT NEED SEPA
  • [3] [Anonymous], P IEEE COMP SOC C CO
  • [4] [Anonymous], 1998, NEURAL NETWORKS COMP
  • [5] What Size Net Gives Valid Generalization?
    Baum, Eric B.
    Haussler, David
    [J]. NEURAL COMPUTATION, 1989, 1 (01) : 151 - 160
  • [6] A general model for finite-sample effects in training and testing of competing classifiers
    Beiden, SV
    Maloof, MA
    Wagner, RF
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (12) : 1561 - 1569
  • [7] Sample size planning for classification models
    Beleites, Claudia
    Neugebauer, Ute
    Bocklitz, Thomas
    Krafft, Christoph
    Popp, Juergen
    [J]. ANALYTICA CHIMICA ACTA, 2013, 760 : 25 - 33
  • [8] Statistics in brief:: The importance of sample size in the planning and interpretation of medical research
    Biau, David Jean
    Kerneis, Solen
    Porcher, Raphael
    [J]. CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2008, 466 (09) : 2282 - 2288
  • [9] Effects of sample size on classifier design: Quadratic and neural network classifiers
    Chan, HP
    Sahiner, B
    Wagner, RF
    Petrick, N
    Mossoba, J
    [J]. IMAGE PROCESSING - MEDICAL IMAGING 1997, PTS 1 AND 2, 1997, 3034 : 1102 - 1113
  • [10] Effects of sample size on classifier design for computer-aided diagnosis
    Chan, HP
    Sahiner, B
    Wagner, RF
    Petrick, N
    [J]. MEDICAL IMAGING 1998: IMAGE PROCESSING, PTS 1 AND 2, 1998, 3338 : 845 - 858