Predicting Classifier Performance with Limited Training Data: Applications to Computer-Aided Diagnosis in Breast and Prostate Cancer

Cited by: 7
Authors
Basavanhally, Ajay [1]
Viswanath, Satish [1]
Madabhushi, Anant [1]
Affiliations
[1] Case Western Reserve Univ, Dept Biomed Engn, Cleveland, OH 44106 USA
Funding
US National Institutes of Health;
Keywords
GRADE; BIAS;
DOI
10.1371/journal.pone.0117900
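Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code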
07 ; 0710 ; 09 ;
Abstract
Clinical trials increasingly employ medical imaging data in conjunction with supervised classifiers, where the latter require large amounts of training data to accurately model the system. Yet, a classifier selected at the start of the trial based on smaller and more accessible datasets may yield inaccurate and unstable classification performance. In this paper, we aim to address two common concerns in classifier selection for clinical trials: (1) predicting expected classifier performance for large datasets based on error rates calculated from smaller datasets and (2) the selection of appropriate classifiers based on expected performance for larger datasets. We present a framework for comparative evaluation of classifiers using only limited amounts of training data by using random repeated sampling (RRS) in conjunction with a cross-validation sampling strategy. Extrapolated error rates are subsequently validated via comparison with leave-one-out cross-validation performed on a larger dataset. The ability to predict error rates as dataset size increases is demonstrated on synthetic data and on three different computational imaging tasks: detecting cancerous image regions in prostate histopathology, differentiating high and low grade cancer in breast histopathology, and detecting cancerous metavoxels in prostate magnetic resonance spectroscopy. For each task, the relationships between three distinct classifiers (k-nearest neighbor, naive Bayes, support vector machine) are explored. Further quantitative evaluation in terms of interquartile range (IQR) suggests that, for all three datasets, our approach consistently yields error rates with lower variability (mean IQRs of 0.0070, 0.0127, and 0.0140) than a traditional RRS approach (mean IQRs of 0.0297, 0.0779, and 0.305) that does not employ cross-validation sampling.
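The extrapolation step described in the abstract (predicting the error rate at a large training-set size from error rates measured at small sizes) can be illustrated with a minimal sketch. The abstract does not state which curve is fitted, so the inverse power-law form err(n) = a * n^(-alpha) + b used here is an assumption, as are the function names `fit_inverse_power_law` and `predict_error` and the simple grid-search fit. This is not presented as the authors' implementation.

```python
import math


def fit_inverse_power_law(sizes, errors, steps=200):
    """Fit err(n) = a * n**(-alpha) + b (an assumed learning-curve model).

    For each candidate asymptote b on a grid in [0, min(errors)), fit
    log(err - b) against log(n) by ordinary least squares, then keep the
    candidate with the smallest squared error in the original space.
    Requires at least two distinct sizes and strictly positive errors.
    """
    xs = [math.log(n) for n in sizes]
    mx = sum(xs) / len(xs)
    sxx = sum((x - mx) ** 2 for x in xs)
    min_err = min(errors)
    best = None
    for i in range(steps):
        b = min_err * i / steps  # candidate asymptotic error rate
        ys = [math.log(e - b) for e in errors]
        my = sum(ys) / len(ys)
        sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
        slope = sxy / sxx
        a = math.exp(my - slope * mx)
        alpha = -slope
        sse = sum((a * n ** (-alpha) + b - e) ** 2
                  for n, e in zip(sizes, errors))
        if best is None or sse < best[0]:
            best = (sse, a, alpha, b)
    return best[1], best[2], best[3]


def predict_error(n, a, alpha, b):
    """Extrapolated error rate at training-set size n."""
    return a * n ** (-alpha) + b


# Hypothetical mean error rates measured at small training-set sizes.
small_sizes = [20, 40, 60, 80, 100]
mean_errors = [0.8 * n ** -0.5 + 0.1 for n in small_sizes]
a, alpha, b = fit_inverse_power_law(small_sizes, mean_errors)
predicted_at_1000 = predict_error(1000, a, alpha, b)
```

In practice the values in `mean_errors` would come from averaging classifier error over many repeated subsampling trials at each size; the synthetic values above only stand in for such measurements.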
Pages: 18