Classifier design for computer-aided diagnosis: Effects of finite sample size on the mean performance of classical and neural network classifiers

被引：129

作者：

Chan, HP

Sahiner, B

Wagner, RF

Petrick, N

机构：

[1] Univ Michigan, Dept Radiol, Ann Arbor, MI 48109 USA

[2] US FDA, Ctr Devices & Radiol Hlth, Rockville, MD 20852 USA

来源：

MEDICAL PHYSICS | 1999年 / 26卷 / 12期

关键词：

computer-aided diagnosis; classifier design; linear classifier; quadratic classifier; neural network; sample size; feature space dimensionality; ROC analysis;

D O I：

10.1118/1.598805

中图分类号：

R8 [特种医学]; R445 [影像诊断学];

学科分类号：

1002 ; 100207 ; 1009 ;

摘要：

Classifier design is one of the key steps in the development of computer-aided diagnosis (CAD) algorithms. A classifier is designed with case samples drawn from the patient population. Generally, the sample size available for classifier design is limited, which introduces variance and bias into the performance of the trained classifier, relative to that obtained with an infinite sample size. For CAD applications, a commonly used performance index for a classifier is the area, A(z), under the receiver operating characteristic (ROC) curve. We have conducted a computer simulation study to investigate the dependence of the mean performance, in terms of A(z), on design sample size for a linear discriminant and two nonlinear classifiers, the quadratic discriminant and the backpropagation neural network (ANN). The performances of the classifiers were compared for four types of class distributions that have specific properties: multivariate normal distributions with equal covariance matrices and unequal means, unequal covariance matrices and unequal means, and unequal covariance matrices and equal means, and a feature space where the two classes were uniformly distributed in disjoint checkerboard regions. We evaluated the performances of the classifiers in feature spaces of dimensionality ranging from 3 to 15, and design sample sizes from 20 to 800 per class. The dependence of the resubstitution and hold-out performance on design (training) sample size (N-t) was investigated. For multivariate normal class distributions with equal covariance matrices, the linear discriminant is the optimal classifier. It was found that its A(z)-versus-1/N-t curves can be closely approximated by linear dependences over the range of sample sizes studied. In the feature spaces with unequal covariance matrices where the quadratic discriminant is optimal, the linear discriminant is inferior to the quadratic discriminant or the ANN when the design sample size is large. However, when the design sample is small, a relatively simple classifier, such as the linear discriminant or an ANN with very few hidden nodes, may be preferred because performance bias increases with the complexity of the classifier. In the regime where the classifier performance is dominated by the 1/N-t term, the performance in the limit of infinite sample size can be estimated as the intercept (1/N-t = 0) of a linear regression of A(z) versus 1/N-t. The understanding of the performance of the classifiers under the constraint of a finite design sample size is expected to facilitate the selection of a proper classifier for a given classification task and the design of an efficient resampling scheme. (C) 1999 American Association of Physicists in Medicine. [S0094-2405(99)00212-6].

引用

页码：2654 / 2668

页数：15

共 17 条

[1]

[Anonymous], 1975, Discriminant Analysis

[2]

BROWN DG, 1994, P SOC PHOTO-OPT INS, V2167, P180

[3] Effects of sample size on classifier design: Quadratic and neural network classifiers [J].

Chan, HP ;

Sahiner, B ;

Wagner, RF ;

Petrick, N ;

Mossoba, J .

IMAGE PROCESSING - MEDICAL IMAGING 1997, PTS 1 AND 2, 1997, 3034 :1102-1113

[4] Effects of sample size on classifier design for computer-aided diagnosis [J].

Chan, HP ;

Sahiner, B ;

Wagner, RF ;

Petrick, N .

MEDICAL IMAGING 1998: IMAGE PROCESSING, PTS 1 AND 2, 1998, 3338 :845-858

[5]

CHAN HP, 1997, MED PHYS, V24, P1034

[6] EFFECTS OF SAMPLE-SIZE IN CLASSIFIER DESIGN [J].

FUKUNAGA, K ;

HAYES, RR .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1989, 11 (08) :873-885

[7]

Fukunaga K., 1990, INTRO STAT PATTERN R

[8]

Hand D.J., 1981, DISCRIMINATION CLASS

[9]

Johnson R.A., 1982, APPL MULTIVARIATE ST

[10]

Raudys S, 1980, IEEE Trans Pattern Anal Mach Intell, V2, P242, DOI 10.1109/TPAMI.1980.4767011

← 1 2 →