Exact misclassification probabilities for plug-in normal quadratic discriminant functions II. The heterogeneous case

被引:20
作者
McFarland, HR
Richards, DS
机构
[1] Inst Adv Study, Princeton, NJ 08540 USA
[2] Univ Virginia, Charlottesville, VA 22903 USA
关键词
apparent error rate; Bessel function of matrix argument; corporate financial data; cross-validation; diabetes data; discrimmant analysis; hold-out method; iris data; misclassification probability; multivariate gamma function; multivariate norinal distribution; resubstitution method; stochastic representation; Wishart distribution;
D O I
10.1006/jmva.2001.2034
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the problem of discriminating between two independent multivariate normal populations, N-p(mu(1), Sigma(1)) and N-p(mu(2), Sigma(2)) having distinct mean vectors mu(1) and mu(2) and distinct covariance matrices Sigma(2), and Sigma(2). The parameters mu(1), mu(2), Sigma(1), and Sigma(2) are unknown and are estimated by means of independent random training samples from each population. We derive a stochastic representation for the exact distribution of the "plug-in" quadratic discriminant function for classifying a new observation between the two populations. The stochastic representation involves only the classical standard normal, chi-square, and F distributions and is easily implemented for simulation purposes. Using Monte Carlo simulation of the stochastic representation we provide applications to the estimation of misclassification probabilities for the well-known iris data studied by Fisher (Ann. Eugen. 7 (1936), 179-188); a data set on corporate financial ratios provided by Johnson and Wichern (Applied Multivariate Statistical Analysis, 4th ed., Prentice-Hall, Englewood Cliffs, NJ, 1998); and a data set analyzed by Reaven and Miller (Diabetologia 16 (1979), 17-24) in a classification of diabetic status. (C) 2002 Elsevier Science (USA).
引用
收藏
页码:299 / 330
页数:32
相关论文
共 20 条
[1]  
Anderson T., 1984, INTRO MULTIVARIATE S
[2]  
Andrews D. F., 1985, Springer Series in Statistics
[3]  
[Anonymous], 1961, STUDIES ITEM ANAL PR
[4]  
[Anonymous], 1999, APPL MULTIVARIATE AN
[5]  
[Anonymous], 1961, OSAKA MATH J
[6]  
BARTLETT MS, 1963, BIOMETRIKA, V50, P17
[7]   TRANSFORMATION CONSIDERATIONS IN DISCRIMINANT-ANALYSIS [J].
BEAUCHAMP, JJ ;
ROBSON, DS .
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 1986, 15 (01) :147-179
[8]   ROBUSTNESS OF THE LINEAR DISCRIMINANT FUNCTION TO NON-NORMALITY - JOHNSONS SYSTEM [J].
CHINGANDA, EF ;
SUBRAHMANIAM, K .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1979, 3 (01) :69-77
[9]   The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[10]   BESSEL FUNCTIONS OF MATRIX ARGUMENT [J].
HERZ, CS .
ANNALS OF MATHEMATICS, 1955, 61 (03) :474-523