Classification trees as an alternative to linear discriminant analysis

Cited by: 87
Author
Feldesman, MR [1]
Affiliation
[1] Portland State Univ, Dept Anthropol, Portland, OR 97207 USA
Keywords
multivariate analysis; binary recursive trees; morphometrics; rpart; open source software; missing values imputation; discriminant analysis
DOI
10.1002/ajpa.10102
Chinese Library Classification
Q98 [Anthropology]
Subject classification code
030303
Abstract
Linear discriminant analysis (LDA) is frequently used for classification/prediction problems in physical anthropology, but it is unusual to find examples where researchers consider the statistical limitations and assumptions required for this technique. In these instances, it is difficult to know whether the predictions are reliable. This paper considers a nonparametric alternative to predictive LDA: binary, recursive (or classification) trees. This approach has the advantage that data transformation is unnecessary, cases with missing predictor variables do not require special treatment, prediction success is not dependent on data meeting normality conditions or covariance homogeneity, and variable selection is intrinsic to the methodology. Here I compare the efficacy of classification trees with LDA, using typical morphometric data. With data from modern hominoids, the results show that both techniques perform nearly equally. With complete data sets, LDA may be a better choice, as is shown in this example, but with missing observations, classification trees perform outstandingly well, whereas commercial discriminant analysis programs do not predict classifications for cases with incompletely measured predictor variables and generally are not designed to address the problem of missing data. Testing of data prior to analysis is necessary, and classification trees are recommended either as a replacement for LDA or as a supplement whenever data do not meet relevant assumptions. It is highly recommended as an alternative to LDA whenever the data set contains important cases with missing predictor variables. © 2002 Wiley-Liss, Inc.
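As a rough illustration of the binary recursive partitioning idea described in the abstract, the following is a minimal pure-Python sketch, not the paper's method or the rpart implementation. Everything in it is an assumption made for illustration: the function names (`gini`, `best_split`, `grow`, `predict`), the Gini impurity criterion, the depth limit, and the toy measurements, which are invented and not taken from the paper. rpart handles missing predictors with surrogate splits; this sketch substitutes a much simpler rule, sending a case with a missing value down whichever branch received more observed cases.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def goes_left(row, feature, threshold, default_left):
    """Route a case; a missing value (None) follows the default branch."""
    value = row[feature]
    return default_left if value is None else value <= threshold

def best_split(rows, labels):
    """Return (feature, threshold, default_left) with the lowest weighted
    Gini impurity, or None if no feature has two distinct observed values."""
    best, best_score = None, float("inf")
    for j in range(len(rows[0])):
        values = sorted({r[j] for r in rows if r[j] is not None})
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0
            n_left = sum(1 for r in rows if r[j] is not None and r[j] <= t)
            n_right = sum(1 for r in rows if r[j] is not None and r[j] > t)
            # Assumption: missing values go to the larger observed branch.
            default_left = n_left >= n_right
            left = [y for r, y in zip(rows, labels)
                    if goes_left(r, j, t, default_left)]
            right = [y for r, y in zip(rows, labels)
                     if not goes_left(r, j, t, default_left)]
            score = (len(left) * gini(left)
                     + len(right) * gini(right)) / len(rows)
            if score < best_score:
                best_score, best = score, (j, t, default_left)
    return best

def grow(rows, labels, depth=0, max_depth=3):
    """Recursively partition the cases; each leaf stores the majority class."""
    majority = Counter(labels).most_common(1)[0][0]
    if depth == max_depth or gini(labels) == 0.0:
        return {"leaf": majority}
    split = best_split(rows, labels)
    if split is None:
        return {"leaf": majority}
    j, t, default_left = split
    left = [(r, y) for r, y in zip(rows, labels)
            if goes_left(r, j, t, default_left)]
    right = [(r, y) for r, y in zip(rows, labels)
             if not goes_left(r, j, t, default_left)]
    return {
        "feature": j, "threshold": t, "default_left": default_left,
        "left": grow([r for r, _ in left], [y for _, y in left],
                     depth + 1, max_depth),
        "right": grow([r for r, _ in right], [y for _, y in right],
                      depth + 1, max_depth),
    }

def predict(tree, row):
    """Walk from the root to a leaf and return its class label."""
    while "leaf" not in tree:
        if goes_left(row, tree["feature"], tree["threshold"],
                     tree["default_left"]):
            tree = tree["left"]
        else:
            tree = tree["right"]
    return tree["leaf"]

# Invented toy data: two measurements per case, None marks a missing value.
rows = [(10.0, 3.0), (11.0, None), (12.0, 3.5),
        (20.0, 7.0), (21.0, None), (None, 7.5)]
labels = ["A", "A", "A", "B", "B", "B"]

tree = grow(rows, labels)
predictions = [predict(tree, r) for r in rows]
print(predictions)
```

The point of the sketch is only that a tree can still assign a class to a case with an unmeasured predictor, whereas a discriminant function needs every predictor to be present. No untransformed-data or distributional assumption enters the split search, which compares class proportions on either side of a threshold.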
Pages: 257-275
Number of pages: 19