Statistical methods for classifying genotypes

Cited by: 76
Authors
Crossa, J
Franco, J
Affiliations
[1] Int Maize & Wheat Improvement Ctr CIMMYT, Biometr & Stat Unit, Mexico City 06600, DF, Mexico
[2] Univ Republ Oriental Uruguay, Fac Agron, Montevideo, Uruguay
Keywords
categorical and continuous variables; cluster analysis; genetic resources; mixture models
DOI
10.1023/B:EUPH.0000040500.86428.e8
Chinese Library Classification (CLC) number
S3 [Agriculture (Agronomy)]
Discipline classification code
0901
Abstract
In genetic resource conservation and plant breeding, multivariate data on continuous and categorical traits are collected with the objective of selecting genotypes and accessions that best represent the entire population or gene collection with the minimum loss of genetic diversity. Therefore, the best numerical classification strategy is the one that produces the most compact and well-separated groups, that is, minimum variability within each group and maximum variability among groups. In this study, we review geometric classification techniques as well as statistical models based on mixed distribution models. The two-stage sequential clustering strategy uses all variables, continuous and categorical, and it tends to form more homogeneous groups of individuals than other clustering strategies. The sequential clustering strategy can be applied to three-way data comprising genotypes × environments × attributes. This approach groups genotypes with consistent responses for most of the continuous and categorical traits across environments.
Pages: 19-37
Number of pages: 19