On goodness-of-fit measure for dendrogram-based analyses

被引:52
作者
Merigot, Bastien [1 ]
Durbec, Jean-Pierre [1 ]
Gaertner, Jean-Claude [1 ]
机构
[1] Univ Aix Marseille 2, Ctr Oceanol Marseille, CNRS, LMGEM,UMR 6117, FR-13009 Marseille, France
关键词
clustering; dendrogram; functional classification; functional diversity; matrix norm; FUNCTIONAL-DIVERSITY INDEXES; COPHENETIC CORRELATION; COMMUNITY STRUCTURE; GENETIC DIVERSITY; SPECIES RICHNESS; VARIABILITY; VARIABLES; MATRICES; SOIL;
D O I
10.1890/09-1387.1
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Clustering methods are widely used tools in many aspects of science, such as ecology, medicine, or even market research, that commonly deal with dendrogram-based analyses. In such analyses, for a given initial dissimilarity matrix, the resulting dendrogram may strongly vary according to the selected clustering methods. However, numerous dendrogram-based analyses require adequate measurement for assessing of which of the clustering methods preserves most faithfully the initial dissimilarity matrix. While cophenetic correlation coefficient-based measures have been widely used for this purpose, we emphasize here that it is not always a suitable approach. We thus propose a measure based on a matrix norm, the 2-norm, to adequately check which of the resulting ultrametric distance matrices related to the dendrograms is the closest to the initial dissimilarity matrix. In addition, we also propose an objective way to define a benchmark value (threshold value) in order to assess whether the degree of conformity between the ultrametric distance matrix selected and the initial dissimilarity matrix is satisfactory. Our proposal may notably be incorporated within a recently proposed approach that involves the use of clustering methods in environmental science and beyond. In ecology, various functional diversity indices based on clustering species from their functional dissimilarities may benefit from this overall approach.
引用
收藏
页码:1850 / 1859
页数:10
相关论文
共 39 条
[1]  
Achlioptas D, 2004, LECT NOTES ARTIF INT, V3202, P1
[2]  
[Anonymous], 2000, Principles of multivariate analysis
[3]   Functional diversity of mammalian predators and extinction in island birds [J].
Blackburn, TM ;
Petchey, OL ;
Cassey, P ;
Gaston, KJ .
ECOLOGY, 2005, 86 (11) :2916-2923
[4]   A comparison of clustering methods for river benthic community analysis [J].
Yong Cao ;
, Anthony W. Bark ;
W. Peter Williams .
Hydrobiologia, 1997, 347 (1-3) :24-40
[5]   Including intraspecific variability in functional diversity [J].
Cianciaruso, M. V. ;
Batalha, M. A. ;
Gaston, K. J. ;
Petchey, O. L. .
ECOLOGY, 2009, 90 (01) :81-89
[6]   TREATMENT OF VECTOR VARIABLES [J].
ESCOUFIER, Y .
BIOMETRICS, 1973, 29 (04) :751-760
[7]  
Everitt B. S., 2001, CLUSTER ANAL
[8]   ON COPHENETIC CORRELATION COEFFICIENT [J].
FARRIS, JS .
SYSTEMATIC ZOOLOGY, 1969, 18 (03) :279-&
[9]   Loss of functional diversity under land use intensification across multiple taxa [J].
Flynn, Dan F. B. ;
Gogol-Prokurat, Melanie ;
Nogeire, Theresa ;
Molinari, Nicole ;
Richers, Barbara Trautman ;
Lin, Brenda B. ;
Simpson, Nicholas ;
Mayfield, Margaret M. ;
DeClerck, Fabrice .
ECOLOGY LETTERS, 2009, 12 (01) :22-33
[10]  
Golub G. H., 1996, MATRIX COMPUTATIONS