ERROR-WEIGHTED MAXIMUM-LIKELIHOOD (EWML) - A NEW STATISTICALLY BASED METHOD TO CLUSTER QUANTITATIVE MICROPALEONTOLOGICAL DATA

被引:87
作者
FISHBEIN, E
PATTERSON, RT
机构
[1] CARLETON UNIV,OTTAWA CARLETON GEOSCI CTR,OTTAWA K1S 5B6,ONTARIO,CANADA
[2] CARLETON UNIV,DEPT EARTH SCI,OTTAWA K1S 5B6,ONTARIO,CANADA
关键词
D O I
10.1017/S0022336000036921
中图分类号
Q91 [古生物学];
学科分类号
0709 ; 070903 ;
摘要
The advent of readily available computer-based clustering packages has created some controversy in the micropaleontological community concerning the use and interpretation of computer-based biofacies discrimination. This is because dramatically different results can be obtained depending on methodology. The analysis of various clustering techniques reveals that, in most instances, no statistical hypothesis is contained in the clustering model and no basis exists for accepting one biofacies partitioning over another. Furthermore, most techniques do not consider standard error in species abundances and generate results that are not statistically relevant. When many rare species are present, statistically insignificant differences in rare species can accumulate and overshadow the significant differences in the major species, leading to biofacies containing members having little in common. A statistically based ''error-weighted maximum likelihood'' (EWML) clustering method is described that determines biofacies by assuming that samples from a common biofacies are normally distributed. Species variability is weighted to be inversely proportional to measurement uncertainty. The method has been applied to samples collected from the Fraser River Delta marsh and shows that five distinct biofacies can be resolved in the data. Similar results were obtained from readily available packages when the data set was preprocessed to reduce the number of degrees of freedom. Based on the sample results from the new algorithm, and on tests using a representative micropaleontological data set, a more conventional iterative processing method is recommended. This method, although not statistical in nature, produces similar results to EWML (not commercially available yet) with readily available analysis packages. Finally, some of the more common clustering techniques are discussed and strategies for their proper utilization are recommended.
引用
收藏
页码:475 / 486
页数:12
相关论文
共 35 条
[1]  
Abramowtiz M., 1972, HDB MATH FUNCTIONS F
[2]  
ANDERBERG MR, 1973, CLUSTER ANAL APPLICA
[3]  
Andersen H. V., 1953, CONTRIBUTIONS CUSHMA, V4, P20
[4]   LATE CENOZOIC BENTHONIC FORAMINIFERA OF NINETY EAST RIDGE (INDIAN-OCEAN) [J].
BOLTOVSKOY, E .
MARINE GEOLOGY, 1978, 26 (1-2) :139-175
[5]  
Brady H. B., 1870, ANN MAGAZINE NATUR 4, V4, P273
[6]  
BUZAS AM, 1979, SEPM SHORT COURSE, V6, P11
[7]  
BUZAS M. A., 1970, P N AM PALEONTOLOG B, P101
[8]   ANOTHER LOOK AT CONFIDENCE-LIMITS FOR SPECIES PROPORTIONS [J].
BUZAS, MA .
JOURNAL OF PALEONTOLOGY, 1990, 64 (05) :842-843
[9]  
COLE WS, 1931, B FLORIDA STATE GEOL, V6, P7
[10]   ASSESSING SIMILARITY BETWEEN PROFILES [J].
CRONBACH, LJ ;
GLESER, GC .
PSYCHOLOGICAL BULLETIN, 1953, 50 (06) :456-473