THE IMPACT OF MORPHOLOGICAL VARIANTS ON A CLADISTIC HYPOTHESIS WITH AN EXAMPLE FROM A MYOLOGICAL DATA SET

被引：10

作者：

KESNER, MH

机构：

来源：

SYSTEMATIC BIOLOGY | 1994年 / 43卷 / 01期

关键词：

VARIANTS; MYOLOGY; CLADISTIC CHARACTERS; CLADISTIC METHODS; SAMPLE SIZE;

D O I：

10.2307/2413580

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

The larger the sample size per operational taxonomic unit (OTU), the higher the probability of identifying the correct character state for that OTU. Large numbers of characters are essential to assure the accuracy and stability of a phylogenetic reconstruction. Both large sample sizes and large numbers of characters are desirable, but given limited time and resources, cladistic analysis often involves a trade-off between the number of specimens one can study per OTU versus the number of characters one has time to discover. For a given data set, what balance of sample size and character number will yield an acceptably small error (the probability that a recognized clade is false) and yet achieve sufficient power (the probability of recognition of a valid clade)? Myological data compiled for a study of arvicoline rodents were manipulated to determine the effect of character number reduction, the effect of deliberate inclusion of incorrect character states (mistakes), and the interaction of these two effects in an attempt to provide one answer to this question. The results indicate that with sample sizes of four specimens per OTU (some of which were at supraspecific levels), the frequency of mistakes is sufficient to cause unacceptable increases in error and losses of power in resulting cladograms. However, the error and power of the resulting cladogram are more sensitive to reductions in character number than to the inclusion of an occasional incorrect character state. Beyond a minimal increase in sample size (to five specimens per OTU in this example), it is more advantageous to search for new characters than it is to increase the number of specimens surveyed. Because adequate sample size is a function of the taxonomic level of the study and the data structure of the characters, these specific results should not be broadly applied, but the methods used to determine the appropriate sample size for the arvicoline data are applicable to other studies. We must devote more effort to increasing the reliability of our data sets to assure highly reliable cladograms.

引用

页码：41 / 57

页数：17

共 49 条

[31] Impact of Environmental Protection Regulations on Corporate Performance From Porter Hypothesis Perspective: A Study Based on Publicly Listed Manufacturing Firms Data
Mu, Shaohong
Wang, Xianglu
Mohiuddin, Muhammad
FRONTIERS IN ENVIRONMENTAL SCIENCE, 2022, 10
[32] Prorocentrum rivalis sp. nov. (Dinophyceae) and its phylogenetic affinities inferred from analysis of a mixed morphological and LSU rRNA data set
David Delmail
Pascal Labrousse
Philippe Crassous
Philippe Hourdin
Mathieu Guri
Michel Botineau
Biologia, 2011, 66 : 418 - 424
[33] Impact of the Irish smoking ban on sales in bars using a large business-level data set from 1999 to 2007
Cornelsen, Laura
Normand, Charles
TOBACCO CONTROL, 2014, 23 (05) : 443 - 448
[34] Inference of disease associations with unmeasured genetic variants by combining results from genome-wide association studies with linkage disequilibrium patterns in a reference data set
David Hadley
David P Strachan
BMC Proceedings, 3 (Suppl 7)
[35] A reference data set of 5.4 million phased human variants validated by genetic inheritance from sequencing a three-generation 17-member pedigree
Eberle, Michael A.
Fritzilas, Epameinondas
Krusche, Peter
Kallberg, Morten
Moore, Benjamin L.
Bekritsky, Mitchell A.
Iqbal, Zamin
Chuang, Han-Yu
Humphray, Sean J.
Halpern, Aaron L.
Kruglyak, Semyon
Margulies, Elliott H.
McVean, Gil
Bentley, David R.
GENOME RESEARCH, 2017, 27 (01) : 157 - 164
[36] Combined impact of hypoalbuminemia and pharmacogenomic variants on voriconazole trough concentration: data from a real-life clinical setting in the Chinese population
Li, Yuanyuan
Zhang, Ying
Zhao, Jinxia
Bian, Jialu
Zhao, Yinyu
Hao, Xu
Liu, Boyu
Hu, Lei
Liu, Fang
Yang, Changqing
Feng, Yufei
Huang, Lin
JOURNAL OF CHEMOTHERAPY, 2024, 36 (03) : 179 - 189
[37] Impact of the Improper Adjustment for Age in Research on Age-Related Macular Degeneration: An Example Using Data from the Canadian Longitudinal Study on Aging
Grant, Alyssa
Colman, Ian
Freeman, Ellen E.
OPHTHALMIC EPIDEMIOLOGY, 2021, 28 (01) : 86 - 89
[38] Response to "MHC-dependent mate choice in humans: Why genomic patterns from the HapMap European American data set support the hypothesis" (DOI: 10.1002/bies.201100150)
Derti, Adnan
Roth, Frederick P.
BIOESSAYS, 2012, 34 (07) : 576 - 577
[39] Eclipse Impact on a Remote Sensing Data Set: PAL NDVI 10-Day Composite from February 11 to 20 in 1999 for Western Australia
Lim, Chai K.
REMOTE SENSING, 2010, 2 (08): : 1962 - 1972
[40] Physics-informed deep learning for scattered full wavefield reconstruction from a sparse set of sensor data for impact diagnosis in structural health monitoring
Zargar, Sakib Ashraf
Yuan, Fuh-Gwo
STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2024, 23 (05): : 2963 - 2979

← 1 2 3 4 5 →