THE IMPACT OF MORPHOLOGICAL VARIANTS ON A CLADISTIC HYPOTHESIS WITH AN EXAMPLE FROM A MYOLOGICAL DATA SET

被引:10
|
作者
KESNER, MH
机构
关键词
VARIANTS; MYOLOGY; CLADISTIC CHARACTERS; CLADISTIC METHODS; SAMPLE SIZE;
D O I
10.2307/2413580
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The larger the sample size per operational taxonomic unit (OTU), the higher the probability of identifying the correct character state for that OTU. Large numbers of characters are essential to assure the accuracy and stability of a phylogenetic reconstruction. Both large sample sizes and large numbers of characters are desirable, but given limited time and resources, cladistic analysis often involves a trade-off between the number of specimens one can study per OTU versus the number of characters one has time to discover. For a given data set, what balance of sample size and character number will yield an acceptably small error (the probability that a recognized clade is false) and yet achieve sufficient power (the probability of recognition of a valid clade)? Myological data compiled for a study of arvicoline rodents were manipulated to determine the effect of character number reduction, the effect of deliberate inclusion of incorrect character states (mistakes), and the interaction of these two effects in an attempt to provide one answer to this question. The results indicate that with sample sizes of four specimens per OTU (some of which were at supraspecific levels), the frequency of mistakes is sufficient to cause unacceptable increases in error and losses of power in resulting cladograms. However, the error and power of the resulting cladogram are more sensitive to reductions in character number than to the inclusion of an occasional incorrect character state. Beyond a minimal increase in sample size (to five specimens per OTU in this example), it is more advantageous to search for new characters than it is to increase the number of specimens surveyed. Because adequate sample size is a function of the taxonomic level of the study and the data structure of the characters, these specific results should not be broadly applied, but the methods used to determine the appropriate sample size for the arvicoline data are applicable to other studies. We must devote more effort to increasing the reliability of our data sets to assure highly reliable cladograms.
引用
收藏
页码:41 / 57
页数:17
相关论文
共 49 条
  • [31] Impact of Environmental Protection Regulations on Corporate Performance From Porter Hypothesis Perspective: A Study Based on Publicly Listed Manufacturing Firms Data
    Mu, Shaohong
    Wang, Xianglu
    Mohiuddin, Muhammad
    FRONTIERS IN ENVIRONMENTAL SCIENCE, 2022, 10
  • [32] Prorocentrum rivalis sp. nov. (Dinophyceae) and its phylogenetic affinities inferred from analysis of a mixed morphological and LSU rRNA data set
    David Delmail
    Pascal Labrousse
    Philippe Crassous
    Philippe Hourdin
    Mathieu Guri
    Michel Botineau
    Biologia, 2011, 66 : 418 - 424
  • [33] Impact of the Irish smoking ban on sales in bars using a large business-level data set from 1999 to 2007
    Cornelsen, Laura
    Normand, Charles
    TOBACCO CONTROL, 2014, 23 (05) : 443 - 448
  • [34] Inference of disease associations with unmeasured genetic variants by combining results from genome-wide association studies with linkage disequilibrium patterns in a reference data set
    David Hadley
    David P Strachan
    BMC Proceedings, 3 (Suppl 7)
  • [35] A reference data set of 5.4 million phased human variants validated by genetic inheritance from sequencing a three-generation 17-member pedigree
    Eberle, Michael A.
    Fritzilas, Epameinondas
    Krusche, Peter
    Kallberg, Morten
    Moore, Benjamin L.
    Bekritsky, Mitchell A.
    Iqbal, Zamin
    Chuang, Han-Yu
    Humphray, Sean J.
    Halpern, Aaron L.
    Kruglyak, Semyon
    Margulies, Elliott H.
    McVean, Gil
    Bentley, David R.
    GENOME RESEARCH, 2017, 27 (01) : 157 - 164
  • [36] Combined impact of hypoalbuminemia and pharmacogenomic variants on voriconazole trough concentration: data from a real-life clinical setting in the Chinese population
    Li, Yuanyuan
    Zhang, Ying
    Zhao, Jinxia
    Bian, Jialu
    Zhao, Yinyu
    Hao, Xu
    Liu, Boyu
    Hu, Lei
    Liu, Fang
    Yang, Changqing
    Feng, Yufei
    Huang, Lin
    JOURNAL OF CHEMOTHERAPY, 2024, 36 (03) : 179 - 189
  • [37] Impact of the Improper Adjustment for Age in Research on Age-Related Macular Degeneration: An Example Using Data from the Canadian Longitudinal Study on Aging
    Grant, Alyssa
    Colman, Ian
    Freeman, Ellen E.
    OPHTHALMIC EPIDEMIOLOGY, 2021, 28 (01) : 86 - 89
  • [38] Response to "MHC-dependent mate choice in humans: Why genomic patterns from the HapMap European American data set support the hypothesis" (DOI: 10.1002/bies.201100150)
    Derti, Adnan
    Roth, Frederick P.
    BIOESSAYS, 2012, 34 (07) : 576 - 577
  • [39] Eclipse Impact on a Remote Sensing Data Set: PAL NDVI 10-Day Composite from February 11 to 20 in 1999 for Western Australia
    Lim, Chai K.
    REMOTE SENSING, 2010, 2 (08): : 1962 - 1972
  • [40] Physics-informed deep learning for scattered full wavefield reconstruction from a sparse set of sensor data for impact diagnosis in structural health monitoring
    Zargar, Sakib Ashraf
    Yuan, Fuh-Gwo
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2024, 23 (05): : 2963 - 2979