THE IMPACT OF MORPHOLOGICAL VARIANTS ON A CLADISTIC HYPOTHESIS WITH AN EXAMPLE FROM A MYOLOGICAL DATA SET

被引:10
|
作者
KESNER, MH
机构
关键词
VARIANTS; MYOLOGY; CLADISTIC CHARACTERS; CLADISTIC METHODS; SAMPLE SIZE;
D O I
10.2307/2413580
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The larger the sample size per operational taxonomic unit (OTU), the higher the probability of identifying the correct character state for that OTU. Large numbers of characters are essential to assure the accuracy and stability of a phylogenetic reconstruction. Both large sample sizes and large numbers of characters are desirable, but given limited time and resources, cladistic analysis often involves a trade-off between the number of specimens one can study per OTU versus the number of characters one has time to discover. For a given data set, what balance of sample size and character number will yield an acceptably small error (the probability that a recognized clade is false) and yet achieve sufficient power (the probability of recognition of a valid clade)? Myological data compiled for a study of arvicoline rodents were manipulated to determine the effect of character number reduction, the effect of deliberate inclusion of incorrect character states (mistakes), and the interaction of these two effects in an attempt to provide one answer to this question. The results indicate that with sample sizes of four specimens per OTU (some of which were at supraspecific levels), the frequency of mistakes is sufficient to cause unacceptable increases in error and losses of power in resulting cladograms. However, the error and power of the resulting cladogram are more sensitive to reductions in character number than to the inclusion of an occasional incorrect character state. Beyond a minimal increase in sample size (to five specimens per OTU in this example), it is more advantageous to search for new characters than it is to increase the number of specimens surveyed. Because adequate sample size is a function of the taxonomic level of the study and the data structure of the characters, these specific results should not be broadly applied, but the methods used to determine the appropriate sample size for the arvicoline data are applicable to other studies. We must devote more effort to increasing the reliability of our data sets to assure highly reliable cladograms.
引用
收藏
页码:41 / 57
页数:17
相关论文
共 49 条
  • [21] Multimodal distribution and its impact on the accurate assessment of spermatozoa morphological data: Lessons from machine learning
    Stefanovski, D.
    Schulze, M.
    Althouse, G. C.
    ANIMAL REPRODUCTION SCIENCE, 2024, 269
  • [22] Soft data, hard effects. Strategies for effective policy on health impact assessment - an example from the Netherlands
    den Broeder, L
    Penris, M
    Put, GV
    BULLETIN OF THE WORLD HEALTH ORGANIZATION, 2003, 81 (06) : 404 - 407
  • [23] Invasive breast cancer: stratification of histological grade by gene-based assays: a still relevant example from an older data set
    Dalton, Leslie
    HISTOPATHOLOGY, 2014, 65 (03) : 429 - 433
  • [24] Developing a pooled job physical exposure data set from multiple independent studies: an example of a consortium study of carpal tunnel syndrome
    Bao, Stephen S.
    Kapellusch, Jay M.
    Garg, Arun
    Silverstein, Barbara A.
    Harris-Adamson, Carisa
    Burt, Susan E.
    Dale, Ann Marie
    Evanoff, Bradley A.
    Gerr, Frederic E.
    Hegmann, Kurt T.
    Merlino, Linda A.
    Thiese, Matthew S.
    Rempel, David M.
    OCCUPATIONAL AND ENVIRONMENTAL MEDICINE, 2015, 72 (02) : 130 - 137
  • [25] Does mortgage lending impact business credit? Evidence from a new disaggregated bank credit data set
    Bezemer, Dirk
    Samarina, Anna
    Zhang, Lu
    JOURNAL OF BANKING & FINANCE, 2020, 113
  • [26] Inferring Taxonomic Affinities and Genetic Distances Using Morphological Features Extracted from Specimen Images: A Case Study with a Bivalve Data Set
    Hofmann, Martin
    Kiel, Steffen
    Koesters, Lara M.
    Waeldchen, Jana
    Maeder, Patrick
    SYSTEMATIC BIOLOGY, 2024, 73 (06) : 920 - 940
  • [27] Prorocentrum rivalis sp nov (Dinophyceae) and its phylogenetic affinities inferred from analysis of a mixed morphological and LSU rRNA data set
    Delmail, David
    Labrousse, Pascal
    Crassous, Philippe
    Hourdin, Philippe
    Guri, Mathieu
    Botineau, Michel
    BIOLOGIA, 2011, 66 (03) : 418 - 424
  • [28] A scolopocryptopid centipede (Chilopoda: Scolopendromorpha) from Mexican amber: synchrotron microtomography and phylogenetic placement using a combined morphological and molecular data set
    Edgecombe, Gregory D.
    Vahtera, Varpu
    Stock, Stuart R.
    Kallonen, Aki
    Xiao, Xianghui
    Rack, Alexander
    Giribet, Gonzalo
    ZOOLOGICAL JOURNAL OF THE LINNEAN SOCIETY, 2012, 166 (04) : 768 - 786
  • [29] A data set of variants derived from 1455 clinical and research exomes is efficient in variant prioritization for early-onset monogenic disorders in Indians
    Kausthubham, Neethukrishna
    Shukla, Anju
    Gupta, Neerja
    Bhavani, Gandham S.
    Kulshrestha, Samarth
    Bhowmik, Aneek Das
    Moirangthem, Amita
    Bijarnia-Mahay, Sunita
    Kabra, Madhulika
    Puri, Ratna D.
    Mandal, Kausik
    Verma, Ishwar C.
    Bielas, Stephanie L.
    Phadke, Shubha R.
    Dalal, Ashwin
    Girisha, Katta M.
    HUMAN MUTATION, 2021, 42 (04) : E15 - E61
  • [30] IMPACT OF DATA MATURITY ON THE ESTIMATION OF THE WITHIN-TRIAL HAZARD FUNCTION: AN EXAMPLE FROM METASTATIC CASTRATION RESISTANT PROSTATE CANCER
    Williams, J.
    Hettle, R.
    Haugli-Stephens, T.
    VALUE IN HEALTH, 2023, 26 (12) : S425 - S426