THE IMPACT OF MORPHOLOGICAL VARIANTS ON A CLADISTIC HYPOTHESIS WITH AN EXAMPLE FROM A MYOLOGICAL DATA SET

Cited by: 10

Authors:

KESNER, MH

Source:

SYSTEMATIC BIOLOGY | 1994 / Vol. 43 / No. 1

Keywords:

VARIANTS; MYOLOGY; CLADISTIC CHARACTERS; CLADISTIC METHODS; SAMPLE SIZE;

DOI:

10.2307/2413580

Chinese Library Classification:

Q [Biological Sciences];

Discipline classification code:

07 ; 0710 ; 09 ;

Abstract:

The larger the sample size per operational taxonomic unit (OTU), the higher the probability of identifying the correct character state for that OTU. Large numbers of characters are essential to assure the accuracy and stability of a phylogenetic reconstruction. Both large sample sizes and large numbers of characters are desirable, but given limited time and resources, cladistic analysis often involves a trade-off between the number of specimens one can study per OTU versus the number of characters one has time to discover. For a given data set, what balance of sample size and character number will yield an acceptably small error (the probability that a recognized clade is false) and yet achieve sufficient power (the probability of recognition of a valid clade)? Myological data compiled for a study of arvicoline rodents were manipulated to determine the effect of character number reduction, the effect of deliberate inclusion of incorrect character states (mistakes), and the interaction of these two effects in an attempt to provide one answer to this question. The results indicate that with sample sizes of four specimens per OTU (some of which were at supraspecific levels), the frequency of mistakes is sufficient to cause unacceptable increases in error and losses of power in resulting cladograms. However, the error and power of the resulting cladogram are more sensitive to reductions in character number than to the inclusion of an occasional incorrect character state. Beyond a minimal increase in sample size (to five specimens per OTU in this example), it is more advantageous to search for new characters than it is to increase the number of specimens surveyed. Because adequate sample size is a function of the taxonomic level of the study and the data structure of the characters, these specific results should not be broadly applied, but the methods used to determine the appropriate sample size for the arvicoline data are applicable to other studies. 
We must devote more effort to increasing the reliability of our data sets to assure highly reliable cladograms.
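The abstract's opening claim, that larger samples per OTU raise the probability of scoring the correct character state, can be illustrated with a toy calculation. The sketch below is not the method used in the paper: it assumes a simple binomial model in which each specimen independently shows a variant (incorrect) state with probability `p_variant`, and the OTU is scored by majority rule with ties broken at random. The function name and the 20% variant frequency are hypothetical choices made only for illustration.

```python
from math import comb


def prob_correct_state(n: int, p_variant: float) -> float:
    """Probability that a majority-rule score from n specimens matches the modal state.

    Assumption (not from the paper): specimens are independent, and each shows
    a variant state with probability p_variant. Ties are broken at random.
    """
    q = 1.0 - p_variant  # chance that a single specimen shows the modal state
    total = 0.0
    for k in range(n + 1):  # k = number of specimens showing the modal state
        pk = comb(n, k) * q**k * p_variant ** (n - k)
        if 2 * k > n:        # clear majority shows the modal state
            total += pk
        elif 2 * k == n:     # tie: half credit for a random tie-break
            total += 0.5 * pk
    return total


if __name__ == "__main__":
    # Hypothetical variant frequency of 20% per specimen.
    for n in (1, 3, 4, 5, 10):
        print(n, round(prob_correct_state(n, 0.2), 4))
```

Under these assumptions, four specimens do no better than three, while a fifth specimen gives a real gain. This is a property of the toy majority-rule model and should not be read as a reproduction of the paper's results.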

Pages: 41-57

Page count: 17
