The Dawn of Open Access to Phylogenetic Data

被引:34
作者
Magee, Andrew F. [1 ]
May, Michael R. [1 ]
Moore, Brian R. [1 ]
机构
[1] Univ Calif Davis, Dept Ecol & Evolut, Davis, CA 95616 USA
来源
PLOS ONE | 2014年 / 9卷 / 10期
关键词
INFORMATION; AVAILABILITY; REUSE; TREE; NEED;
D O I
10.1371/journal.pone.0110268
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The scientific enterprise depends critically on the preservation of and open access to published data. This basic tenet applies acutely to phylogenies (estimates of evolutionary relationships among species). Increasingly, phylogenies are estimated from increasingly large, genome-scale datasets using increasingly complex statistical methods that require increasing levels of expertise and computational investment. Moreover, the resulting phylogenetic data provide an explicit historical perspective that critically informs research in a vast and growing number of scientific disciplines. One such use is the study of changes in rates of lineage diversification (speciation - extinction) through time. As part of a meta-analysis in this area, we sought to collect phylogenetic data (comprising nucleotide sequence alignment and tree files) from 217 studies published in 46 journals over a 13-year period. We document our attempts to procure those data (from online archives and by direct request to corresponding authors), and report results of analyses (using Bayesian logistic regression) to assess the impact of various factors on the success of our efforts. Overall, complete phylogenetic data for similar to 60% of these studies are effectively lost to science. Our study indicates that phylogenetic data are more likely to be deposited in online archives and/or shared upon request when: (1) the publishing journal has a strong data-sharing policy; (2) the publishing journal has a higher impact factor, and; (3) the data are requested from faculty rather than students. Importantly, our survey spans recent policy initiatives and infrastructural changes; our analyses indicate that the positive impact of these community initiatives has been both dramatic and immediate. Although the results of our study indicate that the situation is dire, our findings also reveal tremendous recent progress in the sharing and preservation of phylogenetic data.
引用
收藏
页数:10
相关论文
共 53 条
  • [1] Public Availability of Published Research Data in High-Impact Journals
    Alsheikh-Ali, Alawi A.
    Qureshi, Waqas
    Al-Mallah, Mouaz H.
    Ioannidis, John P. A.
    [J]. PLOS ONE, 2011, 6 (09):
  • [2] A fair share
    不详
    [J]. NATURE, 2006, 444 (7120) : 653 - 654
  • [3] Class of Multiple Sequence Alignment Algorithm Affects Genomic Analysis
    Blackburne, Benjamin P.
    Whelan, Simon
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2013, 30 (03) : 642 - 653
  • [4] General methods for monitoring convergence of iterative simulations
    Brooks, SP
    Gelman, A
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1998, 7 (04) : 434 - 455
  • [5] PRIVATE ARCHIVES AND PUBLIC NEEDS
    CECI, SJ
    WALKER, E
    [J]. AMERICAN PSYCHOLOGIST, 1983, 38 (04) : 414 - 423
  • [6] Best Practices for Data Sharing in Phylogenetic Research
    Cranston, Karen
    Harmon, Luke J.
    O'Leary, Maureen A.
    Lisle, Curtis
    [J]. PLOS CURRENTS-TREE OF LIFE, 2014,
  • [7] A new age of discovery
    Donoghue, MJ
    Alverson, WS
    [J]. ANNALS OF THE MISSOURI BOTANICAL GARDEN, 2000, 87 (01) : 110 - 126
  • [8] Lost Branches on the Tree of Life
    Drew, Bryan T.
    Gazis, Romina
    Cabezas, Patricia
    Swithers, Kristen S.
    Deng, Jiabin
    Rodriguez, Roseana
    Katz, Laura A.
    Crandall, Keith A.
    Hibbett, David S.
    Soltis, Douglas E.
    [J]. PLOS BIOLOGY, 2013, 11 (09)
  • [9] Missing data mean holes in tree of life
    Drew, Bryan T.
    [J]. NATURE, 2013, 493 (7432) : 305 - 305
  • [10] Drummond AJ, 2005, MOL BIOL EVOL, V22, P1185, DOI [10.1093/molbev/msi103, 10.1093/molbev/mss075]