Estimating species diversity and distribution in the era of Big Data: to what extent can we trust public databases?

被引:272
作者
Maldonado, Carla [1 ,2 ]
Molina, Carlos I. [1 ,3 ]
Zizka, Alexander [4 ]
Persson, Claes [4 ]
Taylor, Charlotte M. [5 ]
Alban, Joaquina [6 ]
Chilquillo, Eder [6 ,7 ]
Ronsted, Nina [1 ]
Antonelli, Alexandre [4 ,8 ]
机构
[1] Univ Copenhagen, Nat Hist Museum Denmark, DK-1307 Copenhagen K, Denmark
[2] Univ Mayor San Andres, Herbario Nacl Bolivia, La Paz, Bolivia
[3] Univ Mayor San Andres, Inst Ecol, La Paz, Bolivia
[4] Univ Gothenburg, Dept Biol & Environm Sci, Gothenburg, Sweden
[5] Univ Missouri St Louis, Missouri Bot Garden, St Louis, MO USA
[6] Univ Nacl Mayor San Marcos, Museo Hist Nat, Lima 14, Peru
[7] Univ Estadual Campinas, Dept Biol Vegetal, Sao Paulo, Brazil
[8] Gothenburg Bot Garden, Gothenburg, Sweden
来源
GLOBAL ECOLOGY AND BIOGEOGRAPHY | 2015年 / 24卷 / 08期
基金
瑞典研究理事会; 欧洲研究理事会;
关键词
Cinchoneae; data quality; GBIF; occurrence data; Rubiaceae; species richness; SpeciesGeoCoder; BIODIVERSITY INFORMATICS; PLANT DIVERSITY; RICHNESS; PATTERNS; LIMITATIONS; ECOREGIONS; GENUS; MAP;
D O I
10.1111/geb.12326
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
AimMassive digitalization of natural history collections is now leading to a steep accumulation of publicly available species distribution data. However, taxonomic errors and geographical uncertainty of species occurrence records are now acknowledged by the scientific community - putting into question to what extent such data can be used to unveil correct patterns of biodiversity and distribution. We explore this question through quantitative and qualitative analyses of uncleaned versus manually verified datasets of species distribution records across different spatial scales. LocationThe American tropics. MethodsAs test case we used the plant tribe Cinchoneae (Rubiaceae). We compiled four datasets of species occurrences: one created manually and verified through classical taxonomic work, and the rest derived from GBIF under different cleaning and filling schemes. We used new bioinformatic tools to code species into grids, ecoregions, and biomes following WWF's classification. We analysed species richness and altitudinal ranges of the species. ResultsAltitudinal ranges for species and genera were correctly inferred even without manual data cleaning and filling. However, erroneous records affected spatial patterns of species richness. They led to an overestimation of species richness in certain areas outside the centres of diversity in the clade. The location of many of these areas comprised the geographical midpoint of countries and political subdivisions, assigned long after the specimens had been collected. Main conclusionOpen databases and integrative bioinformatic tools allow a rapid approximation of large-scale patterns of biodiversity across space and altitudinal ranges. We found that geographic inaccuracy affects diversity patterns more than taxonomic uncertainties, often leading to false positives, i.e. overestimating species richness in relatively species poor regions. Public databases for species distribution are valuable and should be more explored, but under scrutiny and validation by taxonomic experts. We suggest that database managers implement easy ways of community feedback on data quality.
引用
收藏
页码:973 / 984
页数:12
相关论文
共 49 条
[1]   Phylogeny of the tribe Cinchoneae (Rubiaceae), its position in Cinchonoideae, and description of a new genus, Ciliosemina [J].
Andersson, L ;
Antonelli, A .
TAXON, 2005, 54 (01) :17-28
[2]   TRIBES AND GENERA OF THE CINCHONEAE COMPLEX (RUBIACEAE) [J].
ANDERSSON, L .
ANNALS OF THE MISSOURI BOTANICAL GARDEN, 1995, 82 (03) :409-427
[3]  
[Anonymous], J BIOGEOGR
[4]  
[Anonymous], 2005, PRINCIPLES METHODS D
[5]  
[Anonymous], 1999, MONOGR SYST BOT MISS
[6]   Tracing the impact of the Andean uplift on Neotropical plant evolution [J].
Antonelli, Alexandre ;
Nylander, Johan A. A. ;
Persson, Claes ;
Sanmartin, Isabel .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (24) :9749-9754
[7]   Spatial bias in the GBIF database and its effect on modeling species' geographic distributions [J].
Beck, Jan ;
Boeller, Marianne ;
Erhardt, Andreas ;
Schwanghart, Wolfgang .
ECOLOGICAL INFORMATICS, 2014, 19 :10-15
[8]   What's on the horizon for macroecology? [J].
Beck, Jan ;
Ballesteros-Mejia, Liliana ;
Buchmann, Carsten M. ;
Dengler, Juergen ;
Fritz, Susanne A. ;
Gruber, Bernd ;
Hof, Christian ;
Jansen, Florian ;
Knapp, Sonja ;
Kreft, Holger ;
Schneider, Anne-Kathrin ;
Winter, Marten ;
Dormann, Carsten F. .
ECOGRAPHY, 2012, 35 (08) :673-683
[9]  
Burgess N., 2004, TERRESTRIAL ECOREGIO
[10]   Factoring species, non-species values and threats into biodiversity prioritisation across the ecoregions of Africa and its islands [J].
Burgess, ND ;
Hales, JD ;
Ricketts, TH ;
Dinerstein, E .
BIOLOGICAL CONSERVATION, 2006, 127 (04) :383-401