Estimating species diversity and distribution in the era of Big Data: to what extent can we trust public databases?

被引:281
作者
Maldonado, Carla [1 ,2 ]
Molina, Carlos I. [1 ,3 ]
Zizka, Alexander [4 ]
Persson, Claes [4 ]
Taylor, Charlotte M. [5 ]
Alban, Joaquina [6 ]
Chilquillo, Eder [6 ,7 ]
Ronsted, Nina [1 ]
Antonelli, Alexandre [4 ,8 ]
机构
[1] Univ Copenhagen, Nat Hist Museum Denmark, DK-1307 Copenhagen K, Denmark
[2] Univ Mayor San Andres, Herbario Nacl Bolivia, La Paz, Bolivia
[3] Univ Mayor San Andres, Inst Ecol, La Paz, Bolivia
[4] Univ Gothenburg, Dept Biol & Environm Sci, Gothenburg, Sweden
[5] Univ Missouri St Louis, Missouri Bot Garden, St Louis, MO USA
[6] Univ Nacl Mayor San Marcos, Museo Hist Nat, Lima 14, Peru
[7] Univ Estadual Campinas, Dept Biol Vegetal, Sao Paulo, Brazil
[8] Gothenburg Bot Garden, Gothenburg, Sweden
来源
GLOBAL ECOLOGY AND BIOGEOGRAPHY | 2015年 / 24卷 / 08期
基金
欧洲研究理事会; 瑞典研究理事会;
关键词
Cinchoneae; data quality; GBIF; occurrence data; Rubiaceae; species richness; SpeciesGeoCoder; BIODIVERSITY INFORMATICS; PLANT DIVERSITY; RICHNESS; PATTERNS; LIMITATIONS; ECOREGIONS; GENUS; MAP;
D O I
10.1111/geb.12326
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
AimMassive digitalization of natural history collections is now leading to a steep accumulation of publicly available species distribution data. However, taxonomic errors and geographical uncertainty of species occurrence records are now acknowledged by the scientific community - putting into question to what extent such data can be used to unveil correct patterns of biodiversity and distribution. We explore this question through quantitative and qualitative analyses of uncleaned versus manually verified datasets of species distribution records across different spatial scales. LocationThe American tropics. MethodsAs test case we used the plant tribe Cinchoneae (Rubiaceae). We compiled four datasets of species occurrences: one created manually and verified through classical taxonomic work, and the rest derived from GBIF under different cleaning and filling schemes. We used new bioinformatic tools to code species into grids, ecoregions, and biomes following WWF's classification. We analysed species richness and altitudinal ranges of the species. ResultsAltitudinal ranges for species and genera were correctly inferred even without manual data cleaning and filling. However, erroneous records affected spatial patterns of species richness. They led to an overestimation of species richness in certain areas outside the centres of diversity in the clade. The location of many of these areas comprised the geographical midpoint of countries and political subdivisions, assigned long after the specimens had been collected. Main conclusionOpen databases and integrative bioinformatic tools allow a rapid approximation of large-scale patterns of biodiversity across space and altitudinal ranges. We found that geographic inaccuracy affects diversity patterns more than taxonomic uncertainties, often leading to false positives, i.e. overestimating species richness in relatively species poor regions. Public databases for species distribution are valuable and should be more explored, but under scrutiny and validation by taxonomic experts. We suggest that database managers implement easy ways of community feedback on data quality.
引用
收藏
页码:973 / 984
页数:12
相关论文
共 49 条
[41]   Bringing collections out of the dark [J].
Smith, Vincent S. ;
Blagoderov, Vladimir .
ZOOKEYS, 2012, (209) :1-6
[42]   Biodiversity informatics:: managing and applying primary biodiversity data [J].
Soberón, J ;
Peterson, AT .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2004, 359 (1444) :689-698
[43]  
Topel M., 2014, BIORXIV009274
[44]   The point-radius method for georeferencing locality descriptions and calculating associated uncertainty [J].
Wieczorek, J ;
Guo, QG ;
Hijmans, RJ .
INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2004, 18 (08) :745-767
[45]  
Wikramanayake E.D., 2002, Terrestrial ecoregions of the Indo-Pacific: a conservation assessment, V3
[46]   A comparison of richness hotspots, rarity hotspots, and complementary areas for conserving diversity of British birds [J].
Williams, P ;
Gibbons, D ;
Margules, C ;
Rebelo, A ;
Humphries, C ;
Pressey, R .
CONSERVATION BIOLOGY, 1996, 10 (01) :155-174
[47]  
Winker K, 2004, BIOSCIENCE, V54, P455, DOI 10.1641/0006-3568(2004)054[0455:NHMIAP]2.0.CO
[48]  
2
[49]   How Global Is the Global Biodiversity Information Facility? [J].
Yesson, Chris ;
Brewer, Peter W. ;
Sutton, Tim ;
Caithness, Neil ;
Pahwa, Jaspreet S. ;
Burgess, Mikhaila ;
Gray, W. Alec ;
White, Richard J. ;
Jones, Andrew C. ;
Bisby, Frank A. ;
Culham, Alastair .
PLOS ONE, 2007, 2 (11)