Predicting habitat suitability with machine learning models:: The potential area of Pinus sylvestris L. in the Iberian Peninsula

被引:145
作者
Benito Garzon, Marta
Blazek, Radim
Neteler, Markus
Sanchez de Dios, Rut
Sainz Ollero, Helios
Furlanello, Cesare
机构
[1] Univ Autonoma Madrid, Dept Biol, Bot Unit, E-28049 Madrid, Spain
[2] ITC Irst, Predict Models Biol & Environm Data Anal, I-38050 Trento, Italy
关键词
machine learning; random forest; neural networks; classification and regression trees; AUC; kappa; Iberian Peninsula; Pinus sylvestris L; habitat suitability;
D O I
10.1016/j.ecolmodel.2006.03.015
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
We present a modelling framework for predicting forest areas. The framework is obtained by integrating a machine learning software suite within the GRASS Geographical Information System (GIS) and by providing additional methods for predictive habitat modelling. Three machine learning techniques (Tree-Based Classification, Neural Networks and Random Forest) are available in parallel for modelling from climatic and topographic variables. Model evaluation and parameter selection are measured by sensitivity-specificity ROC analysis, while the final presence and absence maps are obtained through maximisation of the kappa statistic. The modelling framework is applied at a resolution of 1km. with Iberian subpopulations of Pinus sylvestris L. forests. For this data set, the most accurate algorithm is Breiman's random forest, an ensemble method which provides automatic combination of tree-classifiers trained on bootstrapped subsamples and randomised variable sets. All models show a potential area of P.syluestris for the Iberian Peninsula which is larger than the present one, a result corroborated by regional pollen analyses. (c) 2006 Elsevier B.V All rights reserved.
引用
收藏
页码:383 / 393
页数:11
相关论文
共 82 条
  • [1] Evaluating predictive models of species' distributions: criteria for selecting optimal models
    Anderson, RP
    Lew, D
    Peterson, AT
    [J]. ECOLOGICAL MODELLING, 2003, 162 (03) : 211 - 232
  • [2] Anton MG, 1997, J BIOGEOGR, V24, P929, DOI 10.1046/j.1365-2699.1997.00181.x
  • [3] Exploring spatial vegetation dynamics using logistic regression and a multinomial logit model
    Augustin, NH
    Cummins, RP
    French, DD
    [J]. JOURNAL OF APPLIED ECOLOGY, 2001, 38 (05) : 991 - 1006
  • [4] Assessing effects of forecasted climate change on the diversity and distribution of European higher plants for 2050
    Bakkenes, M
    Alkemade, JRM
    Ihle, F
    Leemans, R
    Latour, JB
    [J]. GLOBAL CHANGE BIOLOGY, 2002, 8 (04) : 390 - 407
  • [5] Predicting species distributions: use of climatic parameters in BIOCLIM and its impact on predictions of species' current and future distributions
    Beaumont, LJ
    Hughes, L
    Poulsen, M
    [J]. ECOLOGICAL MODELLING, 2005, 186 (02) : 250 - 269
  • [6] BISHP C, 1995, NEURAL NETWORKS PATT
  • [7] Using the R statistical data analysis language on GRASS 5.0 GIS database files
    Bivand, RS
    [J]. COMPUTERS & GEOSCIENCES, 2000, 26 (9-10) : 1043 - 1052
  • [8] BIVAND RS, 2000, P 5 C GEOC U GREENW
  • [9] BIVAND RS, 2004, GRASS INTERFACE GRAS
  • [10] SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation
    Blewitt, Marnie E.
    Gendrel, Anne-Valerie
    Pang, Zhenyi
    Sparrow, Duncan B.
    Whitelaw, Nadia
    Craig, Jeffrey M.
    Apedaile, Anwyn
    Hilton, Douglas J.
    Dunwoodie, Sally L.
    Brockdorff, Neil
    Kay, Graham F.
    Whitelaw, Emma
    [J]. NATURE GENETICS, 2008, 40 (05) : 663 - 669