Using single- and multi-target regression trees and ensembles to model a compound index of vegetation condition

被引:157
作者
Kocev, Dragi [1 ]
Dzeroski, Saso [1 ]
White, Matt D. [2 ]
Newell, Graeme R. [2 ]
Griffioen, Peter [3 ]
机构
[1] Jozef Stefan Inst, Dept Knowledge Technol, Ljubljana 1000, Slovenia
[2] Arthur Rylah Inst Environm Res, Dept Sustainabil & Environm, Heidelberg, Vic 3084, Australia
[3] Acromap Pty Ltd, Heidelberg, Vic 3084, Australia
关键词
Multi-target prediction; Ensemble methods; Regression trees; Indigenous vegetation; Vegetation quality/condition;
D O I
10.1016/j.ecolmodel.2009.01.037
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
An important consideration in conservation and biodiversity planning is an appreciation of the condition or integrity of ecosystems. In this study, we have applied various machine learning methods to the problem of predicting the condition or quality of the remnant indigenous vegetation across an extensive area of south-eastern Australia-the state of Victoria. The field data were obtained using the 'habitat hectares' approach. This rapid assessment technique produces multiple scores that describe the condition of various attributes of the vegetation at a given site. Multiple sites were assessed and subsequently circumscribed with GIS and remote-sensed data. We explore and compare two approaches for modelling this type of data: to learn a model for each score separately (single-target approach, a regression tree), or to learn one model for all scores simultaneously (multi-target approach, a multi-target regression tree). In order to lift the predictive performance, we also employ ensembles (bagging and random forests) of regression trees and multi-target regression trees. Our results demonstrate the advantages of a multi-target over a single-target modelling approach. While there is no statistically significant difference between the multi-target and single-target models in terms of model performance, the multi-target models are smaller and faster to learn than the single-target ones. Ensembles of multi-target models, also, improve the spatial prediction of condition. The usefulness of models of vegetation condition is twofold. First, they provide an enhanced knowledge and understanding of the condition of different indigenous vegetation types, and identify possible biophysical and landscape attributes that may contribute to vegetation decline. Second, these models may be used to map the condition of indigenous vegetation, in support of biodiversity planning, management and investment decisions. (c) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:1159 / 1168
页数:10
相关论文
共 37 条
[1]  
Andreasen James K., 2001, Ecological Indicators, V1, P21, DOI 10.1016/S1470-160X(01)00007-3
[2]  
[Anonymous], 2006, BIOCONDITION TERREST
[3]   An empirical comparison of voting classification algorithms: Bagging, boosting, and variants [J].
Bauer, E ;
Kohavi, R .
MACHINE LEARNING, 1999, 36 (1-2) :105-139
[4]  
BLOCKEEL H, 2002, J MACHINE LEARNING R, V3, P621, DOI DOI 10.1162/JMLR.2003.3.4-5.621
[5]  
BLOCKEEL H, 1998, 15 INT C MACH LEARN, P55
[6]   ECOLOGICAL PRINCIPLES AND LAND RECLAMATION PRACTICE [J].
BRADSHAW, AD .
LANDSCAPE PLANNING, 1984, 11 (01) :35-48
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]  
Breiman L., 1984, Classification and regression trees, DOI DOI 10.1201/9781315139470
[10]  
Demsar J, 2006, J MACH LEARN RES, V7, P1